“If I had lots of money... I'd have a body makeover:” 
Managing the Aging Body 


Kathleen F. Slevin, The College of William and Mary 


This article uses a feminist framework to explore embodied aging by analyzing in- 
depth formal interviews with 57 men and women in their 60s, 70s and 80s. Empha- 
sizing intersectionality, I focus on the interpretations and strategies these men and 
women use to make sense of their aging bodies. Their aging corporeal experiences 
allow me to examine ageist notions about aging and being old and to explore how this 
thinking, which valorizes youthfulness, shapes their aging experiences. 


Three questions guide this exploration of embodied aging. First, how does ageism 
affect the ways that these men and women view their aging bodies? Second, what 
strategies do they use to manage their aging bodies? Third, how do various social 
locations influence their views and strategies? 

Exploration of these questions makes visible what is largely invisible: pervasive 
cultural norms that denigrate old age in our society. The burgeoning anti-aging in- 
dustry underscores not only the biomedicalization of old age (Kaufman et al. 2004) 
but also the low status accorded old age. Given the cultural devaluation of old age, 
it is not surprising that people internalize negative notions of aging throughout their 
lives (Hurd 1999; Oberg and Tornstam 2001; Slevin 2006). By acknowledging the 
intersection of ageism and sexism, my analysis highlights some of the unique chal- 
lenges women face as they grow old in a culture that devalues women in general and 
old women in particular. I seek to challenge feminist scholars to incorporate aging 
into their gender analyses while also challenging scholars of aging to incorporate 
gender. Finally, this research adds empirical knowledge to a literature that is largely 
theoretical; it gives analytical space to “the voices that emanate from the bodies 
themselves.” (Nettleton and Watson 1998:2; see also Connell 1995) 


The Body, Aging and Ageism 


The body has become increasingly important to our sense of self in modern soci- 
eties (Turner 1996). Not only have our bodies become evaluative markers, they 
are also judged in ways that define our self worth. Self identities are intimately 
connected to how our bodies appear and perform (Calasanti and Slevin 2001). 


This was the 72nd Presidential Address delivered at the annual meeting of the Southern Sociological 
Society, April 3, 2009 in New Orleans, Louisiana. I thank the Dean of Arts and Sciences and the 
Department of Sociology for support during my presidential year and the following persons for their 
many helpful comments: Toni Calasanti, Kay Jenkins, Neal King, Jennifer Bickham Mendez, Tom 
Linneman, Graham Ousey and Sal Saporito. Direct correspondence to Kathleen F. Slevin, Depart- 
ment of Sociology, P.O. Box 8795, The College of William and Mary, Williamsburg, Virginia 
23187-8795. E-mail: kfslev@gmail.com. 
In Discipline and Punish, Foucault reveals the emergence of a societal discipline 
focused on producing “docile bodies” whether in the army, hospitals, schools or 
other institutions (Foucault 1979). Similarly, Elias (1978) documents the civili- 
zation of the body through the production of various forms of bodily discipline 
such as etiquette and table manners. The performance and discipline of female 
bodies in areas such as make-up, fashion, hair, diet, exercise, and surgical and non- 
surgical cosmetic procedures have also been well documented by feminist scholars 
(e.g., Sontag 1979; Bartky 1990 and 1998; Bordo 1993; Butler 1993; Davis 
1995; Brumberg 1998; Dinnerstein and Weitz 1998; Gagne and McGaughey 
2002). However, most feminist scholarship has focused on younger women’s 
bodies, particularly those of affluent heterosexual white women (Calasanti and 
Slevin 2001; Slevin 2006). The work of Furman (1997), Slevin (2006), and Hurd 
Clarke and Griffin (2007) represents important exceptions that call attention to 
older women’s body work. 

While the body has garnered significant attention in recent decades, scholar- 
ship largely ignores aging bodies (Calasanti and Slevin 2001; Faircloth 2003; 
Katz 2005; Calasanti and King 2005). Cruikshank also reminds us that despite 
the body’s critical role as a marker of age, even social gerontologists have given 
it scant attention in understanding how we experience its aging, except in cases 
of disease and illness (Cruikshank 2003). When researchers do focus on old 
bodies, they have primarily emphasized loss of function; the emphasis has been 
on a narrative of decline (Gullette 1997). Indeed, because commonplace physical 
experiences with aging bodies are ignored, old people experience their bodies in 
an environment of “profound cultural silence.” (Twigg 2000:115) Yet, the story of 
aging is intimately connected with the meanings we ascribe to our aging bodies; 
embodiment is critical to making sense of age and aging (Laz 2003). 

The growing importance of consumer culture also highlights the increasing 
salience of the body. Consumerism touts desirable bodies as young, toned and 
thin; the media convey that to be young and beautiful is to possess the most desir- 
able form of cultural capital. Indeed, Bordo (1993) reminds us that taking care 
of one’s physical appearance has become a moral imperative in Western societies: 
self-control and will power are symbolized in muscular, trim, fit bodies while 
overweight bodies symbolize laziness and lack of control. Furthermore, we have 
witnessed a blurring of age boundaries resulting in an accelerated breakdown in 
the demarcation between mature and youthful bodies: increasingly, consumer 
messages suggest an obligation to discipline bodies through diet and exercise 
throughout the life course (Calasanti and Slevin 2001). Within this increasingly 
medicalized ideology, growing old is seen as a disease that can be remedied if one is 
committed to “aging successfully.” Indeed, such work to maintain an ever-healthy 
and trim body is closely tied to what White, Young and Gillett (1995) see as the 
moral imperative of “healthism.” The widely preached exhortation to accept old 
age with grace is, in fact, no longer supported in practice (Gilleard and Higgs 
2000). The increasing emphasis on healthism especially affects those who lack the 
requisite economic resources or physical abilities to partake of the consumerism 
that surrounds aging (Katz 2005). 

In our consumer culture the body has become central in the framing of age-resist- 
ing practices. Exercising, dieting, cosmetics and cosmetic surgeries are cultural strate- 
gies of action to resist growing old (Gilleard and Higgs 2000). All of these strategies 
emphasize being fit, staying young, looking young or, at least, not looking old. These 
are the linchpins of successful aging. They allow those who engage in these activities 
to be judged “productive” in old age (Oberg and Tornstam 2001; Faircloth 2003). 
Indeed, current standards of successful aging “...stretch the anxieties of middle age 
across the life course.”(Marshall and Katz 2006:77) Furthermore, as Oberg (2003) 
argues, our visually oriented culture abounds with “appearance junkies” and the 
social capital assigned to experience and maturity is seriously diminished. Holstein 
(2006:324) captures the denigration of old age with this observation: “Why would 
anyone want to be old? The word old connotes all that American society does not 
value—slowness, dullness, unattractiveness, inactivity—rather than suggesting some- 
one mellow, glowing with soft patina, confident, accepting...” 

Because youthful appearance has become the coin of the realm in Western 
societies it is not surprising that growing old in our society is increasingly viewed 
negatively and that some old people may come to view old age as a “social con- 
tagion” that compels them to avoid other old people and to seek the company of 
those younger than themselves (Hurd 1999; Slevin 2006). Furthermore, others 
may manage this stigmatized identity by concealing their true age and by adopting 
a strategy of “passing” as younger than their chronological age—a concept used 
by race scholars, and more recently by LBGT scholars, to signal a strategy in 
confronting prejudice and gaining power and status (Collins 2005). But, “passing” 
requires increasing amounts of work as people age. 


“...we must work harder and harder, longer and longer, to 
maintain our body... now free time must be transposed 
into ‘body time.’ We work out, we eat differently, we 
shop for the ‘right’ clothes, and so forth. We maintain 
an ongoing orientation to the body even in our own free 
time. In turn, this ongoing and seemingly endless body 
maintenance takes away what we once thought of as free 
time and has simply replaced it with another form of work.” 
(Faircloth 2003:12) 


All of the above reminds us, as does Cruikshank, that aging is paradoxically “both 
within our control and beyond our control.”(Cruikshank 2003:203) The realities of 
aging may be postponed but not escaped; ultimately, we face the stigma of “spoiled 
identities” which generate fear, animosity and distancing (Goffman 1963:4-6). 
We face these spoiled identities from different social locations and relationships 
to a matrix of domination (Collins 2005). Untangling how these locations shape 
embodied experiences is not easy. Gender plays a critical role in aging experiences; 
both men and women face ageist pressures to maintain youthful bodies but they 
experience this cultural weight in different degrees and forms. As Sontag (1979) 
argues, there is a double standard of aging whereby old men's bodies are viewed 
as distinguished, but old women's bodies are seen as failing to live up to feminine 
ideals—indeed, they are reviled and stigmatized. If older women fail to do the 
body work needed to “age successfully” they will be viewed as moral failures and 
will lose cultural relevance (Holstein 2006). 

Compared to men, women of all ages have historically been defined through 
cultural prisms that present images that are narrower, distorted, trivializing and 
harmful (Milkie 2002). The pervasiveness of narrow images of female beauty 
presented in various media create and sustain gender stratification and inequal- 
ity. One result is that women are more concerned than men about their physical 
appearances (Cash 2000). This is not to say that appearance is irrelevant to men, 
but it is not as important. Men are also socialized to perceive their bodies in terms 
of functionality, and women to see theirs as aesthetic. Accordingly, men and 
women consistently differ in the extent to which they use physical appearance 
to define self worth (Calasanti and Slevin 2001). Not only are women of all ages 
more concerned than men with appearance, they are increasingly unhappy with 
body image, especially weight and shape issues (Cash 2000). In addition to the 
clear gender gap in body image, other social locations are also relevant when we 
examine how culture shapes responses to one’s body and to body image (Calasanti 
and Slevin 2001). Extant literature gives us some ideas of how various locations 
influence body image and body satisfaction. 

Because researchers generally focus on the middle class or affluent, it is difficult to 
know how class shapes responses to the body, or how it intersects with other social 
locations such as age, race/ethnicity or sexual orientation. What little data there are 
suggest that gender and class intersect in ways to ensure that affluent women are 
the most concerned with body image and are most likely to be dissatisfied with their 
bodies (Grogan 1999). Featherstone (1991) argues, however, that there has been a 
democratization of body shape ideals that crosses class. However ubiquitous the 
pressure to conform to hegemonic body ideals, class influences one’s ability to do the 
body work required to acquire and maintain an ideal body, and this is true whether 
one is male or female, black or white, heterosexual or homosexual. Indeed, Fallon 
(1990) argues that the body itself is now a vehicle for conspicuously demonstrating 
class difference. Likewise, Bourdieu (1984) contends that the dominant classes treat 
the body as an aesthetic, an on-going project for improvement while those in the 
lower classes are more likely to treat the body as a functioning instrument. 
Racial differences in responses to the body are also instructive. There is some 
evidence that race and gender intersect in complicated ways. Black men are more 
vested in body image than white men and also the most positive of all groups 
about body image (Huffine 1991). Parker and colleagues (1995) also found that 
black perceptions of beauty are flexible, and that they go beyond physical charac- 
teristics. Accordingly, black women are more likely than white women to assess 
body image in themselves and others in terms that are much less rigid and more 
multi-dimensional (Parker et al. 1995; Schuler et al. 2008). But, the issue of how 
minority women view their bodies is not straightforward because other factors 
create differences within the same racial or ethnic group (Calasanti and Slevin 
2001). Two examples illustrate this point. First, Slevin and Wingrove’s 1998 study 
of older, middle- and upper-middle-class black women found them to be more 
positive about being old than a similar sample of white women. Still, some of 
these privileged black women joined their white counterparts in fretting about 
their weight. Second, a study of Latina and non-Latina women by Lopez et al. 
(1995) underscores the importance of paying attention—especially in the case 
of immigrant women—to subjects’ level of immersion in U.S. culture. Despite 
similar social class backgrounds, Latinas who immigrated to the United States 
after age 16 picked larger silhouettes for their ideal body images than Latinas who 
immigrated prior to age 17. 

Sexual orientation may also influence body perception although there is scant 
empirical research on the topic, and findings are clearer for gay men than for lesbians. 
In addition, little attention has been given to exploring how class interacts with 
sexual orientation to influence body image. New research suggests that physical ap- 
pearance is highly valued among gay men, and that gay culture reinforces youthful 
and hegemonic masculinity even more than heterosexual male culture (Slevin and 
Linneman, forthcoming). Valorization of the young, muscular man in gay culture 
leads gay men to experience “accelerated aging.”(Bennett and Thompson 1991) 
However, empirical data regarding how gay men experience this phenomenon are 
scarce and contradictory (Jones and Pugh 2005). For lesbians the literature is con- 
tradictory and even less clear. Some scholars (e.g., Wolf 1991) argue that lesbians 
downplay the importance of physical attractiveness and have higher levels of body 
satisfaction than heterosexual women. Other researchers find no such differences 
and argue that gender is a more salient predictor of body acceptance and body dis- 
satisfaction than sexual orientation (Brand, Rothblum and Soloman 1992; Slevin 
2006). Furthermore, when we consider race, class and cohort differences homogene- 
ity among lesbians may diminish or disappear (Slevin 2006). 

Oberg (2003) cautions us to beware of projecting an oversimplified, victimizing 
negative image of old women—a point that reinforces the need for empirical data 
that explore the subjective experiences of embodied aging through intersectional- 
ity. Embodied aging is framed by ageist stereotypes of decline, negative images 
of aging bodies and overall loss (Hurd 1999). It is also framed through resistance 
to growing old. Hegemony’s insidiousness lies in the fact that it is internalized 
(Gagne and McGaughey 2002). Perhaps this helps explain the fact that Oberg and 
Tornstam (2001:21) found that, for their aging Swedish respondents, “Keeping 
a youthful look becomes more important with age.” Nonetheless, even when old 
men and women engage in the body management needed to “age successfully,” 
inevitably there will come a time when such practices will consume more and 
more of their time and energy and they will find themselves further and further 
removed from hegemonic standards of what is acceptable. 


Data and Methods 


The 57 old men and women in my purposive sample, ranging in age from 60 to 
89, face the range of health challenges typical of their ages, but I selected partici- 
pants healthy enough to care for themselves and go about daily living in typical 
ways. The interviews were formal and in-depth, lasting 1.5 to 3 hours, and were 
conducted during a two-year period from 2002 to 2004. I selected respondents 
using snowball sampling methods. While not generalizable, my qualitative analy- 
sis validates and contributes important empirical findings to the literature on 
embodied aging and culture. 

Of the 31 women in my sample, 22 are heterosexual and 9 are lesbian. Twenty- 
four of the women are white, five are black and two are Latina. Seven of the 
women received a high school education and 24 had at least a bachelor’s degree. 
Twelve women were in their 60s, 13 in their 70s, and six in their 80s. Of the 26 
men in the sample, 10 are homosexual and 16 heterosexual. Twenty-one men were 
white and five were black. Six of the men in the sample had high school educations 
and 20 had at least a bachelor’s degree. Twelve of the men were in their 60s, 10 
were in their 70s and four were in their 80s. 

The interviews addressed a variety of topics related to aging and the body. 
Respondents were asked what it means to “feel” old, about the experience of 
“looking old,” and how they identify with being an old person. They were asked about 
preventing the aging process, body image issues, their sexuality as they age, their 
notions of masculinity and femininity, and the media’s portrayal of the old. Broad 
questions invited opportunities to “think out loud” and to present stories of aging 
experiences. Interviews were taped and transcribed, and the resulting data form 
the basis for this inquiry. 

Narrative inquiry made sense for a number of reasons. First, because so little 
is known empirically about the topics under investigation, such inquiry is most 
likely to capture “retrospective meaning making—the shaping or ordering of past 
experience.” (Chase 2005:656) Second, it allows the researcher to explore very 
broadly the lived experiences of individuals who are willing to talk about their 
own biographical particulars and it allows for “...the uniqueness of each human 
action and event rather than their common properties.”(Chase 2005:657) As 
well, narrative inquiry provides us the opportunity to seek similarities and dif- 
ferences across narratives on topics that are difficult to explore. It is especially 
suited to recruitment of members of “hard to find” populations, especially those 
who are reluctant to reveal their sexual identities and are often only accessible 
through person-to-person recruitment. Thus, the word-of-mouth support for 
this research was enhanced by the fact that future participants learned from those 
already interviewed that they would have an opportunity to “tell their stories” to 
a sympathetic listener. Finally, narrative inquiry is particularly well-suited for 
exploring and deepening our understanding of various oppressions but especially 
ageism, which (unlike racism, sexism and homophobia) is “more often invoked 
than analyzed.” (Cruikshank 2003:135) 


Responding to Ageism 


The men and women in this study understand that our society values youthful- 
ness and are keenly aware of both the positive cultural capital that accompanies 
youthful images and the negative capital that comes with being old or looking 
old. Tom (61) captures this dominant sentiment: “Everyone wants to be young. 
Young is always ‘in,’ [it is] always cool to be young, to be fashionable.” Marian (81) 
reinforces this awareness when she says: “...everything is geared towards being 
young and beautiful, thin and stylish.” Others are critical of the media’s role in 
promoting negative notions of being old. As Patricia (75) puts it: “Well, that’s all 
you see on TV. It’s a youth-oriented society and you feel like—and they are always 
advertising all these products to make sure you get rid of your wrinkles—like they 
are a big disgrace...” 

Their narratives shed light on the complicated and often contradictory ways 
that they negotiate what it means to be old and to inhabit aging bodies in con- 
temporary society. For example, many respondents express satisfaction with their 
lives in retirement and with their lives as old people, but issues of appearance are 
a different matter; they do not see the old body in a positive light. Almost all of 
my respondents speak about old bodies in ways that demonstrate that they see 
the aging body as ugly. When asked what it means when people say they look old, 
most interviewees list negative physical traits that are commonly associated with 
aging: wrinkles, hunching and varicose veins. It is hardly surprising then that some 
are very explicit in their negative self-assessments: 


“I don’t like looking at myself [now]... I looked a lot better 
when I was younger.” (Monica, 74) 

“People my age don't like the way they look.” (Leon, 77) 

“I know I look uglier today [laughs]...” (Thad, 80) 


How do these old men and women deal with cultural messages that denigrate 
aging and old age? I articulate several strategies below that emerge from respon- 
dents’ narratives. Common to all of these strategies is an approach mentioned by 
Minichiello, Browne and Kendig (2000:274): “...older people negotiate with age- 
ist stereotypes by trying to create an image of themselves as an ageing person who 
differs markedly from the stereotypical image.” As this study reveals in multiple 
ways, my interviewees strive actively to create images of themselves that suggest 
that they are in control of their aging and that they have not “let themselves go.” 
Resisting old age is seen not only as a viable strategy for how to age but also as a 
desirable one. 


Passing or Keeping up Appearances 


Calling on a distinctly masculine metaphor Mark (64) is explicit about the need 
to engage in body maintenance work to ward off aging: “The [body] is much like 
a car—if you don’t oil it and grease it and gas it and maintain it, it’s going to fall 
apart.” While the voices of the women and men with whom I spoke are varied, 
there is a common theme of commitment and struggle to maintain acceptable 
(youthful) appearances, to conceal as best they can the signs of old age, to keep 
up youthful appearances. Many talk about aging in language that distances them 
from old age. Peter (62), for example, says: “I do not feel old, so when I hear the word 
‘old’ I can’t relate to it.” Michelle (78) describes her approach to aging with these 
words: “I’ve always lived younger than I am.” Part of not relating to the notion of 
being old may have something to do with the fact that some see growing old and 
looking old as negotiable: “[Age] is definitely a state of mind because you can do 
something about it. You can improve on it.” (Marian, 81) 

Passing as younger than one’s real age is a strategy sometimes used by inter- 
viewees to respond to the ageism they encounter in their daily lives. That such a 
strategy promotes the positive capital that comes with being seen as younger is 
deftly captured by Maggie (82) who talks about why, in many circumstances, she 
does not reveal her age, “...my feeling is that if they knew my age they would treat 
me different, you know, they would think of me differently.” Gary (64), one of the 
gay interviewees, tells a similar story. When asked if he is comfortable telling his 
age, he responds: “It depends on the people... if I am trying to chat up some man 
I have to figure out: ‘Now, if I tell... “the truth” is he going to find out that I’m too 
old?’” Gary’s story is illustrative on several fronts because it demonstrates not only 
his own internalization of ageist notions but also the stigma and disadvantage even 
privileged older gay men experience when they seek to find partners—especially 
younger men—for sexual pleasure. 

The body work needed to not look old, to pass as younger or more youth- 
ful, requires self-surveillance; it requires that one monitor one’s body constantly. 
Gemma (82) reinforces the agelessness and hard work of this struggle: “You've 
just got to work at it and work hard.” Angela at 89 is the oldest interviewee, and 
her words remind us that the struggle is work and it knows no age limit: “I try 
to keep attractive. I work on that. I’m conscious of what’s in style and very, very 
up-to-the-minute...” Harvey (62) corroborates this approach when asked whether 
or not appearance is important to him: “Yes, definitely. I want to put my best 
foot forward.” Putting one’s best foot forward requires paying attention to being 
youthful and the picture that emerges from these narratives underscores the extent 
to which both women and men are steeped in ageist notions of what it means to 
be old and to look old. Positive attitudes exist side by side with oppressive habits 
and attitudes toward aging. For example, Alan (75) exhibits an ambivalence that is 
fairly typical. On one hand he talks about how he accepts the fact “that people age 
and they change.” On the other hand, his own personal appearance is a different 
matter and his desire to look youthful leads to an unusually frank admission that 
he has recently devoted considerable time to figuring out how best to whiten his 
teeth in order to make his smile more youthful. Faith (77) adds: “I just want to 
look nice. I don’t want to look bad.” Faith’s last comment reveals the stigma of 
old age and the dread that accompanies the perception that one has not done the 
necessary body work to resist looking old. 

Beckie (69) conveys a similar drive to monitor and to correct her body as it 
changes with age. When discussing wrinkles and the need to have a flat stomach, we 
hear her anticipate the continuing work needed to not look old: “I’m going to do 
what I can do within my power to camouflage them in some way or to try to correct 
them in any way I can.” Using camouflage as a strategy to conceal a body that is less 
than youthful in its shape is also revealed by Marian (81) who extols the virtues of 
spandex and “good” bras as a way to regain a more youthful body: “...so if I've got 
a bulge here and the bust line is low, I just get a better bra and pull up the straps.” 

My interviewees understand clothes are a clue to age, and they pay attention 
to contemporary fashion in an attempt to maintain youthful images. Alan (75) 
says of his attention to dressing youthfully: “See, I dress kind of sharply... it’s not 
that I’m blaring, but I appreciate color and I like to be stylish—not overly but not 
fuddy duddy.” Maggie (82) also articulates the special need to be in vogue as a way 
to fight looking old: “I think that as you get older, you want to look good when 
you go out, and you want to be fashionable.” Two additional quotes convey the 
importance of not wearing clothes that suggest that one is old and that one has not 
paid attention to current fashion: “Well, I don’t dress like an old man,” says Wayne 
(80). And William (75) agrees, “I don’t go around wearing old folks clothes.” 


Disciplining Old Bodies: Diet and Exercise 


Another way to resist the ageist stereotypes of old people as sedentary and lazy is to 
negotiate an image of one’s self as being physically fit, active and eating a healthy 
diet (Minichiello, Browne and Kendig 2000). Disciplining the body in order 
to meet youthful standards of attractiveness thus emerges as a response to ageist 
notions of what acceptable female and male bodies look like. It is also an active 
strategy that reinforces and reproduces ageist standards by which to judge old 
bodies. Thus, it reinforces notions that old bodies are unhealthy. By disciplining 
one’s body, one distances one’s self from old people and resists being seen as old. 
Controlling weight also emerged as a dominant theme. Respondents generally 
emphasize health rather than youthful appearance as the motivator for staying 
active (Hurd 1999). Women were especially bothered by weight issues, with many 
women articulating a life-long struggle with weight and body image. Violet (82) 
confides, “I’ve been through the diet syndrome all my life.” Even women at their 
ideal weights or only a few pounds over that weight talk about the ways that they 
deprive themselves of food. Margaret (68), a lifetime tennis player who at 5 feet 
4 inches weighs 115 pounds, claims, “I’ve done every diet there is in the world.” 
When asked if she was currently dieting, she responds with a refrain that is quite 
typical, “Yeah...my dieting is really abstinence.” When asked what she likes most 
about her body currently, one of the few women who has never dieted, Irene (77), 
exposes the general stigma nowadays of being overweight by responding, “Well, 
that I’m not fat, I guess.” 

Most men also voice concern about their current weight and they talk about 
various strategies for disciplining their bodies in order not only to control weight 
but “to keep... fit and trim,” as Harvey (62) states. Again, even men who are close 
to their ideal weights do not feel immune from the obligation to maintain fit and 
slim bodies. Thad (80), who describes himself as only 2 pounds overweight at the 
time of the interview, talks about how he will have returned to his ideal weight 
by the end of the week. “Anytime I start to put on weight, then I diet,” he says. 
“Every morning I drink a can of Slimfast and then for lunch I'll eat a Slimfast bar, 
bananas and fruits.” 

In addition to dieting, many of the women and men interviewed engage in some 
form of physical exercise or activity—illustrating the increasingly ageless obligation 
to maintain fit and trim bodies. Sometimes exercise takes the form of regular walks 
or running, but for others—especially those who were financially privileged—gym 
memberships provide a venue for regular exercise. Margaret (68), a lifelong exerciser, 
falls into the latter category. When asked about what she does at the gym, she said: 
“I do the bike, the treadmill and weights.” Even Violet at 82 does 40 minutes of 
exercise every day. She explains her regime in detail: “I have a routine that addresses 
every set of muscles in the body...” Violet goes on to extol the virtues of her routine 
and she is very happy when she confides that, “I like the fact that I’m losing my 
tummy... I think I’m getting a flat tummy [by] weight lifting.” Violet's pleasure with 
her flatter stomach speaks volumes about her desire to capture a younger, toned body 
and it reminds us once again of the devaluation of the old body and the intense body 
work that accompanies strategies to avoid having one. 

Almost half of the men interviewed were regular exercisers and their regimes 
took the form of running, walking, swimming and workouts in the gym. For a few, 
golf and sailing are mentioned as forms of exercise—illustrating the importance of 
financial means to keeping active. Still, those who exercise regularly often convey 
that it is not enough. Even the narratives of the two men who consider themselves 
to be the most overweight (75 and 80 pounds respectively) and who claim to be 
relatively unbothered by their appearance, tell an ambivalent story. Landon (71) is
initially quite cavalier about his large body and exclaims, “Look at my belly and say
‘That's a beauty.’” Later in the interview, he admits that he does not like his large
belly at all, and that he is quite defeated in attempts to manage and to discipline 
his body through either diet or exercise. 


Cosmetic Surgery 


Cosmetic surgery represents another anti-aging strategy, a way to fight aging and 
resist looking old. Many procedures not only reinforce the dominant and hege- 
monic standards of youthfulness but they are marketed to suggest that women 
and men can alter their bodies (most especially faces) to pass as younger, as more 
youthful than they really are. Such surgery illuminates the strategies of ongoing 
appearance management that can be called upon in the face of “stigmatizing or 
discrediting social labels.” (Freitas et al. 1997:324) 

Most women and men in this study assumed a fairly laissez-faire attitude to- 
ward cosmetic surgery. Four women admitted they had cosmetic surgery (three 
had face lifts, and one had had breast reduction surgery), and 12 women (six 
heterosexual and six lesbian) and seven men (three heterosexual and four gay men) 
in the study said they would consider or were planning on having (more) cosmetic 
surgery in the future. Few shared the outright negative attitude expressed by Penny 
(70) who responded to the query about whether or not she would consider such 
surgery with these words: “I think it is just ridiculous.” 

Responses from women who had, or were considering, cosmetic surgery dem- 
onstrate that the stories are complex. Michelle (78) had a face lift when she was 
61 years old. She reveals that she was motivated to have the surgery because she 
planned to attend a high school reunion and she wanted “...to make the effort to 
look younger.” To look younger than her chronological age is also a key motivator 
for Miriam (61) for whom passing as younger is very important. When asked 
about her face lift she responded: “Oh, God, yes it makes me feel better. I look at 
pictures of myself before that, and I feel that I look younger now than I looked 
when I was 50.” Indeed, Miriam also is currently open to the idea of more cosmetic 
intervention in her continuing drive to recapture a more youthful face: “The other 
day I found myself thinking about laser resurfacing [laughs] for all the little lines 
around my mouth.” Miriam’s approach underscores the relentlessness of some 
women’s drive to retard aging. 

Most of the women who would consider cosmetic surgery—especially face
lifts—share Beckie’s (69) philosophy that the old body is a failed body, and that 
cosmetic surgery is a very viable way to resist aging. Not surprisingly when asked 
if she would consider such surgery, Beckie claims excitedly, “Yes, I'd do it in a 
minute. Yes, I'd spend every dime I had to do it if I could improve myself in 
any way.” But, for Beckie this is strictly theoretical because her health will not 
allow such surgery; she is frustrated and sad that she is denied the opportunity 
to surgically fight her aging face. Lack of resources also denies other women that
opportunity. Both Monica (74), a retired hairdresser, and Mimi (77), a retired 
waitress, talk about how they would love to have cosmetic surgery if money were 
no object. “If I had the money... I would have my whole face done. I would do it 
just to see how good I could look,” Mimi says. After further discussion about the 
possible mistakes that could be made in the surgery, Mimi responds, “It couldn't 
be worse than what it is,” revealing that the altered old face is preferable at any 
cost to the unaltered face. 

Most men interviewed would not consider cosmetic surgery for themselves, 
but were not judgmental of others. Of the three heterosexual men who would 
consider surgery, one had already had his eyes altered and was considering hav- 
ing more work done on his face. Jacob (73) also considered surgery on his eyes, 
but doubted that he would follow through: “I keep saying I’m going to get the 
bags done, but I never will.” That the aspiration for such surgery
is not limited to men of privilege is illustrated by the desire of Harvey (62), the
retired auto mechanic, to have a tummy tuck. While Harvey claims that, “The
most objectionable thing about myself is my midriff bulge,” it is also the case that
he lacks the financial resources to ensure a more svelte and youthful appearance
through surgical intervention. 

The narratives of the gay men in the study underline the fact that responses to 
aging bodies are diverse and complicated. The fact that they are privileged by race 
and class provides them resources that most likely would be unavailable to men of 
color or those who are in lower social classes. For example, Peter (62), who freely 
admits that he dyes his hair, is also adamant that he would not consider cosmetic 
surgery, “Never in a thousand years!” Raymond (67) who would also not consider 
cosmetic surgery is very critical of the gay culture and its treatment of aging gay 
men. He is sure that gay men who get cosmetic surgery are chasing youthful looks 
and trying to increase their likelihood of finding sexual partners. “They all do it 
because they want to look younger...,” he says. “It’s related to sex.” 

The gay men who would consider cosmetic surgery remind us that some assume
that technological body enhancing and age resisting practices will help them either 
maintain their current level of acceptability or even increase it. It is also worth 
noting that they are currently without partners and feel pressure to maintain 
acceptable (youthful) appearances in order to seek companionship. The title of 
this article is a quote from John (64) who claims that, if money were no object, 
he would have a body makeover (liposuction in several body areas, facelift, eyelid 
surgery). John, whose partner of 22 years recently died, lives an affluent lifestyle 
that suggests that money is not in short supply. Alone for the first time in more 
than two decades, he feels pressure to make himself look as young as possible in 
order to be attractive. Gary (64), who lives in a gay, mostly single world, is a heavy- 
set man who has recently joined “The Bears,” a gay organization that celebrates 
and supports older gay men who do not have young, svelte, hairless bodies. While 
Gary claims that “...age doesn’t bother me that much,” he is also emphatic that,
“Next time I go to Brazil I’m going to have some work done.” 


Intersectionality: The Importance of Social Locations


A core focus of my analysis throughout this article centers on gendered ageism 
and allows us to see how the intersections of sexism and ageism shape the aging 
experiences of old women. Thus, while men and women both fear getting old, 
women have a special fear of looking old. Because growing old is accompanied 
by the inevitable failure to maintain youthful attractiveness, some part of het- 
erosexual women's fear is tied to losing male approval—to no longer being the 
object of desire that accompanies “the male gaze.” Among my sample of women 
only two felt that same-age men saw their female peers as being as attractive as 
younger women. Almost every woman I interviewed argued that aging makes 
women less desirable to men. Indeed, many invoked the “double standard” that 
disadvantages older women while advantaging older men by allowing them to be 
seen as attractive and distinguished as they age. Furthermore, most heterosexual 
women mention “no attention from men” as a factor that makes them feel that
they look old. Sara (65), for example, agrees that for women of all ages there 
persists an ideal of youthful female attractiveness, and she specifically ties aging 
to loss of this attractiveness in women. Indeed, she argues that a woman who is 
old has allowed “...[her]self to stray outside parameters of what is acceptable.”
Such an assessment not only underscores the ways that ageism shapes how older 
women are viewed, it highlights their self-denigration. Lastly, it also reveals what 
many interviewees identify: the obligation to resist looking old—one that is more 
demanding of women than men. 

Not all women view their aging bodies through the same lens. When we focus 
on sexual orientation we learn that lesbians are not uniformly negative about aging 
or looking old. Beth (67), a lifetime lesbian who consciously fights heterosexual 
ideals of women and especially old women’s embodiment, claims that “physical 
appearance is not as big an issue in the lesbian world as it is in the heterosexual 
world.” Barbara (68) provides an additional argument that reinforces this no- 
tion: “I don’t think lesbians require of each other as much as men require of their 
women.” Finally, Eliza (71) claims to be quite conscious of how being lesbian 
frees her from unrealistic expectations about maintaining a youthful appearance: 

“lesbians worry less about aging, about how they look...” Maybe it is this perceived 
freedom to worry less than heterosexual women about looking old that allows 
Eliza to declare enthusiastically: “I love my gray hair.” 

Complicating the picture further, my data uncover that the lesbian interviewees 
are not immune to ageist attitudes—a discovery that lends credence to Copper’s 
argument that lesbians share with heterosexual women a traditional female so- 
cialization that is ageist (Copper 1988). This socialization is one that encourages 
and invites women to engage in a variety of ageist and sexist coping strategies 
that include distancing themselves from old age by “age passing,” and distancing
themselves from others whom they see as old. Thus, ageism is both internal (self- 
directed) and external (other-directed) in its manifestations. That such ageism 
(particularly woman-to-woman ageism) manifests itself in the lives of these lesbi- 
ans who are committed to gender equality, and who are likely to be more aware 
of patriarchal oppressiveness than heterosexual women, reminds us that ageism 
is not exclusively the domain of the young or middle-aged, and it is not simply a 
product of male dominance (Slevin 2006). 

Race adds an additional layer of nuance. None of the black men interviewed 
expressed an interest in cosmetic surgery—despite the fact that several were fi- 
nancially very comfortable and could afford such procedures. In this regard, John 
(82), a retired physician, captured a sentiment toward aging that is common 
among the black men in this study: “Accept who you are and do the best you can 
with the hand you've been dealt.” Martha (82) argues that black women are more 
accepting of the physical signs of aging than white women. She explains: “This 
is the way it’s meant to be, so I don’t know that we worry about it, you know.” 
Attempting to explain this racial difference, Alice (89) tells us that black women 
have, “had more problems to deal with, and this [aging] is almost immaterial.”
Faith (77) believes that blacks call on humor as a coping skill: “...we [blacks] 
have one great talent that is not shared with white people [in] that we can laugh 
at ourselves, and we make a lot of jokes about getting old.” Yet, this humor, 
despite its usefulness as a coping mechanism, also underscores self-denigrated 
ageism. Finally, Beverly (78) interjects social class into her explanation of racial 
differences in approaches to aging, and specifically to the body work needed to 
resist looking old: “Well, blacks can’t afford to go to spas and all the beauty places. 
Most blacks can’t go to health clubs—those things that help you stay young.” But 
similar class disadvantages also have an impact on white women as we hear from 
Mimi (75), who expresses regret about not being able to fight her aging body by 
getting a face lift or by going to a gym. 

The gay men interviewees more closely resemble heterosexual women when it
comes to appearance anxieties. They are also much more aware of the advantages 
and disadvantages of their sexual orientation than of their race and class privilege. 
Gay culture places a heavy emphasis on youthful bodies. It reinforces youthful 
and hegemonic masculinity, and old gay men, even if unwittingly, adhere to 
such notions. Several respondents say gay men age better than heterosexual men 
because they pay more attention to keeping youthful bodies through exercise and 
discipline. Gay men are also more likely than the heterosexual men I interviewed 
to favor cosmetic intervention. Responses vary as to why this might be the case. 
Eric (60) makes a general claim that “...appearance is more of a concern with 
gay men.” Like women, gay men are heavily judged by their appearances. Guy 
(85) claims that older gay men feel pressured to look youthful because “gays are 
more ageist than straights.” Raymond (67) also weighs in with these words: “... 
old gay men feel compelled to do everything they can... They dye their hair, they
have facelifts, they wear clothing that they think makes them look younger... 
they do a lot of exercise.” 


Conclusions 


Ageism is a complex phenomenon. The stories of my interviewees demonstrate 
what it is like to grow old and to look old in a culture that valorizes youth and
stigmatizes old age. While the intensity varies, respondents are motivated to fight 
aging, to resist being seen as old, and to develop strategies to manage their old 
bodies so they appear more youthful. Resistance to old age is not surprising. It 
makes sense. It is a way of fighting invisibility, of resisting exclusion, of trying to 
maintain positive cultural capital. 

The qualitative nature of this study deepens our understanding of inequality 
and oppression. It allows us to see how old people make sense of their aging, how 
their resistance to aging becomes a struggle, and how in these struggles they repro- 
duce the very ageism that set them on the path in the first place. The strategies of 
accommodation and complicity are full of tensions. Resisting looking old makes 
one a “good” and “moral” person, but highlights the self-imposed nature of this 
work and engages the oppressed within. Furthermore, these men and women are 
fighting “a losing battle with the ravages of time,” (Blaikie 1999:86) as the work to 
conceal old age becomes more and more labor intensive. As Holstein (1996:318) 
reminds us, aging bodies mean that, “Instead of a habitual body that we might 
ignore, we gain an attention-demanding body.” 

This article also examines the relationships between age and other forms of 
inequality, illustrating how ageism interacts with other forms of discrimination 
such as sexism and heterosexism and revealing how ageism is both similar and 
different across various groups (Calasanti and Slevin 2001). At the same time, the 
data suggest how age relations differ from other forms of inequality because age 
is a status into which we pass, not one into which we are born. Accordingly, the 
oppression that accompanies aging is fluid and constructed. Furthermore, while 
a desire for inclusion—for not being invisible—motivates many of these men and
women to manage their aging bodies, social locations also powerfully shape the 
extent to which they can avoid exclusion. 

This qualitative study validates existing research and begs serious engagement in 
intersectionality. Social locations are salient shapers of aging experiences. Gender, 
for example, is prominent in its ability to advantage men and disadvantage women. 
Heterosexual notions of youthful, thin bodies represent a valued currency that is 
more important for women than men. Still, data suggest that the gap between 
women and men on the issue of body dissatisfaction appears to be narrowing; 
further study is needed to confirm this and how it might be complicated by 
intersecting social locations. When we focus on gender and sexual orientation, we 
uncover how their intersection shapes issues of inclusion and exclusion. For example,
we see how the narratives of older heterosexual women and gay men share
a common bond in that they both seek the male gaze and both are motivated to 
maintain youthful bodies. My attention to intersectionality suggests possibilities 
for further research — especially among hard-to-find populations such as older gays 
and lesbians. Class adds a nuanced layer that highlights how economically disad- 
vantaged women and men do not have the financial means to access resources such 
as exercise and cosmetic surgery that allow others to pass as younger. While in
some ways all respondents are engaged in body management and body work that 
emphasize the desirability of youthful bodies, it is also evident that race/ethnicity 
shapes these experiences. A final possibility for further research involves the expansion
of this study to middle-aged respondents to explore how their aging experiences
are similar to and different from those of the older generations that make up this study.


Note 


1. Throughout this article, I use pseudonyms to protect the identities of the respondents. 


Behavior, Expectations and Status


Murray Webster, Jr., University of North Carolina, Charlotte
Lisa Slattery Rashotte, University of North Carolina, Charlotte


We predict effects of behavior patterns and status on performance expectations and 
group inequality using an integrated theory developed by Fisek, Berger and Nor- 
man (1991). We next test those predictions using new experimental techniques we 
developed to control behavior patterns as independent variables. In a 10-condition 
experiment, predictions accounted for about 72 percent of variance in the data, with 
closer fit for women than for men. The gender difference may be related to features 
of the experiment, especially to the experimental design that counters cultural gender 
prescriptions in some conditions. We suggest ways to improve the experiment by 
more precisely separating behavior from inferred performance competence in later 
research. Applications include using behavior to reduce undesirable effects of status 
generalization. 








Status characteristics, culturally defined markers treated as indicators of social 
position and worth, include gender, skin color, age and others. A growing body 
of theory and research on status and expectation processes describes how status 
advantages and disadvantages can be imported from the larger society to create 
power and prestige differences in small problem solving groups such as commit- 
tees, juries and teams. In brief, when status characteristics become salient they 
create perceptions of task competence (“performance expectations”) for actors in 
task-focused team situations. Performance expectations then determine aspects of 
the group's behavioral and structural inequality, its “power and prestige hierarchy.” 
Thus, a structural feature of the larger society—the characteristics that society 
invests with status significance—is brought into task groups and reproduces the 
society's status hierarchies in the small group’s structure of interpersonal inequality. 
Wagner and Berger (2002) and Berger and Webster (2006) offer recent summaries 
of the program. 

Here we are interested in how certain kinds of behavior can modify effects of 
status characteristics on inequality structures in small groups; in particular, we 
wish to study how accepting or rejecting influence can alter effects of gender dif- 
ferences in problem-solving situations. When status generalization occurs, status 
characteristics confer advantages and disadvantages in small task-focused groups
such as work teams and juries. For example, individuals having high status in work 
groups, including psychiatrists in medical settings (Caudill 1958; Lichtenstein et al. 
2004) and managers in technology industries (Cohen and Zhou 1991) participate 
more and exert greater influence over group outcomes than those of lower status 
(nurses, engineers). In other settings, men, who typically have a status advantage 
over women, tend to be more directive and influential (Gerber 2001; Ridgeway and 
Correll 2004; Ridgeway and Smith-Lovin 1999), while women are less directive and 
exert less influence over other individuals and group decisions. Accompanying those 
behaviors are cultural beliefs regarding specific and general competencies. 

We might expect that behavior or manner of expression would also affect in- 
equality such as participation and influence in ways similar to how status affects 
it. How one says something and how one behaves in the presence of co-workers 
might be expected to affect others’ inferences about task competence and how 
they act towards someone.’ 

At least since Erving Goffman (1959) noted the importance of behavioral 
cues “given off” during interaction, sociologists have recognized that impression
formation relies on what interactants do as well as what they say. In task situations 
where individuals must assess each other's likelihoods of contributing to success, 
they may rely on overt claims (e.g., “I know how to do this”), observable behavior 
(participating readily while leaning in and holding direct eye contact), or, most 
often, a combination of overt claims and behavior. 

George Herbert Mead (1934) discussed the conversation of gestures and sug- 
gested that individual differences in the ability to read cues and see things from 
another's perspective might be a general factor in what was then thought of as 
intelligence. A related view is available recently in the growing literature on emo- 
tional intelligence (Mayer, Roberts and Barsade 2008), popularized in Goleman’s 
(1995) book with that title.* Visual cues such as facial dominance (Mueller and 
Mazur 1996) have been shown to have powerful, persisting effects on military 
career success. Dominance and other behaviors influencing perceived competence 
and behavioral inequality may have biological as well as social components (Mazur 
2005; Booth et al. 2006). The anthropologist Adam Kendon (1995) describes 
many cross-cultural uses of gestures accompanying speech to modify the meanings 
communicated, seeing gesture as a language having its own semiotics. 

The volume and range of scholarly studies about effects of behavior demon- 
strate its significance in many areas of scholarly study. With that background, our 
goal here is a modest one: to begin to understand how behavior affects the status 
processes with which we are most familiar. We do that with a theoretically guided 
experimental study of how gender status and one type of behavior affect perceived 
competence and subsequent interaction inequality. 

While much previous research documents the importance of many behaviors 
for perceived competence and behavioral inequality—two factors that are crucial 
in study of status processes—the literature is not organized around any consistent
theoretical foundation. Such theoretical foundation is needed to tell which be- 
haviors are significant, and conditions under which they will or will not function. 
Additionally, with solid theoretical grounding we would also wish to know how 
behavior functions by itself and how and when it combines with other factors to 
affect status processes. Our approach here is to take an integrated theory by Fisek 
et al. (1991) that combines behavior effects with other status processes and makes 
precise predictions that may be tested. That theory has not been independently 
tested (although it was shown to be consistent with previous data from discussion 
groups). This may be due to the fact that researchers did not have a good way to 
control behavior so it could be used as an independent variable. 

The Fisek et al. (1991) theory formally defines a new concept, “behavior inter- 
change pattern,” that allows those authors to incorporate certain kinds of behavior 
within other theories of status processes. There are several advantages in using the 
Fisek et al. theory. Foremost among them, we can go beyond the widely-noted 
fact that behavior is important to specify just how much effect behavior has, and 
how behavioral effects interact with status inequalities. At the same time, we limit 
attention here to the kinds of behavior patterns formally defined by that theory 
and to the scope limitations of theories of status processes in general. We describe 
an experiment that creates behavior patterns and status relations, and measures 
their single and joint effects on performance expectations and subsequent behavior 
in face-to-face, task-focused interaction. Theoretically, this constitutes the first 
independent test of the Fisek et al. theory. Empirically, we present a new technique 
to create and control behavior patterns (without explicitly telling people how we 
want them to act), so that behavior can be used as an independent variable in 
these and other experiments. 


Status Generalization Theories 


Theories of status characteristics and performance expectations apply within cer- 
tain scope conditions, of which task focus and collective orientation are most im- 
portant. Task focus means that group members’ primary goal is to solve problems, 
rather than, for example, to enjoy the interaction process itself. Members of a 
committee or a work group are task focused, while guests attending a cocktail 
party ordinarily are not. Collective orientation means it is normative to take every 
person’s contributions into account rather than working independently. Collective 
orientation is appropriate in a committee and is required on a jury; individual 
orientation would be appropriate for students taking an exam or for matters of 
taste where there is no objectively correct answer. 

Figure 1. Schematic Representation of Creating Face to Face Inequality

Definition 1 in Appendix 1 is diffuse status characteristic. Diffuse status characteristics,
Ds, (1) have at least two states that are differentially evaluated in the
culture, D+ and D-; (2) each state carries specific performance expectations having
the same positive or negative evaluation as the state of D; and (3) each state carries
general performance expectations having the same positive or negative evaluation
as the state of D. If it is considered favorable or fortunate to be male rather than 
female in a particular society, that meets part 1 of the definition. In addition, if 
men are thought to be better at changing a tire than women, that meets part 2. 
If people treat men as more logical, rational, better at mathematical problems, 
stronger, etc., than women, that meets part 3. The “etc.” part of the definition
means that the limits of status generalization from a D are unspecified.’ 

Fisek et al. (1991:117) defined a new concept, “behavior interchange patterns,” 
or behavior patterns in short, which may be positive or negative (b+, b-). For 
example, the behaviors of acting assertively and exerting influence during group 
interaction have “clear power and prestige significance,” meeting the first part 
of the definition. The definition also requires that others accept the behavior. If 
others feel someone is influencing a group inappropriately, they are likely to use 
sanctions to suppress him or her in order to avoid wasting time. Behavior patterns 
must have power and prestige significance and be accepted by others in the group. 
This is shown as definition 2 in Appendix 1.

Next, Fisek et al. posit that a sequence of behaviors may induce a second new
element called a “status typification.” That is, behaviors may be treated as instances
of more general roles, such as “leader-follower” or “performer-reactor.” Once typifications
(B+ or B-) exist, they are treated as relevant to abstract problem solving
ability (Y+ or Y-), then to successful or unsuccessful task completion (T+ or T-).
This is definition 3 in Appendix 1.
Fisek et al. conceive behavior patterns as status elements, just as Ds are, and
they modify the theory’s core assumptions to include them. Thus, Assumption 1 
states that “All status elements (rather than “all status characteristics”) that differ- 
entiate actors or that actors believe are relevant to task completion...” Replacing 
“status characteristics” in previous versions of the theory with “status elements,” 
along with the explicit definitions of behavior patterns and typification states, 
constitute the integrated theory to incorporate effects of behavior patterns on 
performance expectations and group hierarchies. 

Initial conditions of situations to which the theory applies include at least two
actors, Person and Other (p and o), any number of status characteristics having
positively and negatively evaluated states (D+, D-), a task with outcome states “success”
and “failure” (T+ and T-), and a task-relevant characteristic C* with state C*+
relevant to T+ and C*- relevant to T-. In addition, there may be a behavior pattern
(b), also having either positive (b+) or negative (b-) significance.

This accounts for cases in which individuals “import” structural status charac- 
teristics and behavior processes to a face-to-face, problem-solving group. Status 
advantages and disadvantages determined by society, and behavior patterns in- 
dividuals display affect interaction inequality through the intervening construct 
“performance expectation states.” Higher performance expectations are formed 
for actors whose status characteristics advantage them in the external society and 
for those whose behavior carries positive power and prestige significance. Those 
expectations translate to interaction advantages in the small group hierarchy.‘ 

Presume these definitions, scope conditions and initial conditions, then turn 
to the theory of status generalization, which contains five assumptions (shown in 
Appendix 1). Assumptions 1-4 constitute a precise specification of the mechanism 
represented by the lower left arrow in Figure 1: how status elements such as status 
characteristics and behavior patterns come to affect performance expectations, and 
precisely how much effect they will have. 

In other words, the theory predicts that status inequality and behavioral in- 
equality can create differential performance expectations. The pattern of perfor- 
mance expectations is what creates the small group power and prestige structure. 
Performance expectations are a theoretical construct, roughly equivalent to task- 
specific conceptions of skill. Precisely calculating expectations associated with 
actors from the status characteristics they possess and their behavior patterns is 
the first step in predicting behavior, and we will use the theoretical model in as- 
sumption 4 to do that. 

Assumption 5 claims that performance expectations translate into a small 
group's power and prestige hierarchy. That is, components of power and prestige 
inequality are direct functions of the relative expectations that have come to be 
associated with those actors. This is the right arrow in Figure 1. Precisely convert- 
ing expectations to measurable behavior is the second step in making testable 
predictions, and below we show an empirical model for doing that. 

An alternate view of the theory in Appendix 1 is a graph-theoretic model first
presented by Berger et al. (1977; also in Webster and Driskell 1978). Scope
conditions, initial conditions and defined concepts remain the same. Actors and status
elements become points, and relations become signed lines. Relations among ac- 
tors and status elements are of three types: possession, relevance and dimensionality. 
Possession links actors and status elements. For example, we might say that a man 
(in our culture and in most other cultures) possesses the positively evaluated state 
of the status characteristic gender. In other words, actor p possesses D+. Relevance 
links status elements. We might say that male (D+) is seen as relevant to high 
performance expectations (C*+), meaning that people expect performance success 
(T+) from men and corresponding lack of success from women. Dimensionality 
links oppositely valued states of a possessed characteristic when possessing one 
state precludes possessing the other state, as possessing “male” precludes possess- 
ing “female” (at least at the same time). In a graph, possession and relevance are 
positively signed, and dimensionality is negatively signed. 

Figure 2 shows a status diagram containing one status characteristic and one 
behavior pattern, as treated in the Fisek et al. theory. 

In Figure 2, p has a status advantage (p possesses D+ and o possesses D-); e.g.,
p is male and o is female. In addition, p displays the high state of a behavior
interchange pattern (p possesses b+); e.g., p exerts influence and o accepts influence.
Because D and b differentiate p and o, they both become salient by Assumption
1 of the theory. If D did not differentiate p and o (for example, in an all-female
group) and if there were no pre-existing beliefs linking D to the task, it would
not be salient and so it would not appear in the graph. Finally, the negative link
dimensionality connects the oppositely valued states of D and b.

By Assumption 2, D induces a general expectation state Γ, the idea of doing
well at tasks in general. This represents, for example, a cultural view that “men can
do most things better than women.” Status generalization then links Γ+ to C*+
and Γ- to C*-. In words, “One of the things men do better than women is this
group's task.” D is relevant to Γ, and Γ is relevant to C*.

The behavior pattern b also becomes salient because p and o possess differently
evaluated states. Next, b induces B, the status typification, perhaps captured by
“If p is influential this time, perhaps p is the type of person who usually dominates
interaction.” Theoretically, b is seen as relevant to B. Next, B is seen as relevant to
Y, abstract task ability, and Y is seen as relevant to T, the particular task at hand.

Thus generalization proceeds through behavior patterns in ways comparable 
to, though distinct from, generalization on the basis of status characteristics. Both 
status and behavior provide salient information for actors forming performance
expectations, and so both affect expectations and inequality in predictable ways.
In the case shown in Figure 2, the states of status and behavior are consistent (they
both favor p and disadvantage o), and the theory predicts they both increase p's
expectation advantage over o.”

The graph model represents the same ideas as Appendix 1, offering a visual 
representation of status and behavior situations. It facilitates calculating aggregate 
expectations for actors because the f(i) values of Assumption 4 translate as the
number of links (possession, dimensionality and relevance) connecting actors to
outcome states T+ and T-. Using path lengths connecting actors to task outcomes
for f(i) values, we can calculate numerical values for expectations associated with
every actor in a status graph. Every connection between an actor and a task out- 
come state contributes length = 1 to total path length, so in Figure 2, actor p is 
connected to outcome state T+ by four paths with lengths 4, 4, 5 and 5.° 
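
The path counting described above can be sketched in Python. This is a minimal illustration, not the authors' procedure: the edge list is an assumed reading of Figure 2 as the text describes it, the node name `G` stands in for the general expectation state, and both the rule that paths may not pass through the other actor and the `max_length` cutoff are assumptions made for the sketch.

```python
from collections import defaultdict

# Assumed edge list for Figure 2: (node, node, sign). Possession and relevance
# links are positive; dimensionality links are negative.
EDGES = [
    ("p", "D+", +1), ("o", "D-", +1), ("D+", "D-", -1),
    ("D+", "G+", +1), ("G+", "C*+", +1), ("C*+", "T+", +1),
    ("D-", "G-", +1), ("G-", "C*-", +1), ("C*-", "T-", +1),
    ("p", "b+", +1), ("o", "b-", +1), ("b+", "b-", -1),
    ("b+", "B+", +1), ("B+", "Y+", +1), ("Y+", "T+", +1),
    ("b-", "B-", +1), ("B-", "Y-", +1), ("Y-", "T-", +1),
]
ACTORS = {"p", "o"}
OUTCOMES = {"T+": +1, "T-": -1}  # sign of each task outcome state


def build_graph(edges):
    """Return an undirected adjacency map: node -> list of (neighbor, sign)."""
    graph = defaultdict(list)
    for a, b, sign in edges:
        graph[a].append((b, sign))
        graph[b].append((a, sign))
    return graph


def find_paths(graph, actor, max_length=6):
    """List (length, sign) for every simple path from actor to an outcome state.

    A path's sign is the product of its link signs and the outcome state's sign.
    Paths never pass through an actor (an assumption of this sketch).
    """
    results = []

    def walk(node, visited, length, sign):
        for nxt, edge_sign in graph[node]:
            if nxt in visited or nxt in ACTORS:
                continue
            new_len, new_sign = length + 1, sign * edge_sign
            if nxt in OUTCOMES:
                results.append((new_len, new_sign * OUTCOMES[nxt]))
            elif new_len < max_length:
                walk(nxt, visited | {nxt}, new_len, new_sign)

    walk(actor, {actor}, 0, +1)
    return sorted(results)


if __name__ == "__main__":
    g = build_graph(EDGES)
    print("p:", find_paths(g, "p"))
    print("o:", find_paths(g, "o"))
```

Under these assumptions, actor p has two paths of length 4 and two of length 5, all positive, and actor o has the mirror-image set of negative paths.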

This completes the theoretical development integrating ideas on how behavior 
patterns influence performance expectations and how they combine with status 
characteristics, thus extending the range of situations to which status generaliza- 
tion theories apply. At the same time, we note that this first statement is limited 
to behaviors with clear task significance, and it omits many ideas that are part 
of other theoretical traditions, such as concern with the meanings of gestures, 
presenting an identity or motives to present positively evaluated identities. 

As noted, we will calculate expectations associated with each actor in the
experiment to predict their relative expectations (e_p - e_o), or p's “expectation
advantage,” using the theoretical model of Assumption 4. Then, for different experimental
conditions, we insert the expectation advantage of actors in an empirical model 
to predict power and prestige inequality in our experimental situation. 


The Theoretical Model 


The situation shown in Figure 2 above is not particularly unusual in observable
terms. An actor p who has some status advantage also displays high status
behaviors, interacting with actor o who has a corresponding status disadvantage
and who displays low status behaviors. We anticipate that p will occupy a higher
position in the group’s power and prestige structure than o. However, to specify
just how much higher p's position will be, we turn to the theory. The goal is to
determine a precise value for the expectations associated with each actor, and thus
to get a value for each actor's “expectation advantage.”

Figure 3. Sample Slides for Phase 1 and Phase 2 of the Experimental Design

3a. One-Pattern Slide for Phase 1. Task is to decide whether the pattern contains more black or more white area.

3b. Two-Pattern Slide for Phase 2. Task is to decide which pattern contains the greater area of white.

In Figure 2, the actor p is connected to outcome states T+ and T- by four paths,
all of them positive. We write this as 4, 5, 4, 5. Because path lengths represent f(i)
values in the graph model, p's connections to task outcomes are f(4), f(5), f(4) and
f(5). Using the combining model in Assumption 4 of the theory,
e+ = 1 - (1 - f(4))(1 - f(4))(1 - f(5))(1 - f(5)), and e- = 0. Inserting the f(i) values shown
in Appendix 1, e+ = .332, e- = 0, and e_p = .332. Because the situation is symmetric
(p's status advantage is matched by o's status disadvantage, and similarly for p's and
o's behavior patterns), e_o = -.332 and p's expectation advantage (e_p - e_o) = .664.’

To predict small group interaction, we use the theory’s predicted expectation
advantage for individuals in each experimental condition, along with an empirical
model that translates expectations to observable behavior in this experimental
situation.


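
The combining calculation for Figure 2 can be sketched as follows. The f(i) values below are assumptions for illustration (Appendix 1 is the authoritative source); they were chosen because they reproduce the .183 and .332 figures reported in the text. The function names are hypothetical.

```python
from math import prod

# Assumed f(i) values for path lengths 4 and 5; see Appendix 1 for the
# values the theory actually specifies.
F = {4: 0.1358, 5: 0.0542}


def subset_value(lengths):
    """Combine paths of one sign: 1 - product of (1 - f(i)); 0.0 if no paths."""
    return 1.0 - prod(1.0 - F[i] for i in lengths)


def expectation(positive_lengths, negative_lengths):
    """Aggregate expectation e = e+ + e-, where e- is zero or negative."""
    e_plus = subset_value(positive_lengths)
    e_minus = -subset_value(negative_lengths)
    return e_plus + e_minus


if __name__ == "__main__":
    e_p = expectation([4, 5, 4, 5], [])   # p in Figure 2: four positive paths
    e_o = expectation([], [4, 5, 4, 5])   # o: the mirror-image negative paths
    print(f"e_p = {e_p:.3f}, e_o = {e_o:.3f}, advantage = {e_p - e_o:.3f}")
```

With one positive path of each length and no negative paths, the same function gives about .183, the value that appears for single-element conditions.

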
Experimental Design 


Many experimental tests of development in status theories have adapted a basic 
experimental design, which we use here. Its development is described in Berger 
(2007), while Kalkhoff and Thye (2006) trace variant designs and analyze their
properties. Many experiments use a two-person group, the smallest size for which 
relative status, relative expectations and behavioral inequality are meaningful. The 
experiment has two parts or phases corresponding to initial conditions and pre- 
dicted outcomes. In phase 1, experimenters describe the situation as one of collec- 
tive problem solving and introduce the independent variables status and behavior 
patterns. In phase 2, participants interact and experimenters measure dependent 
variables reflecting relative performance expectations and small group inequality. 

The basic design includes several standard techniques and components. One is
a task, Contrast Sensitivity, said to be a recently-discovered ability that is measured 
by judging slides containing black and white rectangles in checkerboard styles. 
Two sets of slides, one-pattern and two-pattern, have been developed. For one- 
pattern slides, the task is to decide whether each pattern contains more white or 
more black area. For two-pattern slides, the task is to decide whether the top or 
the bottom pattern contains the greater area of white. Figure 3 shows examples 
of both sets of slides. 

Two-pattern slides have been widely used in phase 2 of experiments to measure 
performance expectations created as a result of independent variables in phase 
1. Measurement uses the idea that rejection of influence, a component of small 
group power and prestige, varies directly with an actor's relative expectations. The 
higher actor p's expectations relative to o when they disagree, the more likely is p
to reject influence from o. 

Influence rejection is measured in a two-stage, decision-making process. First, 
a slide appears on a video monitor and both participants register their “initial 
choices” as to whether the top or the bottom pattern contains more white. Next, 
they are shown the partner's purported initial choice, and on most trials, they see 
that it disagrees with theirs. Following further study of the slide, each participant 
makes a “final decision” that is not communicated to the partner. The proportion 
of disagreements resolved in favor of self, P(s), is recorded as a function of each 
individual’s relative performance expectations. 
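
The P(s) measure can be illustrated with a short sketch. The trial representation (a tuple of own initial choice, partner's purported initial choice, and own final decision) and the function name are assumptions made for this example; the source does not describe the data format.

```python
def proportion_self(trials):
    """Return P(s): the share of disagreement trials resolved in favor of self.

    Each trial is (initial_choice, partner_initial_choice, final_decision).
    Trials on which the two initial choices agree are ignored.
    """
    disagreements = [(own, final) for own, partner, final in trials if own != partner]
    if not disagreements:
        raise ValueError("no disagreement trials to score")
    stayed = sum(1 for own, final in disagreements if final == own)
    return stayed / len(disagreements)


if __name__ == "__main__":
    example = [
        ("top", "bottom", "top"),       # disagreement, stayed with own choice
        ("top", "bottom", "bottom"),    # disagreement, accepted influence
        ("bottom", "top", "bottom"),    # disagreement, stayed with own choice
        ("top", "top", "top"),          # agreement trial, not counted
    ]
    print(round(proportion_self(example), 3))
```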

The experimental design has two independent variables: status position and 
behavior pattern. The theory predicts that both status and behavior will affect 
performance expectations, and that expectations will affect subsequent interac- 
tion inequality. Of course, greatest interest attaches to more precise predictions. 
Theoretically that means just how much effect status and behavior will have, and 
how they will combine to affect performance expectations. 

The experiments use gender as a diffuse status characteristic, and influence as
behavior pattern inequality. Gender of p and gender of o have two states, and
behavior pattern has three: advantage, unknown and disadvantage. This gives a
2 x 2 x 3 design. We omit two conditions (M, M, unknown behavior and F, F,
unknown behavior) because they give no information on how behavior affects
expectations. That makes the 10-condition design shown in Table 1.
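
The count of 10 conditions can be checked by enumerating the 2 x 2 x 3 cells and dropping the two omitted ones. This is a sketch only; the labels are illustrative and the ordering does not follow the condition numbering used in Table 1.

```python
from itertools import product

GENDERS = ("M", "F")
BEHAVIORS = ("b+", "unknown", "b-")  # p's behavior pattern: advantage, unknown, disadvantage


def design_conditions():
    """Return (gender of p, gender of o, p's behavior) for every retained cell."""
    cells = product(GENDERS, GENDERS, BEHAVIORS)
    # Same-gender pairs with unknown behavior carry no information and are omitted.
    return [(p, o, b) for p, o, b in cells if not (p == o and b == "unknown")]


if __name__ == "__main__":
    conditions = design_conditions()
    print(len(conditions))
    for cell in conditions:
        print(cell)
```
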

Table 1: Experimental Conditions

            Structural Position    Interactional Position and Predicted Expectations
Condition   P       O              P's Behavior    O's Behavior
1           M       M              b+              b-
2           F       F              b+              b-
3           M       M              b-              b+
4           F       F              b-              b+
5           M       F              (unknown)       (unknown)
6           F       M              (unknown)       (unknown)
7           F       M              b+              b-
8           M       F              b-              b+
9           M       F              b+              b-
10          F       M              b-              b+


Theoretically, status characteristics that do not differentiate individuals may not 
become salient and so they do not affect expectations (Assumption 1 of the theory 
in Appendix 1). Thus conditions 1-4 show effects of behavior patterns by
themselves, when status and other factors are controlled.’ Conditions 5 and 6 show
effects of status inequality when no behavior patterns are salient. Conditions 7 and
8 pair status inconsistently with behavior, either low status and high behavior in
Condition 7, or high status and low behavior in Condition 8. Finally, conditions
9 and 10 pair status consistently with behavior, high status with high behavior in
Condition 9, and low status with low behavior in Condition 10.


Operations 


Participants entered the laboratory and were shown to individual rooms. After 
they read and signed informed consent forms, experimenters appeared on a video 
monitor and described the forthcoming situation in general terms. To induce 
status, we used gender, a diffuse status characteristic known to convey status be- 
liefs (Ridgeway 1997; Rashotte and Webster 2005) and performance expectation 
advantages and disadvantages (Gerber 2001; Hopcroft 2002, 2006; Ridgeway 
and Correll 2000, 2006).’ Operations to make gender salient are straightforward. 
Experimenters invited participants to introduce themselves, and the TV monitors 
showed a partner of the same or different gender to create the status positions 
required for different conditions. Partners, who really were videotaped and thus 
standardized across conditions, stated their names, said they attended the univer- 
sity and named gender-typical hobbies (e.g., working on cars or shopping at the 
mall). The monitor then displayed the participant to herself or himself while s/he 
answered the same questions as the partner had just answered.!° 

We also needed a way to control behavior patterns because those constitute 
the new independent variable in the experimental design. We began by consider- 
ing kinds of behavior that might affect performance expectations. Influence is a 
well-established consequence of performance expectations; we reasoned that it 
could be a source of performance expectations as well. Influence, which after all 
is one component of a group’s power and prestige hierarchy, definitely “has power
and prestige significance,” thus meeting part 1 of the definition in Fisek et al. 
Operationally, we needed a way to control whether an actor accepted or rejected 
influence when the partner disagreed in phase 1. 

We also needed to be sure the behavior was “accepted by all group members” 
to fulfill part 2 of the definition. That part was easier; we could control feedback 
from the partner in phase 1 to show that s/he either accepted or rejected influence 
from p. We describe the new methods because they have not been used before." 

Following introductions, the experimenters explained the session would in- 
volve judging two sets of slides measuring a newly discovered ability called “con- 
trast sensitivity.” Slides for phase 1 involved judging one pattern per slide, to tell 
whether it contains more black area or more white area, and phase 2 used two 
patterns per slide, to tell which pattern contains more white area. The interaction 
conditions would differ somewhat in the two phases. 

Phase 1 had 12 trials, and we controlled initial choice feedback so it appeared 
the partner disagreed with 10 of the 12 choices. The behavior we needed to control 
is whether a person accepts or rejects influence on the disagreement trials. In con- 
ditions 1, 2, 7 and 9, we needed the individual to reject influence, thus enacting a 
positive behavior pattern. In conditions 3, 4, 8 and 10, we needed the individual 
to accept influence, thus enacting a negative behavior pattern. In conditions 5 
and 6, we did not show partner's initial or final choices, so participants had no 
information on behavior patterns. 

The method that we developed to control influence acceptance was a new computer
program that rotates each pattern 180 degrees after initial choices (purportedly
to give participants a different view of it) and at the same time
changes the color of 5 percent of the rectangles in the pattern. The latter fact is not
disclosed to participants until debriefing at the end. The change in patterns is keyed 
to an individual’s initial choice. To cause a participant to reject influence, the pattern 
becomes subtly more like the person’s initial choice. To cause acceptance of influence, 
the pattern becomes more unlike the person's initial choice. Thus we were able to 
control behavior in phase 1 without giving orders or using confederates. 
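
The manipulation described above can be sketched in a few lines. This is not the authors' program: the grid representation, the function names, and the reading of "5 percent of the rectangles" as 5 percent of all cells in the pattern are assumptions made for illustration.

```python
import random

BLACK, WHITE = 0, 1


def rotate_180(pattern):
    """Return a copy of the grid rotated by 180 degrees."""
    return [row[::-1] for row in pattern[::-1]]


def manipulate(pattern, initial_choice, induce, fraction=0.05, rng=None):
    """Rotate the pattern and recolor a fraction of its cells.

    induce="reject": the pattern shifts toward the participant's initial choice.
    induce="accept": the pattern shifts away from the participant's initial choice.
    """
    rng = rng or random.Random()
    opposite = WHITE if initial_choice == BLACK else BLACK
    target = initial_choice if induce == "reject" else opposite
    rotated = rotate_180(pattern)
    n_cells = sum(len(row) for row in rotated)
    n_change = round(fraction * n_cells)
    # Only cells that are not already the target color can be recolored.
    candidates = [
        (r, c)
        for r, row in enumerate(rotated)
        for c, cell in enumerate(row)
        if cell != target
    ]
    for r, c in rng.sample(candidates, min(n_change, len(candidates))):
        rotated[r][c] = target
    return rotated


if __name__ == "__main__":
    board = [[(r + c) % 2 for c in range(10)] for r in range(10)]  # 50 white cells
    shown = manipulate(board, WHITE, "accept", rng=random.Random(0))
    print(sum(map(sum, shown)))  # number of white cells after the change
```

Because the recoloring is keyed to the participant's own initial choice, the same slide can serve either behavior condition, which matches the logic described in the text.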

Our design does not perfectly address how behavior alone may affect expectations 
and behavior. Rather, it causes an individual’s initial choice to appear more likely 
to be correct in the first case, and less likely to be correct in the second case. It thus 
may affect evaluations of one’s own initial choices, as well as affecting behavior.’ A 
test of behavior effects alone would remove the possibility of affecting evaluations at 
the same time, and would constitute a desirable refinement of our work. 

Figure 4 shows how the slide in Figure 3a changed during rotation to influence
behavior towards acceptance of influence. Panel 4a shows how the slide would
change during rotation for a participant who initially chose “black” and panel
4b shows how it would change for a participant who initially chose “white.” Of
course, when the slide was used in a condition where we needed influence rejection,
changes were the opposite; that is, panel 4a was shown to someone who initially
chose “white,” and panel 4b would be shown if the initial choice were “black.”

Figure 4. How the Pattern Changes after the Initial Choice to Increase Chances of Accepting Influence on Final Decision

4a. View Following Initial Choice “Black.” The pattern becomes more white.

4b. View Following Initial Choice “White.” The pattern becomes more black.

Pretesting established that the change was subtle enough not to be detected by
participants, yet powerful enough to induce either the acceptance or rejection of
influence. Thus we were able to create either kind of behavior, b- or b+, as needed 
for the conditions of the experiment. 

Testing the theory also requires creating a status typification, roughly equivalent 
to “seeing a pattern” that one person accepts influence and one rejects it. Presumably 
that means seeing several instances of influence acceptance and rejection. At this 
stage, we were not concerned with just how many unit decisions are required to 
create a status typification; we only wished to be sure that we had created it. It 
seemed reasonable that 10 trials of consistent b- or b+ behavior would be sufficient 
for participants to “see a pattern,” or in words of the theory, to treat the behavior as 
being part of a status typification represented by B- and B+. In fact, that number of 
trials did prove sufficient. Interviews and questionnaires during pre-testing showed 
that participants inferred typifications such as “leader-follower” during phase 1. 

To fulfill part 2 of the definition, that the behavior be accepted by all interactants, we
showed the partner’s purported final decision. In conditions 1, 2, 7 and 9, where
p is linked to a positive behavior pattern, the partner's final decision changed to
p's initial choice; the partner appeared to accept influence under those conditions,
so the partner o was linked to a negative behavior pattern. In conditions 3, 4, 8
and 10, where p is linked to a negative behavior pattern, the partner's final decision
differed from p's initial choice; that is, the partner rejected influence in those
conditions. Thus we induced a positive behavior pattern for p by causing p to reject
influence from o when they disagreed at the same time as o accepted influence. We
induced negative behavior for p by causing p to accept influence from o while o
rejected influence. P's induced behavior fulfills the first part of the definition, and
feedback on o's final decision fulfills the second part.


Behavior, Expectations and Status • 1033


Following phase 1, the experiment moved to phase 2, data collection, and 
experimenters described a “second, related” task. This time each slide contained 
two patterns, one above the other as in Figure 3b, and the task was to tell which 
pattern, top or bottom, contained the greater area of white. As before, initial 
choices were exchanged and controlled disagreements were presented on 20 of the 
23 trials. However in phase 2 the slides did not rotate or change during the trial. 
There also was no feedback of the partner’s final decisions. In phase 2, acceptance 
or rejection of influence measures performance expectations formed from the 
status and behavior patterns in phase 1. As noted, this P(s) measure of expectations 
has been used in many experiments using variants of the basic design.'* Figure 5 
shows an overview of the sequence of interaction in phases 1 and 2. 

Following completion of the slides, participants filled out questionnaires ask- 
ing about performance expectations and impressions of each other and of the 
situation. Then we interviewed each participant individually to explore those 
topics, including success at inducing task focus and collective orientation during 
the phase 2 disagreements. Finally, interviewers fully explained the experiment, 
answered any questions, and thanked and paid each participant. 

Not all data from experiments of this sort are usable for theory testing because 
operations are never 100 percent successful at creating initial conditions or at 
inducing task focus and collective orientation for all participants. Exclusion deci- 
sions are made on the basis of the individual interviews conducted after each 
session,'* and they are made cautiously. First, exclusion requires that a participant 
report both thoughts and actions counter to initial or scope conditions. That is, 
a participant must claim not to have heard or not to have believed an impor- 
tant instruction, and report that he or she altered behavior because of that. This 
criterion precludes excluding data on the basis of an interviewer's vague feeling 
that “something is wrong” with the way a participant interpreted the situation. 
Second, if an interviewer believes data should be excluded in a particular case, 
that recommendation is reviewed by a senior member of the research team, using 
the recorded interview and all other information, such as questionnaire responses. 
Again, the presumption is inclusion unless the senior researcher concurs that 
there are specific reasons to exclude. A total of 328 individuals participated in the 
experiment. Of those, data from 66 or 20 percent were excluded before analysis 
for failure of task focus or collective orientation, leaving 262 cases, 123 males and 
139 females, for analyses reported here. This exclusion rate is comparable to other 
research using this experimental design (e.g., Bianchi 2004; Ridgeway et al. 1998;
Troyer and Younts 1997; Whitmeyer et al. 2005).


Predictions and Outcomes 


To predict behavior, we first use a theoretical model to calculate expectations for
individuals in each condition. Next, we use an empirical model for this experimental
situation to translate expectations to observable power and prestige behavior.

To assess predictions about observable effects of expectations, we use
an empirical model that has been developed for this experimental situation. The
model is:


P(s) = m + q(e_p - e_o).


This model reflects ideas about how expectations will affect one specific behavior, 
disagreement resolution or P(s), in this experimental situation. The theoretical 
model describes how status and behavior affect performance expectations in any 
situation. Thus that model should be applicable to any situation meeting the 
scope conditions of the theory, including this experiment, discussion groups and 
athletic teams. The empirical model, by comparison, only applies to this specific 
experimental design. 

Table 2 shows expectations for actors in each condition, calculated from the 
theoretical assumptions. We use those quantities in the empirical model to predict 
behavior. M and q are empirical parameters. M reflects a population's baseline 
propensity to reject influence, independent of performance expectations; it is 
usually higher for college students (Berger et al. 1992; Webster and Driskell 1978) 
than for high school students (Webster and Sobieszek 1974), and it is higher still 
for U.S. Air Force sergeants (Berger et al. 1972). Parameter q absorbs features of a
situation that make expectations more or less important in determining behavior. 
When participants are highly task focused, for example, q is greater than it is if 
they do not much care about getting correct answers.'® 

We first calculate expectation advantage (e_p - e_o) theoretically, and then estimate
m and q by OLS regression on P(s) data.'’ M is the intercept of the equation and
q is the slope. For these data, the estimates are m = .558 and q = .152. With 10
conditions and 2 quantities estimated empirically, 8 degrees of freedom remain 
for testing predictions. Table 3 shows predicted and observed P(s) for the 10 
conditions, along with s.d. of observed P(s) figures and differences from predicted. 
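
The estimation step can be sketched with a hand-written least-squares fit. The sketch assumes the regression is run on the 10 condition-level values (expectation advantage from Table 2, observed P(s) from Table 3), unweighted; the text does not state this explicitly, so treat it as an assumption. The function name is hypothetical.

```python
# Expectation advantage (e_p - e_o) by condition 1-10, from Table 2.
ADVANTAGE = [.366, .366, -.366, -.366, .366, -.366, .000, .000, .664, -.664]
# Observed P(s) by condition 1-10, from Table 3.
OBSERVED = [.674, .628, .565, .491, .565, .483, .557, .493, .663, .458]


def ols(x, y):
    """Return (intercept, slope) of the least-squares line y = intercept + slope * x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


if __name__ == "__main__":
    m, q = ols(ADVANTAGE, OBSERVED)
    print(f"m = {m:.3f}, q = {q:.3f}")
```

Under that assumption the fit lands very close to the reported estimates of m = .558 and q = .152.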

Table 2: Path Lengths, Positive and Negative Subsets, and Expectation Advantage, by Condition

Condition   +paths     -paths     e+      e-       (e_p - e_o)
1           4,5                   .183    .000      .366
2           4,5                   .183    .000      .366
3                      4,5        .000    -.183     -.366
4                      4,5        .000    -.183     -.366
5           4,5                   .183    .000      .366
6                      4,5        .000    -.183     -.366
7           4,5        4,5        .183    -.183     .000
8           4,5        4,5        .183    -.183     .000
9           4,5,4,5               .332    .000      .664
10                     4,5,4,5    .000    -.332     -.664

Table 3: Predicted and Observed Data, 10 Conditions
Empirical Parameter Estimates: m = .558, q = .152

            Predicted   Observed   Standard    Difference
Condition   P(s)        P(s)       Deviation   (observed - predicted)   N
1           .613        .674       .163         .061                    23
2           .613        .628       .178         .015                    30
3           .502        .565       .252         .063                    26
4           .502        .491       .196        -.011                    29
5           .613        .565       .118        -.048                    23
6           .502        .483       .192        -.019                    24
7           .558        .557       .164        -.001                    26
8           .558        .493       .219        -.065                    28
9           .659        .663       .161         .004                    23
10          .457        .458       .200         .001                    30

The fit of the model is good, though not excellent. Predictions are accurate to
the second or third decimal, with differences ranging from .001 in conditions 7
and 10 to .065 in condition 8, and a mean absolute difference of .029. R² for the
OLS model is .723. G², the proportional reduction in χ² the model yields (Fisek,
Berger and Moore 2002), is also good, .639.
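
The fit figures quoted above can be re-derived from the tabled values. The sketch below plugs the reported m and q into the empirical model, then computes the mean absolute difference and R² across the 10 conditions. Treating each condition as one unweighted observation is an assumption, and small rounding differences from Table 3 are expected because m and q are used at three decimals.

```python
M, Q = 0.558, 0.152  # parameter estimates reported in the text

# Expectation advantage (Table 2) and observed P(s) (Table 3), conditions 1-10.
ADVANTAGE = [.366, .366, -.366, -.366, .366, -.366, .000, .000, .664, -.664]
OBSERVED = [.674, .628, .565, .491, .565, .483, .557, .493, .663, .458]


def predict(advantage, m=M, q=Q):
    """Empirical model: P(s) = m + q * (e_p - e_o)."""
    return m + q * advantage


def fit_statistics(advantages, observed):
    """Return (mean absolute difference, R squared) for the model predictions."""
    predicted = [predict(a) for a in advantages]
    residuals = [o - p for o, p in zip(observed, predicted)]
    mean_abs = sum(abs(r) for r in residuals) / len(residuals)
    mean_obs = sum(observed) / len(observed)
    ss_res = sum(r ** 2 for r in residuals)
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return mean_abs, 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    mad, r2 = fit_statistics(ADVANTAGE, OBSERVED)
    print(f"mean absolute difference = {mad:.3f}, R^2 = {r2:.3f}")
```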

However, a χ² model-fit test shows that predictions are significantly different
from observed data (χ² = 30.98, p < .01). While the fact that a simple model does
not fit the complexity of behavior in a large sample of individuals might not be 
particularly troubling, the difference between predictions and data makes further 
exploration reasonable. 

One factor to investigate is gender; do women and men respond differently 
to the theoretical variables, or did they respond differently to this experimental 
situation? A simple way to investigate gender differences is to compare behavior 
in conditions 1 and 2, and in conditions 3 and 4. According to the theory, these 
conditions show effects of behavior inequality among status equals. If women and 
men respond the same way to behavior among status equals, then we should see 
condition 1 = 2, and 3 = 4. 

By this comparison, men and women respond about the same to the struc- 
ture and interaction conditions of conditions 1-4. Predicted figures for males 
are slightly lower than observed, but that is not true for females, and in all cases 
predicted figures are less than one-half s.d. displaced from those observed. 

We also can examine gender differences in response to the full experimental 
situation, and analyzed this way, gender differences are evident. In fact, almost 
all the errors of prediction are attributable to errors in predicting the behavior of 
males in this experiment. To show that, we estimate the behavior model separately 
for women and men. Table 4 shows these results. 

Table 4: Behavior Model Estimations Separately by Gender

                                        m      q      R²     G²     χ²
Both genders (conditions 1-10)          .558   .152   .723   .639   30.98 (p < .01)
Males only (conditions 1,3,5,8,9)       .567   .122   .408   .578   28.47 (p < .01)
Females only (conditions 2,4,6,7,10)    .559   .172   .979   .976     .90 (p = .82)

Table 4 shows marked differences by gender in how participants responded to
the experimental situation, and the differences may not reflect folk wisdom about
gender differences. For example, it might be supposed that men are basically
more resistant to influence than women are. Such a difference would show up in
the parameter m, the baseline propensity to reject influence; but there is no such
difference. M is only slightly greater for men than for women (.567 vs. .559); the
difference, .008, is hardly important substantively or statistically. Men were not
fundamentally more resistant to influence than women in these experiments.

However the values of parameter q differ greatly. For men, q is .122, and for 
women q is .172. The fractions are small, but the value of q is about 41 percent 
greater for women than it is for men. Recall that q captures situational factors mak- 
ing expectations more or less important in determining behavior. Those factors are 
about 41 percent more important in determining women's behavior than men's.
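
The gender-specific estimates can be approximated by fitting the same line separately to the conditions in which p is male and those in which p is female. As before, the sketch assumes an unweighted fit on condition-level values from Tables 2 and 3, which the text does not confirm; the helper names are hypothetical.

```python
# Expectation advantage (Table 2) and observed P(s) (Table 3), keyed by condition.
ADVANTAGE = {1: .366, 2: .366, 3: -.366, 4: -.366, 5: .366,
             6: -.366, 7: .000, 8: .000, 9: .664, 10: -.664}
OBSERVED = {1: .674, 2: .628, 3: .565, 4: .491, 5: .565,
            6: .483, 7: .557, 8: .493, 9: .663, 10: .458}
MALE_CONDITIONS = [1, 3, 5, 8, 9]      # p is male
FEMALE_CONDITIONS = [2, 4, 6, 7, 10]   # p is female


def ols(x, y):
    """Return (intercept, slope) of the least-squares line through (x, y)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def estimate(conditions):
    """Fit P(s) = m + q * advantage on the listed conditions only."""
    x = [ADVANTAGE[c] for c in conditions]
    y = [OBSERVED[c] for c in conditions]
    return ols(x, y)


if __name__ == "__main__":
    m_men, q_men = estimate(MALE_CONDITIONS)
    m_women, q_women = estimate(FEMALE_CONDITIONS)
    ratio = round(q_women, 3) / round(q_men, 3)
    print(f"men:   m = {m_men:.3f}, q = {q_men:.3f}")
    print(f"women: m = {m_women:.3f}, q = {q_women:.3f}")
    print(f"q ratio (women / men) = {ratio:.2f}")
```

The ratio of the rounded slopes, .172 / .122, is about 1.41, which is the source of the "about 41 percent" comparison.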

The last row of Table 4 shows excellent success for describing the behavior of women
in this experiment. Both explained variance and the proportional reduction in χ²
are more than 97 percent. χ² is only .90, so predicted values are not significantly
different from observed here (p = .82). By contrast, for the men, the model is much
less successful; it explains only about 41 percent of the variance and reduces
unexplained dispersion by about 58 percent. Predictions for men do differ significantly
from observed data (p < .01). Behavioral and questionnaire data generally show greater
variance for males, but we do not yet have a clear answer as to why this is the case.

Conditions 3 and 8 show the greatest departures from predicted behavior, 
.063 and .065, respectively. In Condition 3, males are paired with males, and 
in Condition 8, males are paired with females. In both conditions, p's behavior
pattern is negative; that is, p accepts influence while o rejects influence. With a 
male partner in Condition 3, p responds less strongly to the behavior pattern than 
predicted; P(s) is higher than predicted. Perhaps in that case the situation triggers 
competitive feelings resulting in greater influence rejection. 

However the result in Condition 8 may be more interesting. Here, the usual 
structuring produced by gender is reversed by behavior, and p responded strongly 
to the behavior. P has a status advantage produced by gender and a behavioral
disadvantage, and the latter has greater effect than predicted; that is, P(s) is lower
than predicted. In the usual course of events, someone having a status advantage
will also display behavioral advantage, and the partner o, who has a status
disadvantage, will display behavioral disadvantage. In condition 8, the expected pattern
is disrupted. Perhaps such disconfirmations of the usual consequences of status 
advantage make behavior a more important cue to performance. 

A more interesting interpretation considers both p’s behavior and cultural
prescriptions associated with gender.'* Part of our (and other) cultures' prescription
for male behavior is being superior in agentic competence in relations with
women. In Condition 8, males were induced to fail at this prescription (they
deferred to their female partners), which may have made them feel especially
inadequate in their phase 2 work with the female partner. This interpretation could
be checked by adding questionnaire and perhaps interview items asking males in 
Condition 8 how they felt “most men” would interact with a female partner, and 
whether they felt they acted in a typically masculine fashion in phase 2 of the 
experiment. A different way to get at this effect would be to replicate Condition 
8 using another status characteristic besides gender, such as a laboratory status 
lacking cultural definitions. If the intensification of behavior effects did not appear 
with the laboratory status, the above interpretation would be confirmed. 

Such interpretations need further attention theoretically to determine conditions
making behavior more or less important than status in structuring future
interaction. A first step would be to explore the issue empirically, through
replication of Condition 8 and perhaps later with female subjects who have a status
advantage in same-gender and mixed-gender groups.

Overall, results show reasonable success in the first test of the Fisek et al. theory 
of effects of status position and behavior patterns on performance expectations. 
For the entire sample, predicted behavior is reasonably close to that observed. For 
women, predictions are excellent; for men, less so. From the comparisons we could 
make, it appears reasonable to attribute the gender difference partly to greater suc- 
cess inducing task focus with the women. Cases in which behavior is inconsistent 
with status position deserve further attention as they may have implications for 
improving the theory. 


Discussion and Next Steps 


We have developed new experimental techniques to directly test a precise theory of 
effects of behavior interchange patterns on performance expectations. We believe 
that this new method can be used for a variety of further tests for theories in which 
it is useful to control behavior as an independent variable. 

The theory's predictions explained about 72 percent of variance in data overall:
about 41 percent for men and about 98 percent for women. First, we found
strong effects of status position on performance expectations; this reproduces
much prior research in a new setting with a different subject population. Second,
we found predicted effects of behavior patterns; this is the first time such findings
have been reported. Third, we found that behavior patterns interact with status
position as predicted by the combining function of the theory. While it may not
be surprising to claim that behavior affects group position, the Fisek et al. theory
goes beyond that claim. It specifies what sorts of behaviors, conditions under
which they have effects, precisely how much effect behaviors have by themselves
and in combination with status characteristics, and the power and prestige
elements of group position that behaviors and status can affect.

The design we used creates a fully fledged behavior interchange pattern. That is,
it incorporates both overt action and competence evaluations. When someone defers
to the partner's initial choice in phase 1 of the experiment, that occurs because
his or her own initial choice seems less likely to be correct as the slide rotates,
and vice versa for conditions where the manipulation induces resistance to influence.

Those behavior interchange patterns affect expectations, as predicted by the 
theory and as shown by the experimental results. Would behavior by itself, with- 
out the accompanying subjective assessments that participants’ initial choices were 
correct or incorrect, also affect expectations and subsequent behavior? A complete 
answer will have to await further evidence from experimental designs more precise 
than we were able to create. At present, we do not see how to control an indi- 
vidual’s behavior without introducing the possibility of affecting unit evaluations 
of performance at the same time. It would be relatively easy to control perceived 
behavior by one’s partner, thus affecting expectations held for that person and, 
by extension, relative expectations for oneself. Such a manipulation would have 
about half the effect of manipulating both self and other behavior, as we did here. 
It would require a considerable increase in N to see effects, both because only the 
partner’s behavior is effective, and also because participants’ uncontrolled self- 
expectation levels may vary widely, thus introducing variance in the data. 

Besides investigating this issue experimentally, we might consider theoretically 
what is entailed. The Fisek et al. (1991:117) definition of a behavior interchange 
pattern includes a requirement that “the [behavior] sequences are consistent in 
their power and prestige significance.” Of course, part of what gives a behavior 
pattern significance for power and prestige is precisely its actual or implied unit 
evaluations of performance. An assertive style connotes confidence, and acquies- 
cence to another person’s assertiveness connotes uncertainty. 

However, confidence and uncertainty may differ from actual unit evaluations
of performance. If our behavior manipulation caused participants to form actual
unit evaluations, the cumulative effect of those would be to link actors to states
of the task-specific skill, denoted by C* in Figure 2. In other words, if actors p
and o inferred unit evaluations from the behavior, they would become linked to
states of C*, giving them very short paths (of lengths 2 and 3) to outcome states
T+ and T-. Short paths represent powerful expectations. Thus, if unit evaluations
were inferred, we might expect the behavior manipulation to be considerably more
effective than it actually was in our experiments, more than overcoming the effect
of the status difference.
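
To give a rough sense of the magnitudes involved, the worked arithmetic below applies the combining function of Assumption 4 with the f(i) values listed in Appendix 1. It assumes, for illustration only, that an actor's two paths carry the same sign in each case, and that the comparison case is a single status element connected to the task outcome by paths of lengths 4 and 5. Neither figure is a result reported from the experiment.

```latex
% Paths of lengths 2 and 3 (inferred unit evaluations, linked through C*)
\[
1 - (1 - f(2))(1 - f(3)) = 1 - (1 - .6321)(1 - .3175) = 1 - (.3679)(.6825) \approx .749
\]
% Paths of lengths 4 and 5 (assumed comparison case: one status element)
\[
1 - (1 - f(4))(1 - f(5)) = 1 - (1 - .1358)(1 - .0542) = 1 - (.8642)(.9458) \approx .183
\]
```

Under these assumptions, paths of lengths 2 and 3 yield an expectation value roughly four times that produced by paths of lengths 4 and 5, which is the sense in which short paths represent powerful expectations.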

One way to assess such an analysis would be to contrast two conditions, one in
which we manipulated behavior as here, and another in which the TV monitors
provided feedback “correct answer” or “incorrect answer,” depending on
participants’ behaviors. If the behavior manipulation causes inferred unit evaluations,
the two conditions will produce equal effects on performance expectations. If the
behavior manipulation does not produce inferred unit evaluations, then the design
with feedback will produce a stronger effect on expectations.

At present, then, without additional experimental data, we cannot know what
the precise mechanism is that causes behavior patterns to affect performance
expectations and future behavior. Possibilities include (1) that behavior patterns
cause inferences about confidence that are more uncertain than actual unit
evaluations; (2) that behavior patterns by themselves, without any such inferences,
might affect expectations; and (3) that behavior patterns create inferred unit
evaluations about performance. Refining our understanding of the mechanism
awaits more precise experiments.

If we extrapolate a bit from the experimental results, several interesting im- 
plications appear. These discussions are tentative pending replication with dif- 
ferent participants, different status characteristics and behaviors, and improved 
experimental designs. 

In conditions 1 and 3, and in 2 and 4, behavior patterns by themselves created 
performance expectation differences that translated into power and prestige inequal- 
ity. This confirms ideas of many previous authors that behaviors can affect interac- 
tion, while adding the improved specificity and precision of the Fisek et al. theory. 

Conditions 5 and 8, and conditions 6 and 7, show that behavior patterns can 
attenuate both status advantages (the first comparison) and disadvantages (the 
second comparison). That is, positively evaluated behavior can raise the power 
and prestige position of a low status person, and negatively evaluated behavior can 
lower the position of a status-advantaged person. These conditions therefore show 
a way to intervene in the status generalization process when such changes might 
be desirable. Inconsistent status position and behavior pattern may in some cases 
trigger legitimacy concerns or other social processes. 

Because behavior patterns and status characteristics both affect performance 
expectations, we believe that in many situations neither status nor behavior alone 
is sufficient to fully structure a group. A high status person should not overlook 
behavior patterns if he or she wishes to be influential. Similarly, a low status 
person need not accept a low position as inevitable, for it is malleable through 
appropriate behavior patterns. In the simplest cases where status advantages and 
disadvantages are not salient (e.g., a group undifferentiated by gender, skin color, 
age, etc.), behavior patterns may be the primary determinants of individuals’ 
power and prestige positions. 

Further work might seek to improve precision of the mechanism by which 
behavior becomes salient in social situations. One question is how many unit 
behaviors (bs) are needed to create an imputed behavioral typification state (A).
As a first approach, we used 10 trials in which someone was very likely either to 
accept or reject influence. Would it have worked with only five trials, or with 
three? In other words, how strong is the tendency to attribute traits on the basis of 
behavioral observations? Quite likely, the answer involves conditions and perhaps
differences by type of behavior. Perhaps when there is little other information, 
people are likely to make attributions on the basis of limited behavioral
observations, while they might require more repetitions in richer situations.19

A second avenue for further work might be to explore different kinds of 
behaviors. Again, as a first approach here, we used a behavior that is part of a 
group’s power and prestige structure, reasoning that it would definitely qualify 
as “having power and prestige significance,” a crucial part of the definition of 
a behavior pattern. It might be worthwhile examining how other kinds of cues 
such as speed of speaking, holding eye contact, or standing close can have power 
and prestige significance. 

Considerable research in this tradition has been devoted to finding ways to 
overcome undesirable status generalization, from imputations of skill from skin 
color (Webster and Driskell 1978; Goar and Sell 2005), to gender-based advan- 
tages and disadvantages (Ridgeway and Correll 2004; Wagner, Ford and Ford 
1986), to interaction disabilities from ethnicity (Cohen and Lotan 1997; Chizhik 
et al. 2003). The general approach of such work has been to “add on” differently 
valued status characteristics. For example, a class may be shown that a child with a 
disvalued ethnic status possesses other positively valued status characteristics such 
as leadership or performance skills. While effective, that approach is limited to 
circumstances where someone such as a teacher controls believable information 
about status characteristics. It now appears that control over behaviors, which are
somewhat voluntary, can also be used to intervene to overcome undesirable
status generalization.

In terms of our first goal, to test the Fisek et al. theory, we have found reason- 
able confirmation of predictions regarding how behavior affects group structure. 
Our second goal, to develop a method for controlling status-relevant behavior, 
led to the computerized program and procedures described. We hope they will be 
useful in further, improved experiments. 


Notes 


1. Status and behavior may interact in complex ways, particularly in the case of gender 
status. Ridgeway (1982) showed that women who claim expertise in groups often 
must simultaneously express group loyalty to defuse men’s resentment at what 
otherwise might seem inappropriate behavior given their status positions. That 
research also showed that men need not take the additional step of proclaiming 
group loyalty before offering suggestions. 


2. Russell, Bachorowski and Fernandez-Dols (2003) trace interest in emotional 
expression to Charles Darwin and also to the art historian Bell (1806). Psychologists 
Tomkins (1962) and his student Ekman (e.g., Ekman and Oster 1979) developed a 
very influential categorization of facial expressions. Interestingly, psychologists have 
recently emphasized the intentional communication aspects of expression rather than 
the unconscious “given off” approach of many studies following Goffman’s writing. 


3. Two properties of status characteristics merit comment. First, whether a particular
characteristic meets the definition is a historical issue, not a theoretical one. There is
no claim that gender or mechanical skill is a status characteristic. Rather, the theory
claims that if those things meet the definitions, then certain interaction consequences
will follow. Second, the definitions depend on cultural beliefs, not scientific measure.
Men are not more capable than women at “most tasks,” but gender is a diffuse status
characteristic because cultural beliefs treat men that way. A considerable body of
work is devoted to identifying conditions that attach status value to characteristics
(e.g., Ridgeway 1991; Webster and Hysom 1998), and to interventions that reduce
or eliminate status generalization where it has undesirable consequences (Cohen and
Lotan 1997; Goar and Sell 2005; Ridgeway and Correll 2006).


4. These theoretical processes are not usually conscious or deliberate. This is a theory of
behavior, not of thoughts. Gender status effects, for example, occur in the absence of 
overt sexism and in the presence of a desire to be egalitarian (Ridgeway 1997; Ridgeway 
and Correll 2006; Ridgeway and Walker 1995; Rashotte and Webster 2005). People 
ordinarily do not think “I am a woman and he is a man, and therefore he probably 
knows more about this topic than I do,” and they would be appropriately angered by 
a suggestion that they do think such things. People might sometimes behave as if they 
thought that, but the theory does not use thought processes for its predictions. 


5. The combined effect of consistently valued status elements is subject to diminishing
returns, and conversely, an inconsistent status element has greater effect than it would 
by itself. These effects are captured in the combining model of assumption 4. Tests 
to assess this combining function in comparison with other functions are reported 
in Berger et al. (1992). Results show good confirmation of the combining function 
in assumption 4. 


6. Greater detail on calculating path lengths is provided in Berger et al. (1977).


7. This is condition 9 of the experiment described below; other conditions create other
patterns of expectation advantage and disadvantage. Table 1 shows the conditions. 


8. The theory makes the same predictions for women as for men; thus predictions are
identical for conditions 1 and 2, and for conditions 3 and 4. As it happens, we found 
some gender differences in responses to the experiment. 


9. Social meanings of characteristics (including gender) change, so it is crucial in these
experiments to know that gender is a status characteristic for our participants. Foschi 
and Lapointe (2002) reported that gender may be decreasing in status significance 
among their Canadian students, although other studies (e.g., Hopcroft 2002; Lucas 
2003; Rashotte and Webster 2005) show gender’s status effects persist. In conditions 
5 and 6 of our experiments here, status effects of gender are evident. 


10. Video unavoidably includes additional information besides gender, and some
information, such as attractiveness, has known status significance (Webster and 
Driskell 1983). On the other hand, simply describing a partner’s gender does not 
take full advantage of a strength of experimental methods, namely, strongly inducing 
independent variables. Rashotte (2007) discusses reasons for preferring strong 
induction of experimental variables. In this experiment, investing the video partner 
with gender-stereotypic hobbies was included to strengthen the impression he or she 
made on participants and help them remember gender throughout the experiment. 
Post-session interviews confirmed that they did remember their partner's gender. 
Additional cues such as beauty were constant across conditions, for every participant 
saw either the same female or the same male partner. Still, the strong induction using 
video unavoidably introduces the possibility that participants will respond to other
information besides gender of the taped “partner,” and we have no independent 
evidence of just what any given participant attended to in the videos. The best way 
to deal with such concerns would be through more elaborate pre-testing asking 
participants what they attended to in the videos. Rashotte, Webster and Whitmeyer 
(2005) offer some suggestions for doing that kind of pre-testing. We thank a Social 
Forces reviewer for a thoughtful discussion of this important issue. 


11. Some previous studies of behavioral effects have studied naturally occurring behaviors
such as interruptions in discussion groups and charted their effects on power and 
prestige (e.g., Smith-Lovin and Brody 1989). Others have used confederates trained 
to behave in certain ways and measured their effects on others’ behavior (e.g., 
Ridgeway and Correll 2006). We believe ours is the first design to create workable 
operations for controlling behavior of actual participants without telling them how 
to act or telling them that their behavior has been influenced. 


12. We thank the Editor and a Social Forces reviewer for pointing this out, and for
identifying our design as testing a “full-fledged” behavior pattern as described in the
Fisek et al. theory. However, as the reviewer noted, competence evaluations interact
with behavior, and some combinations, such as when someone thought to have low
competence acts assertively, can create additional processes including legitimacy
concerns.


13. We use a computer program to display the slides, present agreement and disagreement,
and record responses. The program was written for PC, and is available from the 
authors on request. It is one of two programs widely used in status and expectations 
experiments. The other program was written and adapted by Foschi (1996) and by 
Troyer (2007). The programs create very similar situations. Both have 20 critical 
(disagreement) trials. Ours has three neutral (agreement) trials interspersed, and the 
Foschi-Troyer program has five neutral trials. Earlier experiments in this research 
tradition have used as many as 40 or even 62 trials (Berger et al. 1972; Martin and 
Sell 1985). While more trials give a more stable measurement, the interaction itself 
may introduce fatigue and other spurious processes. 


14. Of course, interviewers use terms appropriate to participants’ language rather than
the terms of the theory. For example, if a participant says “I didn’t care about what
my partner thought, so I didn’t look at her initial choices. I don't know how often we
disagreed, and she never affected my final decisions,” that shows failure of collective
orientation that affected behavior, and so it is sufficient for exclusion. However,
professing independent thinking is quite normative for student populations, and often
it is difficult to distinguish normative answers from real failure of collective orientation.
In doubtful cases, data are included, following the exclusion rules described above.


15. We aimed for N = 25 in each condition, though operational considerations and
exclusions caused the numbers to vary from a high of 29 in Condition 1 to a low of 
22 in conditions 2 and 4. Foschi and Lapointe (2002) provide a useful discussion of 
criteria for including and excluding experimental data. 


16. The parameter m is close to, but generally a bit greater than, .50, probably because
changing one’s initial choice is somewhat unpleasant even if one thinks that changing
improves the final decision. The parameter q generally ranges from about .10 to .18.
Higher values of q are desirable because they display expectation effects more clearly,
but given the limits of a short-term experimental session, an unfamiliar partner and
anxiety about the situation, no experiments that we know of using this design have
attained a q value beyond that range.
Fisek et al. (2005) discuss empirical parameters for this and other settings.


17. Fox and Moore (1979) first proposed this technique, and it has been widely used
for similar experiments. Balkwell (1991) developed a logit-like model with a strong
theoretical foundation and applicability to other settings such as discussion groups
in addition to this experimental design. Generally, the OLS model and the
logit-like model give similar results for this experiment except for unusual cases where
many status elements are salient or where the paths of relevance are very long. Using
Balkwell’s model for our data increases R² slightly. For comparability with previous
research, we report OLS results here.


18. We thank a Social Forces reviewer for suggesting this point. 


19. The original power and prestige theory (Berger and Conner 1969) contains many 
similar arguments and ideas. 


Appendix 1. The Theory of Status Characteristics and Expectation States
Integrated with Concepts and Assumptions from Fisek et al. (1991) 


Terms and assumptions below include only those needed for the work described 
in this paper. We omit ideas used in other contexts, including the definition of 
specific status characteristics (Cs) in the general theory and some additional as- 
sumptions on legitimation processes in the Fisek et al. theory. 


Scope Conditions: All individuals are (1) task focused and (2) collectively oriented.


Initial Conditions for Test: At least two actors, p and o (person and other), any
number of diffuse status characteristics (D+, D-), and a behavior pattern (b+, b-)
in some conditions. Task outcomes (T+, T-), and a specific characteristic relevant
to task outcomes (C*+, C*-). Status characteristics and behavior patterns are called
“status elements,” and have both positive and negative (+ and -) states. Relations
of three types exist among actors and status elements: possession, relevance, and
dimensionality.


Definition 1: A characteristic D is a diffuse status characteristic ≡
(1) it has two or more differentially evaluated states in society;
(2) each state carries similarly evaluated specific performance expectations; and
(3) each state carries a similarly evaluated general expectation state.


Definition 2: Behavior cycles or sequences constitute a behavior interchange
pattern (b) ≡
(1) they are consistent in their power and prestige significance; and
(2) they are mutually accepted by all actors.


Definition 3: A status typification (A) is a socially constructed abstract conception 
of what high status and low status behaviors are like. 


Assumption 1. Salience. All status elements (status characteristics and behavior
patterns) that differentiate actors or that actors believe are relevant to task
completion become salient; that is, they will affect performance expectations and
behavior in a social situation described by the scope and initial conditions above.


Assumption 2. Burden of Proof. Unless actors believe that a given status element
is irrelevant to the task at hand (T), they treat it as relevant. Actors will
form performance expectations for every actor; those expectations will have the
same valence as the society's evaluation of the status elements. D induces general
performance expectations Γ having the same sign as D, and a relevance bond will
form connecting Γ to C*. Then b induces a status typification state (A) having the
same sign, A induces Y, the idea of doing well at tasks in general, and Y becomes
relevant to doing well at the group’s particular task T.


Assumption 3. Sequencing. Status generalization continues as described in as- 
sumptions (1) and (2) until all actors are connected to T by all possible paths. If 
actors leave or new actors enter the situation, all existing connections remain and 
new connections appear. 


Assumption 4. Combining Model. All salient status information is used to cre- 
ate expectation states for all actors. Aggregate performance expectations may be 
represented as: 


\[
e_p^{+} = \left[1 - (1 - f(i)) \cdots (1 - f(n))\right];
\]
\[
e_p^{-} = -\left[1 - (1 - f(i)) \cdots (1 - f(n))\right]; \text{ and}
\]
\[
e_p = e_p^{+} + e_p^{-},
\]

where $e_p$ is aggregate expectations for actor p, $e_o$ is aggregate expectations for
actor o, and $f(i)$ is a function of the length of path i connecting an actor, through
intervening status elements, to states of the task outcome T.


Assumption 5. Behavior. Once aggregate expectations have formed for all actors,
p's and o's relative positions in the power and prestige hierarchy of the group are
direct functions of their expectation advantage ($e_p - e_o$).
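
In experiments in this research tradition, Assumption 5 is commonly made operational through a linear function, P(s) = m + q(e_p − e_o), where P(s) is the probability of rejecting influence. The sketch below illustrates that translation only. The function name and the default values m = .50 and q = .15 are illustrative assumptions drawn from the ranges discussed in the notes, not estimates from the data reported here.

```python
def predicted_stay_probability(advantage, m=0.50, q=0.15):
    """Translate an expectation advantage (e_p - e_o) into a predicted P(s).

    m is the baseline tendency to reject influence and q scales the effect
    of expectation advantage. Both defaults are illustrative assumptions.
    """
    p_s = m + q * advantage
    if not 0.0 <= p_s <= 1.0:
        raise ValueError("parameters give a probability outside [0, 1]")
    return p_s


if __name__ == "__main__":
    # Hypothetical advantages: disadvantaged, equal, and advantaged actors.
    for adv in (-0.37, 0.0, 0.37):
        print(f"advantage {adv:+.2f} -> P(s) = {predicted_stay_probability(adv):.3f}")
```

With equal expectations the prediction reduces to m, and a positive q means that an expectation advantage raises P(s) while a disadvantage lowers it.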


A note on using Assumption 4. The f(i) values in Assumption 4 represent the “strength”
of status elements on performance expectations. They have been estimated in
different ways: empirically (Berger et al. 1977) and theoretically (Balkwell 1991;
Fisek, Norman and Nelson-Kilger 1992). The three methods provide closely
similar estimates, though theoretical considerations may dictate choice of method.


We use f(i) as determined by Fisek et al. (1992). Values are:

f(2) = .6321
f(3) = .3175
f(4) = .1358
f(5) = .0542
f(6) = .0211.


Generally, when an actor is connected to task outcomes by short paths (with low
i, and therefore high f(i) values), the status elements in that connection have
greater effects on performance expectations than connections through many relevance
links. Berger et al. (1977) and Balkwell (1991) discuss f(i) values in detail.
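
The combining model of Assumption 4 can be sketched in a few lines of code. This is a minimal illustration rather than the program used in the experiment: the function names are hypothetical, and the example configuration (one actor with positive paths of lengths 4 and 5, the other with negative paths of the same lengths) is an assumption chosen only to show the arithmetic. The f(i) values are those of Fisek et al. (1992) listed above.

```python
from math import prod

# f(i) values from Fisek et al. (1992), keyed by path length i.
F_VALUES = {2: 0.6321, 3: 0.3175, 4: 0.1358, 5: 0.0542, 6: 0.0211}


def combine(path_lengths):
    """Combine same-signed paths: 1 - (1 - f(i)) ... (1 - f(n)).

    An empty list of paths contributes nothing (returns 0.0).
    """
    return 1.0 - prod(1.0 - F_VALUES[length] for length in path_lengths)


def aggregate_expectation(positive_paths, negative_paths):
    """e = e+ + e-, where e- is the negated combination of the negative paths."""
    return combine(positive_paths) - combine(negative_paths)


def expectation_advantage(p_paths, o_paths):
    """Return e_p - e_o; each argument is a (positive, negative) pair of lists."""
    return aggregate_expectation(*p_paths) - aggregate_expectation(*o_paths)


if __name__ == "__main__":
    # Hypothetical configuration, for illustration only.
    p = ([4, 5], [])   # p: positive paths of lengths 4 and 5
    o = ([], [4, 5])   # o: negative paths of lengths 4 and 5
    print(round(expectation_advantage(p, o), 4))
```

Because same-signed paths are combined multiplicatively before positive and negative subsets are summed, each additional consistent path adds less than it would alone, which is the diminishing-returns property noted for consistently valued status elements.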


Innovation and Selection: Symphony Orchestras and the
Construction of the Musical Canon in the United States 
(1879-1959) 


Pierre-Antoine Kremp, Princeton University 


This article analyzes the determinants of innovation and success of innovation in 
the field of U.S. symphony orchestras from 1879 through 1959: why did major or- 
chestras (N = 27) innovate by introducing works of new composers to the repertoire 
instead of sticking to canonical pieces? Can organizational processes account for the 
selection and the popularization of new composers in the repertoire? By integrating 
field theory and organizational theory, this analysis shows that orchestra and musical 
director consecration and local elite cohesiveness favored innovative programming. 
Composers introduced by consecrated actors and entering the repertoire at a time 
of low competition with established composers and high field-level innovation were 
more likely to survive in the repertoire and have their works performed frequently. 
These effects became magnified throughout composers’ careers. 





The field of symphony orchestras has developed around the production and the 
reproduction of an institutionalized canon. The conservatism of the field is re- 
flected in the concentration of the repertoire: of the 1,612 composers ever played 
by 27 major American symphony orchestras from 1879 to 1959, 13 composers 
accounted for half of the total number of performances,1 and the 100 most played
composers accounted for 86 percent of the performances. Not only is the reper- 
toire dominated by a small number of composers, the list of the most performed 
composers is remarkably stable: 12 of the 20 most played composers in the 1950s 
were already among the 20 most played composers in the 1880s. In this context, 
critics have repeatedly denounced the “overdose of ancestor worship” in concert 
programming (Hart 1973) which has discouraged artistic creation and left major 
orchestras in a “symphonic quagmire.” (Horowitz 1987) 

Still, the image of an inert field dominated by the authority figures of
canonical composers should not obscure the successive attempts at introducing
novelty into concert programming in the United States since the late 19th
century (Dowd et al. 2002): orchestras have renewed the repertoire by performing
works of new composers that had never been played before (including 20th
century composers and “resurrected” composers from the Renaissance and the
Baroque period that were not initially part of the musical canon2). While most
of these new composers disappeared from the repertoire soon after their first
performances, some have survived and are played regularly. A few have become
popular and part of the musical canon (Mueller 1951).

Why do orchestras innovate by introducing new composers to the repertoire, 
and why do certain innovations succeed while others fail? This research provides 
a sociological account of these ongoing processes of innovation and selection 
through which new composers are introduced and legitimated or rejected within 
a bounded social space of organizations aiming to perform “classical music.” 
In contrast to studies emphasizing the personal role of individual conductors 
committed to promoting new music and renewing the musical canon,3 I draw
on analyses of the production of culture, in which “the symbolic elements of 
culture are shaped by the systems within which they are created, distributed, 
...and preserved.” (Peterson and Anand 2004:311) Namely, I concentrate on the 
field of organizations specialized in the performance of classical music as one 
such system constraining cultural innovation and its success or failure. In par- 
ticular, I analyze the relationship between the relative positions of actors (such 
as orchestras and conductors) in this field and their capacity to depart from 
existing categorical boundaries by introducing new composers; and I investigate 
the effect of their positions in the field coupled with the ecology of repertoires 
on composers’ careers. 

The sociological and historical literature on the construction of the mu- 
sical canon has focused on two sequential processes: the formation and the 
reproduction of the canon. The formation of the canon has been analyzed as a 
product of the emergence of a class-based distinction between “highbrow” and
“lowbrow” taste during the 19th century, first in Europe (Weber 2004; DeNora
1991) and later in the United States (DiMaggio 1991). This construction of
highbrow taste by new economic elites (and, in the case of Europe, the nobility)
led orchestras to specialize in the performance of what became defined
as classical music. Formerly heterogeneous collections of “light” and “serious”
pieces, repertoires became more homogeneous in the second half of the 19th
century: orchestras started favoring music of the past over recently composed
music and concentrating on a narrow set of mostly Austro-German composers
(the “classics”) (Weber 1992, 2001). The reproduction of the canon has been
explained by inertial pressures within orchestras leading to a marginalization of 
innovation and the institutionalization of the categorical boundaries defining 
“classical music.” Three interrelated organizational processes can account for the 
reproduction of the canon through the continuous predominance of a limited 
set of composers in the repertoires: the bureaucratization of large symphony 
orchestras (Arian 1971), the alignment of aesthetic interests of concert artists 
and administrative rationality of orchestras (Gilmore 1993), and the emergence 
of a culture of performance and virtuosity rather than creativity within major 
symphony orchestras (Horowitz 1987, 2005). 

In a conservative field oriented towards the reproduction of a canon, the 
sociological processes explaining why orchestras would depart from tradition
and innovate by performing works of new composers are not well understood. 
Innovation in concert programming has been related to changes in cultural 
classification systems since the 1960s in the United States. After the progressive 
institutionalization of the musical canon beginning in the late 19th century, the
contestation of the distinction between “high” and “popular culture” in the 
1960s led to its deconstruction (DiMaggio 1987; Horowitz 2005). Dowd et al. 
(2002) amend this argument by linking the erosion of the musical canon to three 
factors: the expansion in resources for new music, the increase in performance 
capabilities of symphony orchestras, and the proliferation of music programs 
in higher education institutions. However, an ambiguity over the timing of the 
deconstruction of the canon remains. While DiMaggio (1987) suggests that the 
canon became progressively institutionalized after the late 19th century and that
its deconstruction only started in the 1960s, Dowd et al. (2002) argue in favor
of an “early deconstruction” account and suggest that concert programming
became more innovative in the early 20th century.

This research analyzes the transformations in the cultural order of the field by 
bringing the organizational level back into the study of repertoire change. This 
approach reconciles the two accounts by showing that both representations reflect 
actual changes in the repertoires although at two distinct levels of analysis: while 
the field as a whole has become more open to new composers during the first 
half of the 20th century, orchestras taken individually have become markedly less
innovative throughout this period. Moreover, although the mechanisms driving 
repertoire change at the field level are well understood (Dowd et al. 2002), the 
processes explaining innovation at the orchestra level remain unclear. As Figure 1
shows, the number of new composers introduced per season is far from evenly 
distributed among orchestras; and the selection of new composers after their intro- 
duction to the repertoire varies dramatically by orchestra of first performance. The 
orchestra-level approach focuses precisely on these unexplained variations: why 
did the Chicago Symphony Orchestra introduce more composers than the Los 
Angeles Philharmonic? Why did composers premiered in Chicago, Philadelphia 
or Boston appear to survive better after their first performance? 

This research also contributes to the study of the production of culture, not 
only by analyzing the processes leading to the introduction of new composers 
to the repertoire, but also by following their careers after their introduction. 
Analyzing the introduction of new composers in the repertoire gives important 
insights into the role of innovation in the transformation of the canon, but 
because a high proportion of new composers disappears from the repertoire very 
early on, their impact on the repertoire is unclear. In other words, this article 
analyzes the determinants of innovation as well as the factors explaining success
and failure of innovations.

Figure 1. Innovation and Selection of Innovation by Orchestra, 1879-1959


[Figure 1 panels. Left: innovation by orchestra (average number of new composers introduced per season). Right: selection of innovation by orchestra of first performance (percentage of introduced composers who disappeared from the repertoire after first performance).]

Explaining Variations in Innovation and Success of Innovation 
Variations in Innovation 


Resource Mobilization 

Sociology of culture has emphasized the importance of resource mobilization 
as a factor in the institutionalization of innovation in art worlds (Becker 1982; 
Baumann 2001, 2007). In this perspective, the production of new artistic works 
is legitimated by the availability of organizations and networks that make innova- 
tion possible. The theory predicts that resource mobilization generates positive 
externalities in which present innovation makes future innovation more likely, all
things being equal.

In the case of classical music in the United States, Dowd et al. (2002) document
the proliferation of resources for new music in the early 20th century, which not
only involved efforts to help composers promote their works both from funding 
organizations and newly-created associations of composers (such as the League of 
Composers, see Metzer 1997), but also consistent interest from particular orches- 
tras and conductors in opening their repertoires to new music. 

Dowd et al. (2002) show that the increase in resources for new music at the
field level (measured by a proxy, the total number of new composers introduced
in the previous year) played a significant role in the rise in the number of first
performances by creating a favorable climate for innovation. However, this effect 
may be observed at two distinct levels of analysis: the field level (through efforts 
from funding organizations or the creation of associations of composers) and the 
orchestra level (through local efforts by orchestras to promote new music). 


Hypothesis 1: Innovation is positively associated with 
resources for new music available in the field of orchestras 
and available in the orchestra. 


Field Position 

Field theory has emphasized the role of past and present struggles over the appro- 
priation of economic and symbolic profits among artists and art organizations in 
explaining their “position-takings,” i.e., their propensity to promote different types 
of art and different conceptions of what art is (Bourdieu 1993). In this perspective, 
the propensity to depart from the musical canon through innovative programming is
not randomly distributed within the field. Artistic fields tend to be structured along 
two main dimensions: the level of consecration (or “symbolic capital”) and the 
level of autonomy from economic profits and market demand (Anheier et al. 1995; 
Bourdieu 1996). Relative positions within this structure can in turn explain why 
some orchestras and musical directors innovate more than others. 

On the first dimension, consecrated actors are more likely to defend orthodoxy 
in cultural production and less likely to innovate because their symbolic capital 
was founded on already-established types of cultural production—such as the 
performance of canonical works (Bourdieu 1996). This hypothesis is consistent 
with findings from innovation and diffusion studies showing that the quest for novelty
and deviation from established norms within organizational fields tend to
originate from low-status actors—who have more incentives to innovate than 
established actors (Leblebici et al. 1991; Strang and Soule 1998). On the second 
dimension, actors benefitting from a high level of autonomy from non-artistic 
(mostly economic) pressures may be more likely to adopt unorthodox positions 
and innovate. Multiple studies on the performing arts sector have shown that 
innovative programming is unlikely when organizations face financial pressures
(on operas in the United States, see Martorella 1977, 1982; Heilbrun 2001b; on
nonprofit theaters, see DiMaggio and Stenberg 1985), which illustrates Bourdieu’s
argument that the pursuit of purely artistic goals is a function of the autonomy 
from economic necessity, even in nonprofit cultural industries. This dimension 
intersects with the opposition between “large-scale” and “restricted” forms of 
cultural production — respectively directed towards a general audience or a narrow 
public of connoisseurs (Bourdieu 1993:127). 

Position in the field can explain differences in innovation between orchestras, 
and within orchestras, between musical directors. The literature on the field of 
symphony orchestras and careers of conductors points to several useful indicators 
of field position. 

The level of consecration of orchestras seems tied to their age, as older orchestras 
have consistently been recognized as the most prestigious ones in the field: critics
have identified the group of most prominent orchestras as the “Big Three” (the
New York Philharmonic, the Boston Symphony and the Chicago Symphony) or,
starting in the 1950s, the “Big Five” (adding Philadelphia and Cleveland) (Hart
1973; Faulkner 1973; Goldin and Rouse 2000). These groups include some of the 
oldest orchestras in the field.4 Anecdotal evidence also suggests that older orches- 
tras (such as the Boston Symphony) were regarded as models by newly founded 
orchestras (Hart 1973; Horowitz 2005), which seems consistent with the idea that 
older orchestras possessed significant symbolic capital. 

Orchestra autonomy from non-artistic constraints can be related to the eco- 
nomic environment in which orchestras are located. Research on the economics 
of the performing arts sector shows that organizations located in less populated 
cities are more likely to face financial pressures as the potential demand for music 
performance is limited (Baumol 1971; Heilbrun 2001a). Small cities therefore
constitute an unfavorable environment for innovation, as innovative repertoires
are costly for orchestras: they require more rehearsal time and attract smaller 
audiences (Baumol and Bowen 1966). Conversely, organizations located in larger 
cities are more likely to be able to pursue purely artistic goals in their program- 
ming decisions thanks to the presence of a significant demand for “restricted” 
(as opposed to more conformist and “large-scale”) forms of cultural production
(DiMaggio and Stenberg 1985).


Hypothesis 2a: Innovation is negatively associated with 
orchestra consecration in the field. Older orchestras are 
less likely to innovate. 


Hypothesis 2b: Innovation is positively associated with 
orchestra autonomy from economic constraints. Orchestras
located in larger cities are more likely to introduce new
composers.

Similarly, field position may affect musical directors’ attitude towards innovation
and their capacity to include works of new composers in their orchestras’
repertoires. While indicators of symbolic capital are necessarily imperfect, the 
historical literature on conductor careers has described the field as dominated by a 
small number of star conductors with similar social trajectories (Woodbury 1955; 
Hallmark 1986; Lebrecht 1991; Horowitz 2005). Most of them were born outside 
of the United States and had held previous musical director positions in other 
orchestras (both in Europe and in the United States). Conversely, American-born 
conductors with limited previous experience tended to hold fringe positions in 
the field and were unable to obtain appointments with prestigious orchestras
or external recognition.5


Hypothesis 3: Innovation is negatively associated with 
musical director consecration in the field. Musical directors 
holding dominant positions in the field (born in Europe 
and/or with past experience in other orchestras) are less 
likely to innovate. 


Regional Contexts of Orchestra Institutionalization and the Construction
of Highbrow Taste

Highbrow musical taste became defined in the United States only after “cultural 
entrepreneurs” were able to create an organizational form capable of maintaining
the distinction between popular and classical music in the late 19th century
(DiMaggio 1982b, 1991). Non-profit permanent orchestras could offer more per- 
formances, rid their repertoires of “light” pieces and specialize in the performance 
of the newly aestheticized “serious” music (DiMaggio 1982a). Such programming
choices attracted a limited audience and exposed orchestras to financial losses, but 
wealthy patrons dedicated to the institutionalization of highbrow music and the “el- 
evation of public taste,” such as Henry Lee Higginson in Boston or Charles Norman 
Fay in Chicago, would increase their donations to cover the endemic deficits of the 
orchestras they helped found (Levine 1988). While DiMaggio predicted that this 
trend led to the sacralization of canonical composers and a dearth of innovation, 
recent research has qualified this argument. In particular, Dowd et al. (2002) have 
argued that this process of “purification” of repertoires and the expansion in orches- 
tra performance capabilities led to a wave of innovation early in the history of the 
field as the nonprofit organizational form became institutionalized. As orchestras 
had to replace popular pieces with classical works and perform more often, their 
repertoires had to expand and include works of new composers. 

The differential ways in which nonprofit permanent orchestras were institu- 
tionalized across the United States can explain differences in innovation rates. 
This argument is largely consistent with studies of legitimation processes that 
trace the origin of innovations back to their validation in local cultural contexts
(Johnson et al. 2006). The critical factor explaining the development of the nonprofit
form is the cohesiveness of urban elites (DiMaggio 1987). The creation of
the Boston Symphony Orchestra and its subsequent success were facilitated by 
the cohesion of the Boston Brahmins who perceived their social position, politi- 
cal domination and cultural authority as threatened due to industrialization and 
new waves of immigration (DiMaggio 1991). Elite cohesiveness is difficult to
measure directly; however, historical and sociological research has documented
important and consistent variations across local contexts in the late 19th century.
While the Philadelphia upper class exhibited a sense of unity and a perception
of its collective interests similar to those of the Boston elite (Beisel 1990), the New York
establishment was profoundly divided (DiMaggio 1987) and was not able to unite
around a single orchestra until 1928. Until that date, the New York Philharmonic
had to compete against the New York Symphony (which was backed mostly by
the German elite); and the orchestras presented signs of incomplete institutionalization:
the Philharmonic became a permanent orchestra only in 1909; seasons
of the Symphony were cancelled from 1899 to 1903. More generally, historical
research on urban elites (Jaher 1982) has found that the upper classes of Western
cities tended to be less cohesive and more fluid on average than in Rustbelt and
midwestern cities, in the South and in the Northeast (with the notable exception
of New York), which could explain the difficulties experienced by West Coast orchestras
soon after their creation.6 While elite cohesiveness was a central factor explaining
orchestra institutionalization, which would in turn account for variations
in innovation (Dowd et al. 2002), one can expect these differences to decrease
when controlling for differences in performance capability between orchestras.


Hypothesis 4: Innovation is positively associated with 
institutionalization of nonprofit orchestras, which is, in 
turn, related to urban elite cohesiveness. Orchestras located 
in New York and western cities innovate less than other 
orchestras. 


Variation in Chances of Success of Innovations 


Institutionalization of Innovation and Competition 


The sociology of culture has recently focused on ecological explanations of cultural 
phenomena within autonomous fields (Kaufman 2004). One particular ecological 
mechanism can explain the selection of cultural innovations: density dependence. 
The organizational ecology literature has analyzed the ways in which an organiza- 
tional environment can affect survival chances of organizations by two sequential 
processes: institutionalization and competition (Hannan and Freeman 1988; Carroll 
and Hannan 1989, 2000). In the first phase, an increasing number of organiza- 
tions in a population makes success of new entrants more likely as a higher density 
is indicative of a process of institutionalization, in which an organizational form
becomes legitimized and taken for granted. In the second phase, higher density leads
to excessive competition between organizations, which increases chances of failure. 

This density-dependence model was originally applied to the processes of
selection of organizations, but the underlying ecological processes—institution- 
alization and competition—can explain the selection of cultural objects as well 
(Dowd 2004; Schulz 1998). However, this model refers to a case in which the
same actors legitimate an organizational form and compete against each other. 
The model is therefore captured by a function with a single argument: the density 
of the population. In contrast, the success of new composers is more likely to be 
affected differently by the presence of new and established composers—works of 
new composers are usually thought of and labeled as part of distinct categories 
within repertoires (i.e., “new music,” “contemporary music” or “ancient music”). 
The density-dependence model has been disaggregated by categories of actors in 
recent empirical research (Lippmann 2008). My argument furthers this approach 
by showing that institutionalization and competition can be the product of dif- 
ferent population densities. 

For example, a high number of composers in the repertoire can indicate that 
the performance of classical music is institutionalized, but may not mean that 
new music is accepted as a legitimate cultural production; only an increasing 
density of new composers would indicate that new music is being institutional- 
ized. Conversely, new composers compete for space in the repertoire not only with 
other new composers, but with all other composers. In other words, variations in 
the success of new composers can be explained by processes of institutionalization 
and competition, but the actors accounting for these two processes are different: 
the extent to which novelty is legitimated can be related to the overall number 
of new composers introduced in a given year; but success of innovation may be 
affected by competition from composers already present in the repertoire—i.e., a 
high number of established composers in an overcrowded field. 


Hypothesis 5: Success of a new composer is positively 
associated with density of new composers as increasing 
resources for new music at the field level signal an 
institutionalization of innovation in the field. 


Hypothesis 6: Success of a new composer is negatively 
associated with density of established composers as an 
increasing number of composers already present in the 
repertoire signals overcrowding and competition in the field. 
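
Hypotheses 5 and 6 can be summarized in a stylized hazard equation. This is only a sketch under assumed notation; none of the symbols below appear in the original text. Here r_i(t) is the risk that composer i disappears from the repertoire t years after introduction, r_0(t) is a baseline risk, N is total population density, N^new is the number of new composers introduced around the time of introduction, and N^est is the number of established composers already in the repertoire. The first line is a common log-quadratic form of the single-density model; the second is one possible way to write the two-density argument made here.

```latex
\begin{aligned}
\text{single density:}\quad
  & r_i(t) = r_0(t)\,\exp\!\left(\theta_1 N + \theta_2 N^{2}\right),
  && \theta_1 < 0,\ \theta_2 > 0 \\
\text{two densities:}\quad
  & r_i(t) = r_0(t)\,\exp\!\left(\beta_{\text{new}} N^{\text{new}}
      + \beta_{\text{est}} N^{\text{est}}\right),
  && \beta_{\text{new}} < 0 \ (\text{H5}),\ \beta_{\text{est}} > 0 \ (\text{H6})
\end{aligned}
```

In this reading, a negative coefficient on the density of new composers lowers the risk of disappearance (legitimation), while a positive coefficient on the density of established composers raises it (competition).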


Orchestra and Musical Director Consecration and Legitimacy of Innovation 
Success of new composers can be understood as the result of a diffusion process in 
which orchestras adopt composers introduced by other orchestras. Studies of diffusion
have shown that the social position (status, prestige) of innovators (orchestra
and musical director) affects chances of adoption (Rogers 1995; Wejnert 2002).

This effect is likely to be particularly strong in the case of concert programming 
as orchestras and musical directors are subject to a high level of goal ambigu- 
ity. They need to satisfy the donors’ demands and the tastes of a diverse public 
(Arian 1971), and allow musicians to demonstrate their virtuosity through the 
performance of canonical works while leaving room for new pieces (Gilmore 
1993). According to neo-institutional theory, goal ambiguity leads organizations 
to mimic decisions of actors seen as powerful and successful in an organizational 
field (DiMaggio and Powell 1983). One can expect innovations originating from 
dominant field actors (orchestras and directors) to be considered as more legiti- 
mate than innovations from other orchestras. 

This hypothesis is consistent with historical evidence of the diffusion of pro- 
gramming choices from established orchestras towards newer ones. Composers 
were aware of the legitimating effect of a first performance by a prestigious or- 
chestra and a star conductor. For example, Aaron Copland (1944:256) describes 
in these terms the role of Sergei Koussevitzky, who directed the Boston Symphony 
Orchestra from 1924 to 1949, after a successful career in Russia and France: 


“The submitting of a new work to Koussevitzky is always 
something of an ordeal for a composer. ... If he likes a
composition he generally likes it whole-heartedly, and the 
composer leaves his presence walking on thin air. (After all, it 
means a performance by the Boston Symphony Orchestra!) 
If he doesn’t like it, it means that other conductors may 
perform it, but the special atmosphere that surrounds a 
Koussevitzky premiere will be lacking.” 


One of the constitutive elements of this “special atmosphere” seems to be re- 
lated to the exceptional critical notice that only prominent orchestras and star 
conductors were able to attract. For example, Edgard Varèse’s visibility had been
limited to New York avant-gardist circles until Leopold Stokowski decided to
perform his composition Hyperprism with the Philadelphia Orchestra in 1924.
While the piece had been conducted less than a year earlier by Varèse himself with
the International Composers’ Guild in New York, critics only acclaimed it after
Stokowski’s performance, which established Varèse as the icon of ultramodernist
music in the 1920s (Oja 2000). 

In other words, renowned orchestras and conductors may have been able 
to transfer their symbolic capital onto the new composers whose works they 
premiered, thus increasing their visibility and their chances of diffusion.


Hypothesis 7: Success of a new composer is positively
associated with level of consecration of the orchestra and 
the musical director that introduced the composer to the 
repertoire. 


Cumulative Advantage 

Studies of trajectories of artists have analyzed inequality-generating processes 
within cultural industries as the result of a “Matthew effect” (Merton 1968). For 
example, research on careers of Hollywood screenwriters (Bielby and Bielby 1996) 
and film actors (Lincoln and Allen 2004) suggests that inequalities in career op- 
portunities can be the product of a cumulative advantage process through which 
small initial differences between artists become magnified throughout their careers. 
Similarly, Robert Faulkner (1983) has shown that composers in the film industry 
are more likely to be hired frequently if they have been able to develop and main- 
tain networks through repeated collaboration, which is a function of their past 
success in the field (see also Faulkner and Anderson 1987). 

A similar mechanism may help understand the concentration of orchestra reper- 
toires by explaining differences in composer trajectories. Variations in conditions of 
introduction can produce initial inequalities in new composers’ chances of success 
(as Hypotheses 5, 6 and 7 suggest). If a composer enjoys early success and becomes
accepted, he or she is likely to become even more legitimate and get dispropor- 
tionately more performances. This process can result from successive reconfigura- 
tions of audience taste and redefinitions of genre boundaries whereby the entry of 
contemporary composers in the repertoire and the rediscovery of old composers 
transform the definition of “classical music” (Mueller 1951). Rao et al. (2005) show
that high-status actors are able to redefine genre boundaries by initiating diffusion 
processes through which the penalties associated with transgressing existing categori- 
cal boundaries diminish. Critics eventually validate new genre definitions produced 
by high-status innovators. If this mechanism applies to concert programming, one 
can expect initial differences in context of introduction to the repertoire to lead to 
increasing differences in composer popularity throughout the composers’ careers. 


Hypothesis 8: Conditions of introduction lead to initial 
differences of success. Composer popularity results from 
the magnification of these differential initial chances as
time since introduction passes.


Data and Models 


Repertoire Dataset 


I use the data collected by John and Kate Mueller and revisited by Hart (1973), 
Gilmore (1993) and, more recently, Dowd et al. (2002) on 27 of the 30 symphony
orchestras in the United States that were considered “major orchestras” by
the American Symphony Orchestra League in 1972 (Mueller 1973; CPANDA 
2008). The dataset includes the complete lists of all the works performed by each 
orchestra from its founding until 1969, from the oldest (New York Philharmonic 
in 1842) to the most recent (Milwaukee Symphony Orchestra in 1960). 

The orchestras included in the dataset constitute a limited subset of the broader 
population of symphony orchestras in the United States. Still, small orchestras 
generally imitate the repertoires of high-profile orchestras although they may 
tend to perform difficult contemporary pieces less frequently (Hart 1973). These 
orchestras constitute a meaningful sample to the extent that their repertoires are 
considered as models and reflect most of the innovation going on in the field. 
Both critiques of the lack of diversity in the repertoire (Horowitz 1987, 2005) 
and analyses of changes in concert programming (Hart 1973; Dowd et al. 2002) 
base their empirical assessment of the musical canon in the United States on the 
repertoires of these orchestras. 

The following analyses cover changes in orchestra-level innovation and the 
trajectories of composers first played from 1879 to 1959 rather than the entire
period of the dataset for two reasons. First, the study of an organizational field 
is only possible when multiple orchestras are present—i.e., after 1878, when the 
New York Symphony Orchestra was founded. Second, composer longevity in the
repertoire is unknown (right-censored) for composers introduced after 1959. For 
consistency, I limited all the analyses to composers introduced from 1879 to 1959;
however, the results are unchanged if innovation and performances are analyzed
over the extended period, 1879-1969. These data do not help evaluate more 
recent and potentially more successful attempts at deconstructing the canon since 
the late 1960s, a period marked by the emergence of minimalism as an original 
American genre (Fink 2005). Still, the historical debate over the timing of and 
the factors leading to the deconstruction of the canon focuses precisely on the 
nature of the changes in the musical repertoire in the first half of the 20th century.


Dependent Variables and Models 


Innovation 

In the innovation model, I take as a dependent variable the number of new com- 
posers—whose works have never been played before by any of the 27 orchestras in 
the dataset—introduced by each orchestra for each year. 
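
As an illustration of how this dependent variable could be built from performance records, here is a minimal Python sketch. The record layout (year, orchestra, composer), the function name and the toy data are assumptions made for the example, not the structure of the Mueller dataset. Crediting every orchestra that plays a composer in the composer's first year is also an assumption about how ties are handled.

```python
from collections import Counter

def count_new_composers(performances):
    """Count first-ever composer introductions per (orchestra, year).

    `performances` is an iterable of (year, orchestra, composer) tuples.
    A composer is new only in the earliest year in which any orchestra
    plays them; every orchestra playing them in that year is credited.
    """
    performances = list(performances)  # allow two passes over the data

    # First pass: earliest year each composer appears anywhere.
    first_year = {}
    for year, _orchestra, composer in performances:
        if composer not in first_year or year < first_year[composer]:
            first_year[composer] = year

    # Second pass: keep one credit per orchestra, year and composer.
    credited = set()
    for year, orchestra, composer in performances:
        if year == first_year[composer]:
            credited.add((orchestra, year, composer))

    return Counter((orchestra, year) for orchestra, year, _ in credited)

# Toy records, invented for illustration only.
records = [
    (1900, "Boston", "Composer A"),
    (1900, "Boston", "Composer B"),
    (1901, "Chicago", "Composer A"),  # not new: already played in 1900
    (1901, "Chicago", "Composer C"),
    (1901, "Boston", "Composer C"),   # same first year: both credited
]
counts = count_new_composers(records)
```

Orchestra-years with no introductions simply do not appear in the result; a `Counter` returns 0 for them when looked up.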

I use a Poisson regression to model the number of new composers as a function 
of independent variables. An additional complication comes from the fact that the 
observations cannot be treated as independent across years and orchestras: in par- 
ticular, one of the major sources of structural inertia of repertoires comes from the 
role of the musical directors who decide which works to include in the program. 
To account for this clustering, I use a random effect model at the musical director 
level rather than fixed effects. While fixed effects could be used to obtain unbiased
point estimates (Halaby 2004), this solution would require a substantial loss of
information since for a large number of musical directors (53 out of 143), no 
variation in innovation is observable during their tenures. Discarding these cases 
would amount to selecting observations on the dependent variable by eliminating 
from the analysis all the cases for which musical directors had short tenures or 
never introduced new composers, which would bias the results. 

The model assumes that musical directors are characterized by varying levels of 
taste for novelty or capacity to innovate that potentially affect the innovativeness 
of their orchestras for the duration of their tenures.8
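
One way to write down a model of this kind is given below. It is a sketch under assumed notation rather than the exact specification estimated here: y_{ojt} is the number of new composers introduced by orchestra o under musical director j in year t, x_{ojt} is the vector of independent variables, and u_j is the director-level random effect. The normal distribution for u_j is an assumption; other distributions (for example gamma) are also used for random-effects Poisson models.

```latex
\begin{aligned}
y_{ojt} \mid u_j &\sim \mathrm{Poisson}(\mu_{ojt}), \\
\log \mu_{ojt} &= \mathbf{x}_{ojt}^{\top}\boldsymbol{\beta} + u_j, \\
u_j &\sim \mathcal{N}(0, \sigma_u^{2}).
\end{aligned}
```

Under this form, exp(u_j) scales the expected number of introductions for every season of director j's tenure, which corresponds to the idea of a director-specific taste for novelty.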


Success of Innovation 

I measure the chances of success of innovation by two characteristics of composer tra- 
jectories: composer longevity and popularity in the repertoire (measured by the annual 
share of the total number of performances). These measures focus on composers’ repu- 
tational careers in the field (as revealed by their performance), which differs from the 
multiple ways in which artists’ careers have been conceptualized in the literature (on 
labor-market careers, see Bielby and Bielby 1996; Lincoln and Allen 2004; Faulkner 
and Anderson 1987; on artist critical notice and reception, see Giuffre 1999, 2001). 


Composer Longevity 

I define longevity as the duration of the period following the first performance 
during which a composer enjoys at least one performance every 10 years. In other 
words, a composer is considered present in the repertoire as long as his or her works are
still being played at least once every 10 years by any orchestra (see composers 1, 
2 and 3 in Figure 2), but if performances are separated by a gap of more than 
10 years (see composer 4), later performances are not included. However, such 
cases of “reappearance” in the repertoire after more than 10 years of absence are 
rare and account for about 2 percent of the total number of performances in the 
dataset. Because the records of the repertoire dataset end in 1969, the longevities 
of composers still considered present in the repertoire from 1960 to 1969 are 
right-censored (see composer 5 in Figure 2).9
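
The longevity rule can be sketched in a few lines of Python. The function name, the argument names and the convention of measuring duration as last year minus first year are assumptions made for illustration; only the 10-year gap rule and the 1969 end of the records come from the description above.

```python
def longevity(years, max_gap=10, last_record_year=1969):
    """Return (duration, censored) for one composer.

    `years` is a non-empty list of years in which any orchestra performed
    the composer. The spell ends at the last performance before the first
    gap longer than `max_gap` years. A spell whose last performance falls
    in the final `max_gap` years of the records is right-censored.
    """
    ordered = sorted(set(years))
    first = last = ordered[0]
    for year in ordered[1:]:
        if year - last > max_gap:
            break  # later "reappearances" are not counted
        last = year
    censored = last_record_year - last < max_gap
    return last - first, censored

# Toy performance histories, invented for illustration only.
print(longevity([1900, 1905, 1912, 1930]))  # 1930 follows an 18-year gap
print(longevity([1950, 1958, 1965]))        # still present in the 1960s
print(longevity([1920]))                    # played in a single year
```

With the defaults, a last performance in 1960 or later marks the spell as censored, which matches the 1960 to 1969 window described above.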

The analysis of the chances of survival illustrates the high level of selection 
in the field of classical music (Figure 3). Composers have a 10 percent mean 
probability of being played continuously for more than 70 years. But selection is 
particularly important in the very first years. Nearly half of the new composers 
drop out of the repertoire one year after their first year of performance; and more 
than three quarters disappear in the first 20 years after their first performance. 
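
The survival chances reported in Figure 3 are Kaplan-Meier (product-limit) estimates. A minimal sketch of that estimator in plain Python follows; the function name and the toy durations are invented for illustration and do not reproduce the figures reported here.

```python
def kaplan_meier(durations, events):
    """Return [(t, S(t))] at each distinct time with at least one exit.

    `durations` are observed times since first performance; `events[i]`
    is True when composer i was seen to drop out of the repertoire and
    False when the observation is right-censored.
    """
    pairs = sorted(zip(durations, events))
    at_risk = len(pairs)
    survival = 1.0
    curve = []
    i = 0
    while i < len(pairs):
        t = pairs[i][0]
        exits = removed = 0
        # Group all observations that share the same time t.
        while i < len(pairs) and pairs[i][0] == t:
            exits += pairs[i][1]
            removed += 1
            i += 1
        if exits:
            survival *= 1 - exits / at_risk
            curve.append((t, survival))
        at_risk -= removed  # exits and censored cases leave the risk set
    return curve

# Toy data: six composers, two of them right-censored.
curve = kaplan_meier([1, 1, 2, 5, 5, 8],
                     [True, True, False, True, False, True])
```

Censored composers contribute to the risk set up to their last observed year but never count as exits, which is why right-censoring in the 1960s does not bias the curve downward.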

To analyze the factors affecting composer longevity, I use a piecewise exponen- 
tial hazard model to estimate the risk that each new composer might disappear at 
any point after his or her introduction in the repertoire. This hazard is a function 
of the time since the first performance and is assumed to be constant on intervals 
of duration (0 to 1 year, 2 to 5 years, 6 to 10 years, 11 to 25 years, 26 to 50 years,
and more than 50 years). The independent variables included in the model can
increase or decrease this baseline risk of disappearance. Additionally, because the 
data are clustered by musical director, I use a random effect at the director level 
as some musical directors may be more likely to introduce composers that will 
disappear or last in the repertoire.10
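
The baseline part of a piecewise exponential model can be sketched as follows. This example leaves out the covariates and the director-level random effect, treats time as continuous with cut points at 1, 5, 10, 25 and 50 years (an approximation of the intervals listed above), and assumes strictly positive durations. All names and the toy data are hypothetical.

```python
import math

# Assumed cut points, in years since first performance.
CUTS = [0, 1, 5, 10, 25, 50, math.inf]

def split_spell(duration, event):
    """Split one spell into (interval_index, exposure, exit_in_interval)."""
    pieces = []
    for k in range(len(CUTS) - 1):
        lo, hi = CUTS[k], CUTS[k + 1]
        if duration <= lo:
            break
        exposure = min(duration, hi) - lo
        ended_here = event and duration <= hi
        pieces.append((k, exposure, ended_here))
    return pieces

def baseline_hazards(spells):
    """Maximum-likelihood hazard per interval without covariates:
    number of exits divided by total time at risk in that interval."""
    exits = [0] * (len(CUTS) - 1)
    exposure = [0.0] * (len(CUTS) - 1)
    for duration, event in spells:
        for k, time_at_risk, ended_here in split_spell(duration, event):
            exits[k] += ended_here
            exposure[k] += time_at_risk
    return [d / e if e > 0 else None for d, e in zip(exits, exposure)]

# Toy spells (duration in years, True if the composer disappeared).
spells = [(0.5, True), (3, True), (3, False), (12, True), (60, False)]
hazards = baseline_hazards(spells)
```

In a full model, independent variables would multiply these interval-specific baseline hazards. The split records produced by `split_spell` are also the usual input for fitting such a model as a Poisson regression with an exposure offset.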


Composer Popularity in Repertoire 
I use a second measure of success: the share of each composer's performances for 
any given year following his or her introduction to the repertoire. This measure 
allows me to check the robustness of the longevity model findings and, because it 
accounts for composer popularity rather than simply presence in the repertoire, 
to test the cumulative advantage hypothesis. 
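As an illustration of this second measure, the share of a composer in a given year can be computed as that composer's performances divided by all performances recorded that year. The sketch below assumes a hypothetical record layout of (year, composer, number of performances); the names and figures are invented for illustration and do not come from the dataset.

```python
from collections import defaultdict

def performance_shares(records):
    """records: iterable of (year, composer, n_performances) tuples.

    Returns {(year, composer): share of that year's total performances}.
    """
    totals = defaultdict(int)
    counts = defaultdict(int)
    for year, composer, n in records:
        totals[year] += n
        counts[(year, composer)] += n
    return {
        key: (n / totals[key[0]] if totals[key[0]] else 0.0)
        for key, n in counts.items()
    }

# Hypothetical records, invented for illustration only.
toy_records = [
    (1900, "Composer A", 30),
    (1900, "Composer B", 10),
    (1901, "Composer A", 25),
    (1901, "Composer B", 0),
    (1901, "Composer C", 25),
]
shares = performance_shares(toy_records)
```

A composer with zero performances in a year (Composer B in 1901) is still present in the output with a share of zero, which is what distinguishes this popularity measure from the presence/absence logic of the longevity measure.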

I use a Poisson model with composer-level random effects to analyze the effect 
of context of introduction on composers’ careers.¹¹ The random effects capture 
any unobserved composer-level characteristics likely to affect composer 
popularity in the repertoire.¹² 

Figure 3. Survival Chances of New Composers after Their Introduction 
[Y-axis: Composer Chances of Surviving in Repertoire; x-axis: Years Since First Performance] 

N = 1,494 composers introduced from 1879 to 1959; Kaplan-Meier estimates. 

Note: Composers surviving 80 years after first performance (right-censored longevity): 
Georges Bizet, Alexander Borodin, Anton Bruckner, George Chadwick, Frédéric Chopin, 
Edouard Lalo, Bedrich Smetana, Richard Strauss. 


Independent Variables 


The independent variables (Table 1) used were obtained from several sources. 
Variables related to orchestra repertoires (total annual number of new compos- 
ers, annual number of composers present in orchestra repertoires) were derived 
from the Mueller dataset. 

Lists of musical directors for each orchestra were obtained from the directory 
of American Symphony Orchestras compiled by Craven (1986). Biographical 
data on musical directors were compiled using the Grove Dictionary of Music 
and Musicians (2001) and Baker’s Biographical Dictionary of Musicians (2001). 
These variables include country of origin and musical director tenures at any of 
14 prestigious European orchestras and operas (Vienna Philharmonic, Vienna 
Symphony, Vienna State Opera, Berlin Philharmonic, Berlin State Opera, 
Bavarian State Opera, London Symphony, Hallé Orchestra, Royal Philharmonic, 
Royal Concertgebouw, Orchestre National de France, Orchestre de Paris, La 
Scala and Saint Petersburg Philharmonic). Thus, for each year, previous careers of 
musical directors in major orchestras both within and outside of the United States 
could be reconstituted. In some seasons, orchestras had several unspecified guest 
conductors or interim musical directors. In these cases, orchestras were coded as 
having distinct directors for each season; biographical data on directors were miss- 
ing and a dummy variable was added in the models (37 cases out of 143 directors). 
City population time series were obtained from decennial census estimates¹³ 
(Gibson 1998), and orchestra location was coded using U.S. Census regions, with 
the exception of New York City orchestras, for which a distinct category was created. 
Finally, I use controls for musical director age and current tenure (in number 
of years since initial appointment with current orchestra) to isolate the effects 
of the director-level independent variables of interest. Orchestra performance 
capabilities (the total number of performances of an orchestra in a given year) 
are also added to the models. In the innovation model, performance capabilities 
can be a mediating factor through which other orchestra characteristics may af- 
fect innovation. In the composer longevity and popularity models, performance 
capabilities are used to identify the effect of orchestra consecration after holding 
constant orchestra productivity. 

Table 1: Descriptive Statistics, Independent Variables 
                                                                 Mean           SD   Min   Max
Field-Level Variables 
  Number of new composers (all orchestras)                      23.19         8.70     2    42
  Number of composers played (all orchestras)                  191.91        58.52    19   267
Orchestra-Level Variables 
  Founding year                                                  1906        26.47  1842  1950
  City population (in thousands)                                 1433         1644   102  7891
  Region: Northeast (except New York)                     [illegible]  [illegible]     0     1
  Region: Midwest                                                 .36          .48     0     1
  Region: South                                           [illegible]          .34     0     1
  Region: West                                                    .16          .36     0     1
  Region: New York                                                .13          .33     0     1
  Performance capability (number of performances in year)       70.73        31.30     1   166
  Number of new composers                                        1.70         1.78     0    11
Musical Director-Level Variables 
  Missing data (interim guest conductors)                         .06          .24     0     1
  Origin United States                                            .18          .39     0     1
  Previous appointments United States                     [illegible]          .43     0     1
  Previous appointments outside United States             [illegible]  [illegible]     0     1
  Age 50+                                                 [illegible]          .50     0     1

N = 986 orchestra x years 
Figure 4. Descriptive Statistics: Repertoires, Innovation and Orchestras 

Results 


Early Deconstruction or Institutionalization of the Canon 

Consistent with the “early deconstruction” account of the evolution of the field, 
the annual number of “new” composers (those whose works were played for the first 
time) has grown since the late 19th century, as has the total number of com- 
posers played (Figure 4a,b). In the meantime, however, individual orchestras have 
become less open to innovation as the mean number of new composers introduced 
by each orchestra has decreased (Figure 4c). The variance in orchestra innovation 
has also tended to decrease especially since the 1920s, indicating that orchestras 
have become similarly less likely to perform new composers over time (Figure 4d). 
In other words, the orchestra-level approach to innovation reveals that the trend 
cannot be characterized simply as an early and gradual deconstruction of the 
canon because orchestras have become less innovative over time. Rather, the in- 
crease in the annual number of new composers is the result of the creation of new 
orchestras throughout this period¹⁴ (Figure 4e). 

This result can be interpreted in two ways. Orchestras and musical directors 
may have become increasingly conservative over time as they decided to discard 
innovation and focus on the performance of canonical composers. Alternatively, 
it is possible that orchestras and musical directors simply saw less of a necessity 
to include new composers in their repertoires: as the number of orchestras was 
increasing, more venues were able to perform works of new composers. In both 
cases, however, innovation became a lower priority for individual orchestras. 

Moreover, the evolution of rates of early failure of innovation indicates that an 
increasing proportion of new composers disappeared from the repertoire immedi- 
ately after initial performances (Figure 4f). In other words, each orchestra became 
less innovative, and innovations became more likely to fail. These changes indicate 
that the “early deconstruction” account accurately describes the proliferation of 
new composers at the field level, but does not apply to the level of individual 
orchestras. Far from becoming more distant from the musical canon, orchestra 
programming became less innovative from the late 19th century to the 1950s. 


Variations in Innovation 


Why do certain orchestras innovate more than others? Under different model specifi- 
cations, the number of new composers an orchestra introduces to its repertoire seems 
to be affected by the size of the city and the region where the orchestra is located, 
orchestra seniority and musical director consecration (although not in the direction 
predicted by field theory), and the mobilization of resources for new music (Table 2). 
Orchestra regional location seems to affect innovation in the direction pre- 
dicted by the elite cohesiveness hypothesis. Towards the beginning of the period, 
orchestras located in New York and the West, where urban elite cohesiveness 
was weak and orchestra institutionalization proved difficult, tended to innovate 
considerably less than other northeastern orchestras. However, this gap dimin- 
ished throughout the period. New York orchestras became as innovative as other 
northeastern orchestras when the New York upper class united around a single 
orchestra with the merger of the Philharmonic and the Symphony in the 1920s 
(DiMaggio 1987) and even more innovative in the 1950s. The effect disappeared 
more slowly in the case of West Coast orchestras (Figure 5). Model 3 shows that 
these regional differences are not as pronounced when controlling for performance 
capabilities, which confirms the idea that part of the effect of elite cohesiveness is 
mediated by orchestra capabilities. 

Figure 5. Evolution of Estimated Effect of Orchestra Regional Location on 
Innovation (Log Scale) 
[Two panels: Innovation of NYC Orchestras Relative to Northeastern Orchestras*; 
Innovation of Western Orchestras Relative to Northeastern Orchestras*. 
Y-axis: ratio (log scale); x-axis: year, 1879 to 1959.] 

Note: *Northeastern orchestras excluding NYC orchestras. 
For each year, dots represent the overall multiplicative effect of regional location on 
innovation relative to orchestras located in the Northeast (NYC excluded); i.e., regional 
effect = $e^{\beta_1 + \beta_2 (t - 1879)}$, where $\beta_1$ is the baseline region coefficient and $\beta_2$ is the region-year 
interaction coefficient in Model 3. Bars represent 95% confidence intervals. 
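The quantity plotted in Figure 5 is a multiplicative effect obtained by exponentiating a baseline region coefficient plus a region-year interaction. The sketch below shows that calculation under two assumptions that should be read as such: time is measured in years since 1879, and the coefficient values are hypothetical placeholders rather than estimates from Model 3.

```python
import math

def regional_ratio(baseline_coef, interaction_coef, year, base_year=1879):
    """Multiplicative effect of a region on innovation relative to the
    reference region, for a log-linear model with a region-year interaction."""
    return math.exp(baseline_coef + interaction_coef * (year - base_year))

# Hypothetical coefficients: a region that starts at half the reference
# level of innovation, with the gap closing over time.
b_region = math.log(0.5)
b_region_year = 0.01
ratios = {y: regional_ratio(b_region, b_region_year, y)
          for y in (1879, 1919, 1959)}
```

A ratio below 1 means fewer new composers than the reference region in that year; a ratio that rises through 1 over time corresponds to the pattern described for New York orchestras, which caught up with and then overtook other northeastern orchestras.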

Consistent with the autonomy hypothesis (Hypothesis 2b), orchestras lo- 
cated in larger cities tended to have more innovative programming in the be- 
ginning of the period (a 1 percent increase in population was associated with a 
.42 percent increase in number of new composers in 1879), although this effect 
progressively disappeared over time (Model 2). The effect seems partly medi- 
ated by performance capabilities because autonomous orchestras with a large 
potential audience can offer more performances per season. When the model 
accounts for differences in number of performances between orchestras, the 
effect of city population size on orchestra-level innovation decreases and 
remains significant only at the 10 percent level. 

The effect of orchestra consecration is significant, but not in the direction 
predicted by field theory: in any given year, newer orchestras are less likely to in- 
novate. Model 3 predicts that an orchestra founded in 1946 (such as the Atlanta 
Symphony) introduces on average 33 percent fewer new composers per year than 
an orchestra founded in 1881 (such as the Boston Symphony). Similarly, one 
measure of musical director consecration (previous appointments in other U.S. 
orchestras) is positively related to orchestra innovation, although this effect is only 
significant at the 10 percent level after controlling for performance capabilities 
(which seems to be related to a self-selection process: high-status conductors are 
more likely to be appointed by orchestras with high performance capabilities). In other 
words, Hypotheses 2a and 3 are rejected. Consecrated players in the field are more 
likely to feature works of new composers, while recently founded orchestras and 
musical directors with limited experience tend to favor less risky programming 
and stick to already-known composers. 

Finally, orchestra-level resources for the performance of new music significantly 
increase innovation according to Models 1 and 2, but this effect disappears once 
the effect of orchestra performance capabilities is controlled for (Model 3): orches- 
tras with large resources for the performance of new music are also orchestras with 
enough space in their repertoire that they can fill with new composers. 


Variations in Success of Innovation 


Composer Longevity 
The longevity model analyzes the factors affecting the risk that a composer might 
disappear from the repertoire in the years following his or her first performance 
(Table 3). While all composers see their risk of disappearing from the repertoire 
drop dramatically as time since the first performance passes (the risk of disappear- 
ing 50 years after the first performance is about 35 times lower than immediately 
after the introduction), composers first played in certain contexts are more likely to 
last. Consistent with the ecological hypothesis, high resources for new music at the 
time of introduction increase composer longevity. Each additional new composer 
played at the time of the first performance decreases the risk of disappearance by 
1.3 percent. Longevity also increases when fewer composers are already present in 
the repertoire: each additional composer played by the orchestras at the time of first 
performance increases the hazard of disappearance by .9 percent¹⁵ (Model 3). The 
number of new composers generally tended to covary with the number of composers 
present in the repertoire until 1918, but became more disconnected in later periods. 
For example, the post-1945 period appears as a particularly unfavorable ecological 
context, with high numbers of composers present in the repertoire, but lower num- 
bers of new composers than in previous decades (Figure 1). 
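Because the hazard model is log-linear, the per-unit percentage effects reported above compound multiplicatively rather than add up. The sketch below illustrates this with the two reported effects (a 1.3 percent decrease per additional new composer and a .9 percent increase per additional composer already present). The function name and the choice of 10 and 50 additional composers are hypothetical, picked only to show the arithmetic.

```python
def hazard_multiplier(pct_change_per_unit, units):
    """Compound a per-unit percentage change in the hazard over several units."""
    return (1.0 + pct_change_per_unit / 100.0) ** units

# Per-unit effects reported in the text: -1.3 percent per additional new
# composer, +0.9 percent per additional composer present in the repertoire.
# The unit counts below are illustrative assumptions.
ten_more_new = hazard_multiplier(-1.3, 10)
fifty_more_present = hazard_multiplier(0.9, 50)
combined = ten_more_new * fifty_more_present
```

Under these illustrative counts the crowding effect outweighs the benefit of a larger entering cohort, so the combined multiplier exceeds 1. That is the kind of unfavorable ecological context the text attributes to the post-1945 period.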

Composer survival chances are not significantly improved by orchestra con- 
secration (measured by its seniority in the field), but a dominant position of the 
musical director in the field increases composer longevity strongly. When the 
musical director who introduces the composer has previ- 
ously held similar positions in major European orchestras, 
chances of disappearance drop about 24 percent, which 
is consistent with the innovation legitimacy hypothesis 
(7b). Coefficients on other measures of musical director 
status in the field (U.S. origin and previous appointments 
in other U.S. orchestras) have the expected signs, but are 
only significant at the 10 percent level. 

In other words, musical directors endowed with high 
levels of symbolic capital are more likely to see their 
innovations last. However, other indicators of musical 
director consecration do not affect composer survival 
chances significantly. 


Composer Popularity 

The last model analyzes the effect of context of introduction 
on the number of performances of each new composer as 
a proportion of the total number of performances for each 
year following introduction to the repertoire (Table 4). 

Model 1 confirms results of the longevity model. A high 
number of composers in the repertoire at the time of the 
first performance leads to overcrowding and competition, 
which diminishes new composers’ subsequent chances of 
success. Each additional composer decreases the share of 
performances by 1.5 percent. Conversely, a high number 
of new composers at the time of the introduction to the 
repertoire increases each new composer's share of perfor- 
mances after the first performance (4 percent). 

However, contrary to the longevity model results, all 
indicators of orchestra and musical director level of conse- 
cration in the field affect new composers’ shares in the rep- 
ertoire as Hypothesis 7 predicts: composers introduced 
by more recently founded orchestras, American-born and 
less experienced musical directors have significantly fewer 
performances. Innovators considered as legitimate in the 
field are more likely to legitimize new composers. 

While the longevity model relies on a minimal defini- 
tion of composer success (as some composers are consid- 
ered present in the repertoire even though their works 
remain marginal in terms of total number of perfor- 
mances), the share of the total number of performances 
is an indicator of the frequency with which works of a 
given composer are played every year. This allows us to test for the cumulative 
advantage hypothesis. Models 2 and 3 include interaction terms that estimate the 
change in the effects of ecological context of introduction and orchestra/musical 
director position in the field after the first performance. The model shows that 
conditions of introduction see their effect on composer share of the repertoire 
magnified after the year of the first performance, whether they affect composer 
popularity positively (high number of new composers, musical director with 
previous appointments in other major American orchestras) or negatively (high 
number of composers, recent orchestra, U.S.-born musical director). Initial 
conditions of introduction are therefore critical to composers’ success long after 
their first performance, which is consistent with the cumulative advantage hypothesis. 

[Table fragment (final rows and notes): Time x Musical director: appointments 
United States (/100); Intercept. Number of observations (composer-years): 59,969 
in each of the three models. Number of composers: 1,494 in each of the three models. 
Notes: t-ratios (beta/s.e.) are shown in parentheses next to coefficient estimates. 
Significance levels: *p < .05 **p < .01 ***p < .001] 


Discussion and Conclusion 


This research argues for an integration of field theory 
with approaches to innovation and change derived from 
organizational theory. Emirbayer and Johnson (2008) 
have criticized organizational theory's selective adoption 
and incomplete grasp of Bourdieu’s conceptual frame- 
work, arguing instead for a more systematic use of field 
theory in organizational analysis. This study of reper- 
toire stability and change contributes to this dialogue be- 
tween field theory and organizational analysis, although 
it takes Emirbayer and Johnson’s question in reverse by 
suggesting that analyses of the dynamics of fields can 
benefit from the contributions of organizational theory. 

More precisely, if actors in fields of cultural produc- 
tion strive to impose, maintain or transform categorical 
distinctions between genres and types of art (Bourdieu 
1993), this cognitive framing is often produced and re- 
produced by organizations and individuals acting within 
organizational contexts. In other words, the categorical 
boundaries at the center of the struggle for the imposition 
of a legitimate definition of art in a field are produced by 
and inscribed in organizations trying to maintain or mod- 
ify them. The effect of the field (i.e., the tight coupling 
between the relative positions of actors in the field, their 
position-takings and the prevailing doxa) can therefore be 
mediated by organizational processes shaping the cultural order of the field. 

Still, in his attempt to deconstruct the myth of the artist as an “uncreated 
creator” removed from social constraints (1996:188), Bourdieu has focused more 
systematically on the relative positions of the artists themselves than on the orga- 
nizational logics likely to explain stability and change in fields of cultural 
production. Similarly, empirical studies extending field theory through the analy- 
sis of artist social networks have minimized the role of organizational processes 
at work in the production of culture (Anheier et al. 1995; Giuffre 1999, 2001). 
This paper highlights four of these organizational processes which can be used to 
explain innovation and success of innovation in other fields of cultural production. 


Organizational Symbolic Capital and Innovation 


Organizations are endowed with unequal levels of symbolic capital, which can 
facilitate innovation and boundary transgression. This research shows that old 
orchestras are more likely to perform works of new composers than recently 
founded orchestras. This result seems to contradict a central tenet of field theory, 
which would predict that newcomers are more likely than established actors to 
innovate, favor avant-gardist art and challenge existing norms. This seemingly 
inverted structure of position-takings can be understood from an organizational 
perspective as a product of the orientation of the field towards the reproduction of 
a canon defined by strict categorical boundaries. In this perspective, the potential 
economic and symbolic costs (e.g., lost revenue, negative reviews from critics) 
associated with deviation from the canon are so high that only organizations 
endowed with high levels of symbolic capital are able to transgress these categori- 
cal boundaries and mix new and established composers in their repertoires. This 
interpretation is consistent with recent research on the effect of institutionalized 
categorical boundaries on organizational conformism: organizations face sanctions 
when their production crosses established boundaries and becomes more difficult 
to locate within prevailing categorization systems (Zuckerman and Kim 2003). 
Only high-status actors can take such risks without endangering their legitimacy 
(Rao et al. 2005). Prestigious orchestras and star conductors were therefore more 
able to mix canonical pieces by Brahms or Beethoven (easily identifiable in exist- 
ing systems of classification) with new and unknown works in their programs. 
Further research on other fields where artistic canons exert strong inertial pressures 
on programming (such as opera or dance) may help test this hypothesis further. 


Field Autonomy and Organizational Environment 


The autonomy of a field can be analyzed as the extent to which organizations 
located within a field are constrained by their environment. Research on the 
diffusion of the non-profit organizational form has shown that its adoption was 
contingent on the presence of a class of “cultural entrepreneurs” in the local envi- 
ronment (DiMaggio 1982a). Similarly, innovation in concert programming was 
initially significantly constrained by regional contexts. However, the waning effect 
of regional location and population size on innovative concert programming is 
an important indicator of the increasing autonomy of the field over time. In this 
sense, variations in the extent to which organizations are affected by or protected 
from changes in their environments can be related to a fundamental property of 
a field: its level of relative autonomy. 


Ecology of Fields and Cultural Change 


The trajectories of cultural products can be related to ecological pressures within 
the field. Bourdieu (1993) has highlighted the fact that artists’ careers can be con- 
strained by the demography of the field as actors compete for more or less scarce 
resources—although without systematically theorizing the mechanisms through 
which these demographic configurations may favor or hinder cultural change. This 
article illustrates the fact that hypotheses derived from the ecological approach to 
organizations can help analyze these mechanisms. 


Diffusion and Cumulative Advantage Processes 


Finally, one of the central findings of this research relates the success of new 
composers to the conditions of the field at the time of their introduction to the 
repertoire through a cumulative advantage process. This can be explained by neo- 
institutional theory: in a context of uncertainty, dominant actors are more likely 
to be watched and imitated by other actors. High-status innovators can therefore 
transfer their legitimacy to the composers they introduce. 

This last result has an important methodological implication as it highlights the 
importance of adopting a diachronic (rather than synchronic) approach to the study 
of fields. For example, the capacity of dominant actors to legitimate novelty in the 
field has long-lasting effects and, in the case of individual actors, survives beyond 
their biographical careers. Research on the evolution of other fields should investi- 
gate the explanatory power of cumulative advantage processes and evaluate their role 
in the emergence of innovation, conformism and diversity in cultural production. 


Notes 


1. These composers include Wagner, Beethoven, Brahms, Mozart, Tchaikovsky, Strauss, 
Bach, Berlioz, Debussy, Ravel, Schumann, Schubert and Mendelssohn. 


2. “Resurrected” composers constitute a minority of the total number of new composers 
introduced to the repertoire from 1879 to 1959: less than 10 percent were born 
before 1800. 

3. For example, see Smith (1990) on the role of Leopold Stokowski at the 
Philadelphia Orchestra and Copland (1944) on Serge Koussevitzky’s transformation 
of the Boston Symphony Orchestra. 
4. Among the “Big Five,” only the Cleveland Orchestra was founded after 1900. 


5. For example, the first American-born musical director of a “Big Five” orchestra, 
Leonard Bernstein, was appointed in 1958 (Lebrecht 1991) — whereas other orchestras 
(San Francisco, Seattle, Cincinnati) had been directed by American-born conductors 
since the beginning of the century; of the 9 conductors featured on Time Magazine 
covers between 1926 and 1959, only Bernstein (1957 cover) was not born in Europe. 


6. For instance, the San Francisco Symphony Orchestra nearly disappeared during the 
Great Depression (the 1934-35 season was cancelled) and was only rescued through 
the assistance of a municipal subsidy (Hart 1973). The Los Angeles Philharmonic 
relied on an isolated and unreliable donor, William Andrews Clark, who did not leave 
an endowment before his death in 1934 (Horowitz 2005). 


7. According to this classification, orchestras with operating budgets exceeding $500,000 
were considered as major. Only three orchestras in this category are missing: the 
Honolulu Symphony, the San Antonio Symphony and the American Symphony. 


8. The number of new composers $n_{it}$ played by orchestra $i$ in year $t$ is modeled as 
$\log n_{it} = x'_{it} b + u_{d_{it}}$, where $x'_{it}$ is a vector of independent variables and $d_{it}$ is the 
musical director for each orchestra/year. The director random effect can be written as 
$u_d = \log a_d$, where $a_d$ has a gamma distribution with mean 1 and variance $\sigma^2$ 
estimated by the model. 


9. The findings presented in the next section are robust and could be replicated using 
more conservative (at least one performance in a five-year interval) or more generous 
(at least one performance in a 20-year interval) measures of composer longevity. 


10. Formally, the risk of disappearance $\lambda$ for composer $c$ after $k$ years in the repertoire is 
modeled as $\log \lambda_c(k) = \log \lambda_j + x'_c \gamma + v_d$, for $k \in [t_j, t_{j+1}]$, where $\lambda_j$ is the 
baseline hazard, $x_c$ is a vector of covariates and $v_d$ is the director-level random effect. 
The random effect can be written as $v_d = \log \theta_d$, where $\theta_d$ has a Gamma distribution 
with mean 1 and variance $\sigma^2_\theta$ estimated by the model. 


11. The model can be written as $\log N_{ct} = \log P_t + x'_{ct} \beta + e_c$, where $N_{ct}$ is total number of 
performances of works by composer $c$ across all orchestra repertoires in year $t$, $P_t$ 
represents the performance capabilities of all orchestras at time $t$, $x'_{ct}$ is a vector of 
covariates and $e_c$ is the composer-level random effect. The random effect can be 
written as $e_c = \log \eta_c$, where $\eta_c$ has a Gamma distribution with mean 1 and variance 
$\sigma^2_\eta$ estimated by the model. 


12. Composer-level control variables (contemporaneity, country of origin) did not prove 
significant. 


13. I used linear interpolation to approximate city population between U.S. Census years. 


14. This evolution is captured, in Dowd et al.’s model, by the rise in performance 
capabilities of orchestras. 


15. Adding quadratic terms on number of new composers and number of established 
composers does not improve model fit significantly. 
Declining Dixie: Regional Identification in the Modern 
American South 


Christopher A. Cooper, Western Carolina University 
H. Gibbs Knotts, Western Carolina University 


We replicate and extend John Shelton Reed’s classic work on regional identification by 
examining and modeling the prevalence of the words “Dixie” and “Southern” in business 
names across 100 cities and four decades. We find that the instances of “Dixie” have 
dropped precipitously, although identification with the word “Southern” has remained 
more constant, providing evidence of a trend we term re-southernization. We also find that 
the relative number of blacks in the population provides the most consistent explanation of 
regional identity. Population density has also emerged as a significant predictor of regional 
identification in more recent time periods. These findings contribute to the literature on 
regional identification, the politics of naming and the sociology of the South. 


The United States is increasingly homogenized. Strip malls, cookie-cutter housing 
developments and the rise of chain restaurants make it difficult to tell if you are 
in suburban Atlanta, suburban Dallas or suburban Cleveland. As a consequence, 
regional differences may also be fading. Is the South, once thought to be the most 
distinctive region in the country, still unique? Has the homogenization of America 
altered regional boundaries? If so, what types of places have experienced the largest 
declines in regional identification? 

One of the best ways to address these questions is to examine place names. 
The study of place names, or toponyms, represents a long tradition in the social 
sciences and provides an important window into regional culture and collective 
identity (Kearns and Berg 2002; Stewart 1958; Zelinksy 1980). According to 
Alderman (2008:197), “naming is a noteworthy cultural practice not only because 
of its ability to create a sense of continuity over time but also through its capacity 
for changing and challenging lines of identity.” This line of research has examined 
the placement (Alderman 2006; Light 2004; Rose-Redwood 2008) and economic 
impacts (Mitchelson, Alderman and Popke 2007) of street naming, the patriotic 
significance of county naming (Zelinsky 1988), and most importantly for our 
purposes, toponyms as a signal of shared cultural and regional identity (Alderman 
and Beavers 1999; Dwyer and Alderman 2008; Reed 1976; Reed, Kohls, and 
Hanchette 1990; Shortridge 1987; Zelinsky 1980). 

In this article, we examine the changing contours of shared regional identity in 
the South by replicating and extending John Shelton Reed’s classic work published 
in the pages of Social Forces (Reed 1976). Like Reed, we analyze the frequency of 
the words “Dixie” and “Southern” among businesses in 100 cities. In addition to 
updating previous work, we examine the correlates of regional identification to 
better understand why some places more strongly identify with the South. Our 
results provide a measure of regional identity that could be used to explain a host 
of social and political phenomena and provide a deeper understanding of the 
changing dynamics of regional identification. 
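As a rough illustration of the counting step behind this measure, the sketch below tallies how many business names in a city contain “Dixie” or “Southern.” It assumes a hypothetical list of business names per city and whole-word, case-insensitive matching; both the matching rule and the sample listings are assumptions made for illustration, and any normalization across cities is left out because it is not described in this passage.

```python
import re
from collections import Counter

TERMS = ("dixie", "southern")

def count_regional_terms(business_names):
    """Count business names containing each regional term as a whole word."""
    counts = Counter({term: 0 for term in TERMS})
    for name in business_names:
        words = set(re.findall(r"[a-z]+", name.lower()))
        for term in TERMS:
            if term in words:
                counts[term] += 1  # each business counts once per term
    return counts

# Hypothetical listings for one city, invented for illustration.
toy_listings = [
    "Dixie Dry Cleaners",
    "Southern Auto Parts",
    "Great Southern Bank",
    "Dixieland Donuts",      # not counted: "dixieland" is a different word
    "Main Street Hardware",
]
term_counts = count_regional_terms(toy_listings)
```

Whether derived forms such as “Dixieland” should count is a coding decision; the whole-word rule used here is only one defensible option.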


Collective Identity in the South 


To many scholars of geography and culture, region is more than a physical area 
(Paasi 2003). This sense of collective identity is particularly prevalent in the litera- 
ture on the American South. More than a half century ago, Cash (1941:428-29) 
argued that there was a distinct “mind of the south.” Reed (1982) also believed 
that there is a collective identity, particularly among white Southerners. More 
recently, Griffin (2006:7) summarized the sources of this collective identity, noting 
that the South was “exceptional in its fierce commitment to slavery, in its failed 
experiment with secession and nationhood, in its military defeat and occupation 
by a conquering power, in its poverty, cultural backwardness, and religiosity, and 
in its pervasive, prolonged resistance to racial justice.” 

Southern collective identity also stems from political, demographic and cul- 
tural distinctiveness. A range of studies demonstrate that Southerners are more 
conservative than non-Southerners in a host of areas including religion, morality, 
international relations and race relations (Glenn and Simmons 1967; Hurlbert 
1989; Kuklinski, Cobb and Gilens 1997; Rice, McLean, and Larsen 2002). Even 
today, the South remains demographically distinct with higher percentages of 
blacks, lower percentages of high school graduates, lower housing values, lower 
household incomes and higher percentages of people in poverty (Cooper and 
Knotts 2004). Southerners also have distinct table manners and maintain strong 
loyalty to family ties, causing some to label white Southerners a “quasi-ethnic 
regional group.” (Reed 1982:3) 

Nevertheless, there has been some compelling evidence that collective identity 
and Southern distinctiveness are declining, particularly when “defined against 
an earlier South that was somehow more authentic, real, more unified and 
distinct.” (Ayers 2005:46) In 1958, Ashmore wrote An Epitaph for Dixie, and 
much later Degler (1987) noted that the South was being northernized. Others, 
such as Cobb (2000:3), have defended Southern distinctiveness and identity, noting
that “the South has outlived many of its epitaphers.” Similarly, Applebome
(1996) argued that the decline of Southern distinctiveness is real, but not because
the South is becoming more like the rest of the country. Instead, he makes the
case that the rest of the country is becoming more Southern.


Frameworks for Understanding the Changing South


One way to understand changing collective identity in the South is by examining 
naming trends in the region. Alderman (2008) provides two conceptual frame- 
works for understanding these naming trends: naming as symbolic capital and 
naming as symbolic resistance. According to Alderman (2008:196), “naming as 
symbolic capital is a theme that recognizes how place names are evoked to bring 
distinction and status to landscapes and people associated with them.” The second 
framework, naming as symbolic resistance, emphasizes that “racial and ethnic
minorities are increasingly turning to place renaming as a strategy for challenging 
the dominance of white-controlled commemoration and asserting the legitimacy 
of their historical achievements.” 

This broader framework is particularly useful when understanding recent 
trends in the contemporary South. Alderman and Beavers (1999) identify three 
region-specific trends: de-Confederatization, re-Confederatization and African 
Americanization. De-Confederatization, as the name suggests, occurs when tra- 
ditional Southern names are removed. A number of high-profile events indicate 
the region's increasing de-Confederatization. Examples of this shift include the 
abandonment of “Colonel Rebel” at the University of Mississippi, the removal of 
the Confederate Flag from the South Carolina State House and the elimination of 
the phrase “Heart of Dixie” from Alabama's license plate (Associated Press 2004). 
The de-Confederatization trend would suggest that the frequency of “Dixie” and 
“Southern” mentions in business names has dropped over time.

Despite signs of de-Confederatization, some argue that Southerners are re- 
sponding to a perceived loss of Southern exceptionalism through re-Confederati- 
zation, “the creation, preservation, and reaffirmation of place-naming references 
to Dixie and the Confederacy, often but not always occurring after calls to remove 
them.” (Alderman and Beavers 1999:195) A recent example of re-Confederatization
occurred in Duval County, Florida, where a school board consisting of
three white and two black members voted to retain the name “Nathan Bedford
Forrest High School” at a majority black school (Sanders 2008b). One supporter
argued for keeping the name, noting that Forrest, an early Ku Klux Klan leader,
was “nice to his slaves” and that “the only people [in favor of the name change]
are people from the North who don't care about our heritage.” (Sanders 2008a)
There is also an active and influential neo-Confederate movement throughout the
South, exemplified through magazines like the Southern Partisan (Hague, Beirich
and Sebesta 2008). In addition, eight states currently give their citizens the opportunity
to display a Sons of Confederate Veterans license plate, complete with the
Confederate Battle Flag (Wright 2009). The SCV supports re-Confederatization
through its “Chief of Heritage Defense.” Re-Confederatization suggests that we
may see a rise in “Dixie” and “Southern” naming as whites struggle to hold on to
what they view as their Southern heritage.

A final trend in the modern South is African Americanization, the “rescripting
of the landscape to reflect the achievements, ideals, and heroes of Black
southerners.” (Alderman and Beavers 1999:195) This growing movement to 
identify the region with the civil rights movement represents an important de- 
velopment in the South (Dwyer and Alderman 2008). While the most obvious 
examples of this trend are found in the increasing numbers of streets, monuments 
and buildings named after Martin Luther King, Jr. (Alderman 1996), there are 
many other examples including the Booker T. Washington monument in Tuskegee,
Alabama, the African-American Memorial on the capitol grounds in Columbia, 
South Carolina, and the Carrie A. Tuggle Memorial in Birmingham, Alabama 
(Dwyer and Alderman 2008). 


Reed's Approach 


Reed examined a range of unobtrusive measures in his quest to understand 
the South's culture, geography and politics. He portrayed the socioeconomic 
South by mapping kudzu growth, cotton cultivation, housing units with 
indoor plumbing, illiteracy rates and active dentists per 1,000 residents (Reed 
1993). Because his focus was primarily on white Southerners, Reed’s cultural 
South was also reflected by members of Baptist churches, birthplaces of coun- 
try music notables and states mentioned in country music lyrics. 

Most important for our purposes, Reed and his colleagues (Reed 1976; Reed, 
Kohls and Hanchette 1990) studied business names in the South. This work 
assumes that business names have symbolic meaning. In addition to helping 
attract customers, business naming is an opportunity for business owners to 
make a statement. This statement is often about the product, but sometimes, 
business names may also say something about an owner's identity and values. 

Reed determined “Southern” entries as a proportion of “American” entries (S 
scores) and “Dixie” entries as a proportion of “American” entries (D scores) in 
phone books from a “vaguely purposive” sample of cities (Reed 1976:926). Given 
this, a city with 50 “Dixie” and 50 “American” entries would receive a D score 
of 1.0. Similarly, a city with 50 “Southern” entries and 100 “American” entries 
would have an S score of .5. This strategy allowed Reed (1976:925) to answer three
questions: “(1) where is the South (defined this way); (2) where is Dixie?; (3) what 
is the relation between the two?” 
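The scoring rule described above can be sketched in a few lines of Python. This is a minimal illustration rather than Reed's procedure as actually carried out; the function name `proportion_score` and the handling of a zero denominator are assumptions, and the counts are the two worked examples given in the text.

```python
def proportion_score(term_entries: int, american_entries: int) -> float:
    """Return "Dixie" or "Southern" entries as a proportion of "American" entries."""
    if american_entries <= 0:
        # Assumed handling: the ratio is undefined without an "American" baseline.
        raise ValueError("need at least one 'American' entry")
    return term_entries / american_entries


# Worked examples from the text.
d_score = proportion_score(50, 50)    # 50 "Dixie" and 50 "American" entries
s_score = proportion_score(50, 100)   # 50 "Southern" and 100 "American" entries
print(d_score, s_score)  # 1.0 0.5
```

Because both scores share the "American" denominator, a city with many listings overall is not mechanically scored as more Southern than a small one.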

In his initial study, Reed (1976:934) found that S scores were generally higher 
than D scores, perhaps because “Dixie is, as one journalist observed, ‘a meaner 
word’ than Southerner.” He also found that the S score mirrored boundaries 
of the South drawn by sociologists and geographers, but the D score was used 
primarily in the Deep South. Although the two were correlated, Reed concluded 
that the S score measures both regionalism and sectionalism “while D is more
unambiguously a measure of the latter,” adding that Dixie “has more to do with
attitude than latitude.”

In later work, Reed, Kohls and Hanchette (1990:222) provided more contrasts
between the words “Southern” and “Dixie” noting that, “Dixie is a relatively 
pure measure of sectional identification, usually connoting a shared history of 
exceptionalism and opposition to the rest of the country; Southern can have the 
same connotations, but it can also indicate merely integration into the South's 
developing economy.” Summing up the differences between the two terms, Reed 
and his colleagues concluded that “Dixie evoked the Old South of plantation 
agriculture, Southern the New South of commerce and industry.” 

Alderman and Beavers (1999) replicated Reed’s work, focusing solely on D 
scores. This research highlights the trend of de-Confederatization as D scores
in their study were significantly lower than those found in earlier work. Despite
this larger trend, the authors caution that some Southern cities exhibit signs of
re-Confederatization as well. Alderman and Beavers (1999:203) also discuss the
“hybridity of southern regional identification” noting that “identity for African
American and Whites in the South is a series of multiple and overlapping historical 
and geographic subjectivities.” 

Although previous work does an admirable job cataloging the changes in D 
and S scores over time, there has been no effort to explain what makes regional 
identification vary across the South. In his 1976 article, Reed argued that high 
D scores represented the Old South and high S scores represented the New 
South but did not explore the characteristics of cities with high (or low) D and 
S scores. In the 1990 article, Reed and his colleagues demonstrated considerable 
changes in D and S scores between 1976 and 1988, and although they inferred 
that some systematic factors may explain these changes, they did not empirically 
model differences. Alderman and Beavers (1999:202) call for more investigation 
of “the social relations or politics underlying these patterns,” but do not empirically
examine these trends.


Our Approach 


We begin our study by updating Reed’s work with data from 2008. Using an 
online phone directory, we identify businesses named “Dixie” and “Southern.” 
Of course, this method is not without its faults. Choosing to use terms like “Dixie” 
and “Southern” might reflect what business owners think is socially acceptable, 
rather than their internal beliefs. There is no way to know for sure why a business 
owner chooses to name his or her business “Dixie Mini Storage,” but other mea- 
sures face similar problems. For example, a person might give a socially desirable 
answer to a survey question, rather than answering truthfully. 

We include not only businesses that start with the words “Dixie,” “Southern” 
and “American,” but also businesses that have the words anywhere in their title. 
Although this departs from the strictest replication of Reed’s analysis, it is within 
the spirit of what Reed was trying to accomplish. If “Dixie” is important as the 
first word in a business, it is important as the second, third or fourth word as well. 
We think the “2 Chicks from Dixie” gift shop in Jackson, Mississippi belongs in
our dataset just as much as “Dixie Cremation Service” in Montgomery, Alabama. 

Reed also makes a judgment call about whether “Southern” in the title modifies 
a state or region. Because we have no way to know the intent of the business owner, 
we include these entries in our analysis. Given the lack of such specific knowledge 
of intent, we believe the most responsible course of action is to include all entries 
with “Southern” and avoid judgment calls which might cause us to bias the sample 
in one way or another. It is important to emphasize that we use the same procedures 
when collecting the “American” data. Because we estimate both the numerator and 
the denominator using an identical procedure, the relative proportions should be 
comparable across both our data collection and Reed’s data collection. 

Finally, we did not include entries for the grocery store chain Winn-Dixie 
in our dataset. Reed did not include Winn-Dixie perhaps because the stores are 
distributed unevenly across the country and the South. For example, in the company’s
corporate headquarters of Jacksonville, Florida, 87 percent of “Dixie” listings
are Winn-Dixie stores. In Miami, Florida, hardly the epicenter of the Old
South, 60 percent of the “Dixie” listings are Winn-Dixie. Clearly, this one large
player would distort the concepts we are trying to measure in our study. 
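The inclusion rules in this section (match the word anywhere in the name, keep every “Southern” entry, drop Winn-Dixie) could be expressed roughly as follows. This is a sketch under stated assumptions, not the authors' actual collection procedure: the function `count_term`, the whole-word and case-insensitive matching, and the listings other than the three business names quoted in the text are all hypothetical.

```python
import re

# Assumed pattern for the excluded grocery chain, tolerant of "Winn-Dixie" or "Winn Dixie".
WINN_DIXIE = re.compile(r"\bwinn[\s-]*dixie\b", re.IGNORECASE)


def count_term(names, term):
    """Count listings containing `term` as a whole word anywhere in the name."""
    pattern = re.compile(rf"\b{re.escape(term)}\b", re.IGNORECASE)
    return sum(
        1
        for name in names
        if pattern.search(name) and not WINN_DIXIE.search(name)
    )


listings = [
    "2 Chicks from Dixie",        # named in the text
    "Dixie Cremation Service",    # named in the text
    "Dixie Mini Storage",         # named in the text
    "Winn-Dixie Store",           # hypothetical listing, excluded by rule
    "Great Southern Tire",        # hypothetical listing
    "American Title Company",     # hypothetical listing
]

dixie = count_term(listings, "Dixie")
southern = count_term(listings, "Southern")
american = count_term(listings, "American")
print(dixie, southern, american)  # 3 1 1
```

Applying one function to all three terms mirrors the point made above: the numerator and the denominator are estimated by an identical procedure.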

After replicating Reed’s work, we extend his analysis by investigating the corre- 
lates of regional identification for the 100 cities in Reed’s original sample. We model 
D and S scores in 1976, 1988, 1998 and 2008³ as a function of three independent
variables: educational achievement, the relative size of the black population and 
population density. Because studies demonstrate that more educated people are 
less likely to support the Confederate flag (Cooper and Knotts 2006; Reingold and 
Wike 1998), we expect that cities where people are more educated will have lower 
D and S scores. Previous studies have also demonstrated that people in less densely 
populated areas may be more supportive of traditional symbols of the Old South, 
such as the Confederate flag (Cooper and Knotts 2006). Consequently, we expect 
that areas with less population density will have higher D and S scores. 

Areas with the highest black populations are often located in the Deep South 
which is where we would expect the highest D and S scores. There is also literature 
suggesting that racial context has a particularly strong impact on the social and 
political attitudes and actions of whites, although the direction of this relationship 
is more difficult to discern. Racial threat theory demonstrates that in areas with 
higher concentrations of blacks, whites are less likely to support busing (Giles 
1977), more likely to vote for David Duke (Giles and Buckner 1993), and more 
likely to hold a host of conservative racial attitudes (Taylor 1998). These studies 
make us think that whites in areas with high black populations will be more likely 
to embrace traditional Southern labels. Another line of thinking suggests that racial
context will make it less likely for businesses to be named “Dixie.” Businesses in
these areas depend on a black customer base and the economic costs of a name like
“Dixie” may be too high. Given these cross pressures, it is difficult to predict how the black
population variable will affect naming patterns. This point is well-summarized
by Alderman and Beavers (1999:202) who note that Southern blacks “take very
complicated and contradictory positions within the South’s geography of naming,
positions that do not always find expression on our maps.”

Figure 1. John Shelton Reed’s Map of the Proportion of “Dixie” to “American”
Entries in Phone Books, 1976
Note: Reed, John Shelton. 1976. “The Heart of Dixie: An Essay in Folk Geography.”
Social Forces 54(4):925-39.


Identification with the Old and New South 


To begin our analysis we replicate Reed’s original study, calculating the D and 
S score for all 100 cities in his sample. A complete listing of D and S scores appears
in the Appendix. Following previous work (Reed 1976; Reed, Kohls and
Hanchette 1990) we also map D and S scores. 

For comparison purposes, Figure 1 presents Reed’s (1976) original map of D 
scores and Figure 2 is our map of D scores in 2008. To produce our map, we began 
with the data collection procedure described above. We then used an ordinary
kriging method to interpolate the data in between points (Isaaks and Srivastava
1990) and, using ArcGIS 9.3, placed each location into one of the four ranges
used by Reed (1976).

Figure 2. Map of the Proportion of “Dixie” to “American” Entries in Phone Books, 2008
Note: Created using ArcGIS 9.3 and an ordinary kriging method.


Comparing the two D-score maps shows quite a few changes in naming pat- 
terns from 1976 through 2008. Figure 1 shows a core region with a D score greater 
than .25 that coincides with most common definitions of the South. Even in 1976, 
southern Florida, metropolitan Atlanta, western North Carolina, Texas and most 
of Virginia appeared less southern than the core South. Figure 2 shows that
D scores have dropped considerably across the South, although there are
still pockets of strength. In particular, southern Alabama and southern Mississippi 
still have a large number of establishments using the name “Dixie.” D scores ranging 
from greater than .15 to .25 extend through parts of Louisiana, Alabama, Mississippi, 
Florida and southern Georgia. The third layer includes parts of the aforementioned 
states, as well as Kentucky, Tennessee, and areas of North Carolina, Texas, Virginia, 
Ohio, Indiana and Arkansas. Most of Virginia, as well as the majority of eastern 
North Carolina appear in the lowest category of less than .06. 

Comparisons between the S-score maps present a much different picture. Figure 
3 displays Reed’s (1976) map of S scores and Figure 4 includes our map of S scores 
from 2008. Although they were created more than 30 years apart, both maps look 
strikingly similar. The highest concentration of “Southern” to “American” entries 
in 1976 is similar to the highest concentration in 2008. The only exception is that 
eastern North Carolina, southern Virginia and northern Tennessee have moved to
the second highest category. In both 1976 and 2008, the category with the second
highest proportion of “Southern” to “American” listings forms a band around the
core South that includes parts of southern Virginia, portions of Kentucky, most of
Arkansas, southern Illinois, East Texas, and central and northern Florida.⁴

Figure 3. John Shelton Reed’s Map of the Proportion of “Southern” to “American”
Entries in Phone Books, 1976
Note: Reed, John Shelton. 1976. “The Heart of Dixie: An Essay in Folk Geography.”
Social Forces 54(4):925-39.

Our descriptive statistics confirm that the prevalence of the word “Dixie” has 
dropped dramatically over the past 30 years. The mean D score in 1976 was .17, 
and it dropped to .09 in 1988 and .07 in 1998. The drop has continued with a 
mean D score of .05 in 2008. The standard deviation has become much more 
compact (from .20 to .12 to .08 to .07), suggesting that cities are more similar than 
they used to be. Further, about half of the cities in Reed’s original sample had a D 
score of .20 or greater compared to about 4 percent in 2008. By examining specific 
cities, we see that the top three D scores in 1976 were Macon, Biloxi-Gulfport 
and Augusta; the top three D-score cities in 1988 were Columbus, Macon and 
Jackson; in 1998, the top cities were Montgomery, New Orleans and Tallahassee. 
By 2008, the top three cities were Biloxi-Gulfport, Louisville and Montgomery. 

Figure 4. Map of the Proportion of “Southern” to “American” Entries in Phone
Books, 2008
Note: Created using ArcGIS 9.3 and an ordinary kriging method.


The maps and descriptive statistics show a dramatic decline in the use of the 
word “Dixie,” suggesting that the Old South is shrinking. The question remains 
whether the proportions are shifting at similar rates across cities, or whether some 
cities are experiencing more dramatic shifts than others. Comparing correlation 
coefficients between D scores over time, we find that cities with lower D scores 
in previous years are, in general, more likely to have lower D scores at later time
periods, although this pattern fades as the time between measures grows. For 
example, the correlation between the D score in 1976 and the D score in 1988 
is .88. The D scores in 1988 and 2008 are correlated at .67, and the D scores in 
1998 and 2008 are correlated at .66. Not surprisingly, the fit between 1976 and 
2008 is somewhat lower at .62. 
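The over-time comparisons reported here are ordinary Pearson correlations between two years' scores for the same cities. A minimal sketch follows; the function `pearson_r` is written out for illustration, and the four-city series are hypothetical values, not the study's data.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of city scores."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two equal-length series with at least 2 cities")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)


# Hypothetical D scores for the same four cities in two years (illustration only).
d_1976 = [0.40, 0.25, 0.10, 0.05]
d_1988 = [0.20, 0.15, 0.04, 0.03]
r = pearson_r(d_1976, d_1988)
print(round(r, 2))
```

A high coefficient indicates that cities kept roughly the same rank order even while the overall level of the scores fell, which is the pattern the text describes for adjacent time periods.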

A closer examination of S scores over time suggests that S and D are measuring 
different concepts. The S score means are much closer than they were for the D 
scores. The mean S score in 1976 was .38, compared to .36 in 1988, and .35 in
2008. Further, there are still some cities that frequently use the word “Southern,”
including nine cities where businesses are more likely to refer to themselves as
“Southern” than “American” (an S score of greater than 1). Comparatively, there
were four cities in 1976 with S scores greater than 1.0 and 10 cities in 1988.
Turning to specific cities, Macon, Augusta and Biloxi-Gulfport had the highest 
S scores in 1976, and Columbus, Biloxi-Gulfport and Savannah had the highest 
S scores in 1988. By 2008, the cities with the largest S scores were Shreveport, 
Baton Rouge and Birmingham. The correlation between the S score in 1976 and 
the S score in 1988 is .91. Scores in 1988 and 2008 are correlated at .79, and the 
fit between 1976 and 2008 is .82. 

In sum, the picture that emerges from the analysis of “Southern” entries is much
different than the pattern we saw for “Dixie.” Businesses are not moving away from
“Southern” like they are from “Dixie.” Nostalgia for the Old South appears to be
declining, but it is still possible to express regional identity without harkening back 
to the days of the Confederacy. At least for business owners, the word “Southern” 
is still fairly mainstream. Our data, therefore, do not reveal a decline of regional 
identity per se, as much as a decline of identification with the Old South. 

To provide further evidence of the growing differences between D and S, we 
computed a series of correlation coefficients between D and S scores for each time 
period. This analysis reveals that the relationship between these concepts was once 
fairly strong. In 1976, the correlation was .84 and in 1988 the correlation was .87. 
Today, the relationship between D and S scores is a much lower .59. This finding 
suggests that these concepts were more similar in the past and that “Dixie” and
“Southern” have increasingly different meanings in 2008.


Explaining Identification with the Old and New South 


To extend earlier work, we model Reed’s D and S scores in 1976, Reed, Kohls, 
and Hanchette’s D and S scores from 1988, Alderman and Beavers’ D scores from 
1998, and our D and S scores from 2008. Table 1 presents the Ordinary Least 
Squares regression results for seven models. The unit of analysis is the city, and 
we obtained the independent variables from the most recent U.S. Census prior to 
data collection. Therefore, we used data from the 1970 U.S. Census for the 1976 
models, the 1980 U.S. Census for the 1988 models, the 1990 U.S. Census for 
the 1998 models, and the 2000 U.S. Census for the 2008 models. Each of the 
columns includes values for the unstandardized regression coefficients and the cor- 
responding t-ratios. Recall that we expect educational attainment and population 
density will be negatively associated with the dependent variables across time. We 
are more agnostic, however, about the expected effects of the relative size of the 
black population. Racial threat theory would suggest a positive relationship, while 
rational economic interests would suggest a negative relationship. 
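For readers who want to see the mechanics, the city-level model can be sketched as an ordinary least squares fit of a score on the three predictors. Everything specific below is an assumption made for illustration: the function `ols`, the six hypothetical cities, and the "true" coefficients used to generate a noiseless outcome. The sketch returns coefficients only and does not compute the t-ratios reported in Table 1.

```python
def ols(X, y):
    """Return OLS coefficients [intercept, b1, ..., bk] via the normal equations."""
    rows = [[1.0] + list(r) for r in X]  # prepend an intercept column
    k = len(rows[0])
    # Augmented system (X'X | X'y).
    A = [
        [sum(r[i] * r[j] for r in rows) for j in range(k)]
        + [sum(r[i] * yi for r, yi in zip(rows, y))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        if abs(A[pivot][col]) < 1e-12:
            raise ValueError("predictors are collinear")
        A[col], A[pivot] = A[pivot], A[col]
        pivot_value = A[col][col]
        A[col] = [v / pivot_value for v in A[col]]
        for r in range(k):
            if r != col:
                factor = A[r][col]
                A[r] = [vr - factor * vc for vr, vc in zip(A[r], A[col])]
    return [A[i][k] for i in range(k)]


# Hypothetical cities: (share black, share college-educated, density in 1,000s per sq. mile).
cities = [
    (0.10, 0.20, 1.0), (0.30, 0.15, 2.5), (0.50, 0.10, 0.8),
    (0.20, 0.30, 4.0), (0.40, 0.25, 1.5), (0.60, 0.12, 3.0),
]
true_coefs = [0.05, 0.40, -0.10, -0.01]  # assumed values, not estimates from the study
scores = [
    true_coefs[0] + sum(b * x for b, x in zip(true_coefs[1:], city))
    for city in cities
]

estimates = ols(cities, scores)
print([round(b, 3) for b in estimates])
```

With a noiseless outcome the fit recovers the assumed coefficients; with real data the signs of the estimates would be compared against the expectations stated above.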

Modeling Reed’s D and S scores from 1976, we find that cities with higher 
black populations during this time period also had higher D and S scores. Cities 
with higher percentages of college-educated people have higher S scores, but 
education did not have an impact on the D score. Population density was not 
significantly related to either D or S scores in 1976. 

Figure 5. The Influence of Black Density on D Scores
Note: D Scores are computed as the number of Dixie entries divided by the number of
American entries in city phone directories.


Modeling Alderman and Beavers’ 1998 D scores presents similar results to our 
1988 D-score model. Once again, cities with higher black concentrations had higher 
D scores, and cities with higher population densities had lower D scores. For the 2008 
models, percent black is once again a statistically significant predictor of both D and 
S scores. Similar to the 1988 and 1998 models, population density is again negative 
and significant for the D-score and S-score models. College education, however, is 
not significant in the 2008 models. Together, these findings point to the overwhelming
importance of race in predicting identification with the South. Consistent with
racial threat theory, cities with higher concentrations of blacks are likely to have more
business names that identify with both the Old and the New South.

To gain a better sense of the relationship between the D score and the black
density variable over time, we generated predicted values of D scores in 1976, 
1988, 1998 and 2008 at each level of black density, while holding all other vari- 
ables at their sample means (King, Tomz and Wittenberg 2000; Tomz, Wittenberg 
and King 2003). These results, presented in Figure 5, demonstrate the substantive 
significance of racial context at each of the four time periods. In a city with a 
relatively high proportion of blacks (around 80 percent), the predicted D score 
is almost .5 in 1976, compared with about .3 in 1988, .2 in 1998, and .09 in
2008.⁵ In all, Figure 5 suggests that although race continues to be important,
racial context does not have the same influence that it once did. In cities with few 
blacks, there have never been high D scores, and although there has been a drop 
in D scores in these cities, it is not substantial. In cities with higher black density, 
however, the drop in D scores has been considerable. 
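The logic behind Figure 5, varying black density while holding the other variables at their sample means, can be illustrated with a simple point prediction. This is a simplified sketch: the study cites simulation-based methods (King, Tomz and Wittenberg 2000), whereas the code below only evaluates a linear equation. The function `predicted_d`, the coefficients, and the sample means are hypothetical and are not taken from Table 1.

```python
def predicted_d(coefs, black_share, mean_college, mean_density):
    """Predicted D score at a given black share, other predictors at their means."""
    intercept, b_black, b_college, b_density = coefs
    return (
        intercept
        + b_black * black_share
        + b_college * mean_college
        + b_density * mean_density
    )


# Hypothetical coefficients and sample means (illustration only).
coefs = (0.10, 0.40, -0.20, -0.01)
mean_college, mean_density = 0.20, 2.0

grid = [0.0, 0.2, 0.4, 0.6, 0.8]  # black share of city population
predictions = [predicted_d(coefs, share, mean_college, mean_density) for share in grid]
for share, value in zip(grid, predictions):
    print(f"{share:.1f} -> {value:.3f}")
```

Repeating the calculation with each year's coefficients would give one line per year, and a flatter line in later years would correspond to the weakening influence of racial context described above.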


Discussion 


Our study replicates and extends Reed’s classic research on regional identification. 
This replication provides evidence of the continued de-Confederatization of the 
American South. We demonstrate that collective identity with the Old South is de- 
clining, as the use of “Dixie” has decreased precipitously across the United States. 
Growing African Americanization in the South may be one factor contributing to 
this decline. Not only is there increased competition for potential names, but the 
political and economic consequences of connecting to the Old South are greater 
than ever. In today’s society, most businesses cannot afford to name a business 
“Dixie” and risk alienating much of the population. As John Shelton Reed noted
in 2004, “If you're a businessperson, why do you want a name that’s going to raise
anybody’s hackles?” (Associated Press 2004) 

Despite widespread de-Confederatization, we find that use of the word
“Southern” has stayed fairly constant over time. Today, more than at any other
point in time, there is a clear difference between a business owner naming her
business “Dixie” and naming it “Southern.” As a result of this growing differ- 
ence, we suggest another trend not offered by Alderman and Beavers (1999). 
The preservation of “Southern” place names points to a re-southernization—the 
creation, preservation and reaffirmation of place-naming references to the South 
(but not necessarily to the Old Confederacy). This trend may be fueled in part 
by the increasing popularity of the “Southern” brand. In recent years, the South 
has developed considerable brand identity and is frequently marketed to those 
who tour and relocate to the region. The quintessential example is Atlanta, a city 
with a large black population and substantial in-migration from the North that 
promotes itself as the “Capital of the New South.” Another factor contributing 
to re-southernization may be the in-migration of blacks, a group that displays an 
increasing affinity for the region (Cobb 2005). Whatever the reason for this
re-southernization, it is clear that the modern South is one where southern business
owners have learned to express regional pride without harkening back to the
negative stereotypes associated with the Old South. 

Our study also extends Reed’s work by explaining variation in place naming 
across 100 cities at four different time periods and presenting a more nuanced 
interpretation of the decline in regional identity. In the mid-1970s, in the
aftermath of the civil rights movement and amid an era of busing, our results
demonstrate that racial context was the only significant predictor of place names 
connected to the Old South. Contrary to our expectations, college education and 
population density were not factors in explaining the variation in D scores in 1976.
Cities with higher levels of education and a more cosmopolitan orientation did 
not have significantly different levels of regional identification in 1976. 

In 1988, 1998 and 2008, racial context remained important. However, eco- 
nomic pressures and a growing sensitivity to the black customer base likely led to 
a decrease in these types of business names, particularly in areas with high black
concentrations. Our expectations about the effects of population density were 
confirmed. Density emerged as a significant predictor of regional identification, 
demonstrating that it was likely less acceptable to name a business “Dixie” in more 
cosmopolitan areas. 

Going forward, we believe that D and S scores provide an unobtrusive and 
easily replicable means of measuring the South. Advances in technology mean 
that future work should move beyond Reed’s 100 cities and focus on small towns 
and rural areas of the South. Future studies could also explore uses of “Dixie” 
and “Southern” at the state-wide level. Rather than relying on whether the state 
is located in the Old Confederacy, future scholars should consider employing 
continuous measures of regional identity, such as D and S scores, to provide 
a more nuanced understanding of the South. Similar techniques could also be 
applied to study the increased prevalence of black naming in the South (Dwyer 
and Alderman 2008) and the growing study of regional identity in nation states 
across the world (Paasi 2003). 

The study of regional identity has changed considerably since Reed’s seminal
work. Studying the politics of naming can still provide keen insights on regional 
identity, but scholars should consider supplementing previously used techniques. 
To truly understand the politics of naming, future researchers should also con- 
sider surveying and interviewing business owners. Talking to the proprietors and 
patrons of “2 Chicks from Dixie” will go a long way to better understanding this 
evolving American region. 


Notes 


1. We performed all searches at http://www.whitepages.com/. 


2. Other than increasing the D score disproportionately, keeping Winn-Dixie in the 
dataset makes virtually no difference in the results of our multivariate models. 


3. Reed included complete lists of D and S scores in his 1976 and 1990 articles, and 
Alderman and Beavers provided a complete list of D scores in their 1999 article. 


4. The darker shade around Los Angeles and Las Vegas likely signals identification with
the Southern California region. 

5. We also modeled the difference between the 1976 D score and the 2008 D score and
the 1976 S score and 2008 S score using the true difference method (Firebaugh and 
Beck 1994). In the change in D score model, the only significant predictor is the D 
score for 1976. In the change in S score model, there were two significant predictors: 
the S score in 1976 and the change in population density. 
Unpacking the Unspoken:
Silence in Collective Memory and Forgetting 


Vered Vinitzky-Seroussi, Hebrew University of Jerusalem 
Chana Teeger, Harvard University 


Collective memory quite naturally brings to mind notions of mnemonic speech and 
representation. In this article, however, we propose that collective silences be thought 
of as a rich and promising arena through which to understand how groups deal with 
their collective pasts. In so doing, we explore two types of silence: overt silence and 
covert silence, and suggest that each may be used to enhance either memory or forget- 
ting. We illustrate our conceptual scheme using data on the commemoration of slain 
Israeli Prime Minister Yitzhak Rabin. 


The concept of collective silence quite naturally brings to mind notions of forget- 
ting and amnesia. Collective memory is generally understood to entail the nar- 
ration and representation of the past, while collective forgetting is antithetically 
thought to be a silencing and muting of the past. It comes as no surprise, therefore, 
that when nations, collectives or individuals wish to ensure that certain events, 
eras, people and experiences are remembered, they quite naturally turn to words 
and images. What can be heard, seen and touched has become the cornerstone 
of memory. As a result, absence and silence have often resulted in protest by 
groups who have shared the assumption that recollection is impossible without 
talk and representation (see for example Zolberg 1998; Scott 1996; Young 1993; 
Wagner-Pacifici and Schwartz 1991). While no one can guarantee the mnemonic 
maintenance and survival of issues that have received textual and narrative repre- 
sentation because memory is unstable, changing and unpredictable (Zelizer 1995), 
commemoration seems to amount—at least in its beginning stages—to words, 
narratives and much talk. In other words, speech, narrative and text seem to be 
perceived as necessary, if not sufficient, for ensuring collective memory.
Scholars of collective memory have focused much attention on such representa- 
tions of the past and have paid attention to both the form and content of these 
representations. In terms of form, they have examined formal and informal rituals, 
historical museums, central and peripheral monuments, history books, school 
curricula and much more. In terms of content, scholars have paid attention to 
the words, texts and narratives that have filled the above-mentioned forms. In 
addition, scholars of collective memory have quite naturally taken note of what 
is missing and not talked about in representations of the past. In this way, they 
have drawn attention to distortions in what had hitherto been perceived as the
truth about the past (e.g., Ben-Yehuda 1995) as well as to processes through which 
people and events were excluded and forgotten from collective memory (e.g., 
Armstrong and Crage 2006; Stora 2006; Piterberg 2006; Choi 2001; Yoneyama 
1999; Prost 1999; Aguilar 1999; Sturken 1991; Ehrenhaus 1989). Thus, alongside
analyses of mnemonic narratives, speech and representations, we have seen a
growing scholarly interest in silences, omissions and exclusions. Although it is dif- 
ficult to study those things about which individuals keep silent (Zerubavel 2006), 
scholars of collective memory have been able to examine official representations 
of the past in the aim of noting those topics that were left out and silenced. These 
silences have become the first indication that specific topics have stood a chance 
of being permanently sidelined, forgotten or denied altogether. 

While acknowledging that silence is often tightly coupled with forgetting and 
talk with memory, we wish to expand on the ways in which silence can also be 
used to facilitate recollection, while talk can be used to enhance amnesia. In other 
words, we suggest that silence be understood as a complex and rich social space 
that can operate as a vehicle of either memory or of forgetting and thus can be 
used by various groups for different ends. We do this by extending the notion of 
silence in two ways. First, we argue that silence need not only be thought of as 
the antithesis of speech. As such, we suggest that silence be disaggregated into 
its overt and covert manifestations. By overt silences, we refer to a literal absence 
of speech and narrative. Covert silences, on the other hand, are silences that are 
covered and veiled by much mnemonic talk and representation. Such silences are 
not about the complete absence of talk, ritual or practice. Rather, they are about 
the absence of content. Because these absences are not immediately apparent as 
such, covert silences are often quite difficult to identify and critique. Second, we 
suggest that both overt and covert silences can be utilized in the aim of either 
memory or forgetting. In other words, we offer a typology of silences distinguished 
on the one axis into overt and covert silences and on the other axis into silences 
aimed at memory and silences aimed at forgetting. 

Our analysis of the various forms and functions of silence is undoubtedly locat- 
ed within a contemporary mnemonic landscape where groups, sectors and entire 
nations are expected to recognize and confront their post-heroic pasts (Schwartz 
and Schuman 2005) and to examine their shameful histories and embarrassing 
moments (Olick 2007). For reasons that are beyond the scope of this article, cel- 
ebrating a mythic and heroic past and ignoring “difficult pasts” (Wagner-Pacifici 
and Schwartz 1991; Vinitzky-Seroussi 2002) seem to have become less and less 
legitimate in the contemporary mnemonic landscape. As such, keeping completely 
silent about certain issues is increasingly becoming a non-option for many nations 
(or, at the very least, an option with a high political price tag attached). This trend 
has not, however, meant that commemorative activities around these pasts are 
wholeheartedly embraced. Today, as in the past, certain constituencies do not wish
to remember and acknowledge certain pasts, especially if such memories bring up
issues of accountability and guilt. However, unlike in the past, these groups often 
cannot withdraw into a complete and collective silence. How then do groups and/or
nations that wish to forget the past, or at the very least not to talk about it, do
so in an era where this is less and less acceptable (and, in some cases, where they 
are legislatively forced to “remember”)? Furthermore, how do groups that do wish 
to remember the past do so while minimizing conflict with other groups that do 
not wish to recollect its shameful aspects? 

Using the case of the commemoration of the assassination of Yitzhak Rabin 
in Israel, we address these questions by showing how different actors in the mne- 
monic scene differentially engage in talk and/or silence in the aim of advancing 
remembering and/or forgetting. 


Yitzhak Rabin’s Assassination: Background and Data 


Yitzhak Rabin was a much-admired military officer who commanded a brigade 
that fought in the 1948 Israeli War of Independence. He was the Israeli Defense 
Forces Chief of Staff during the 1967 War, at the conclusion of which the east- 
ern part of Jerusalem and the West Bank were captured from Jordan, the Golan 
Heights from Syria and the Sinai Desert and Gaza Strip from Egypt (collectively 
referred to as the occupied territories). In September 1993, during Rabin’s sec- 
ond term as Prime Minister and Minister of Defense, the peace process with the 
Palestinians was officially initiated with the signing of the “Oslo Accords” by 
Israeli and Palestinian leaders. The term “peace process” refers to the political 
attempt at concluding the bitter and bloody 100-year-old conflict between Jews 
and Arabs in the Middle East. That Rabin was engaged in a peace process was 
evident to his political supporters, but it was not evident to all of his opponents, 
who perceived any withdrawal from the occupied territories (by now populated 
with Jewish settlements) as a nightmarish peace—a disaster on both strategic and 
religious grounds. Thus soon after the famous handshake between Rabin and 
Arafat at the White House in September 1993, Rabin became the primary target 
of a vilification campaign organized by elements of the Israeli right who labeled 
him as a traitor (Ben-Yehuda 1997). The main accusation against Rabin had to do 
with his willingness to withdraw from territory that had been occupied by Israel 
since 1967.¹ The Israeli right, and especially the Jewish settlers in the occupied
territories, who felt that the Israeli government had deserted them, organized 
many harsh demonstrations, which were often led and addressed by prominent 
right-wing political figures and religious rabbinical authorities. 

In an attempt to respond to the campaign against Rabin, his government and 
the peace process, the Israeli left organized a demonstration in Tel Aviv on Nov. 
4, 1995. As Rabin was leaving the rally, he was shot three times. Several hours 
later, Rabin’s personal assistant announced that the Prime Minister was dead. 
Soon enough, the assassination would come to symbolize a chasm deep in the 
heart of the nation. The assassination tore open the right-left, religious-secular
divides within Israeli society. While all (save several conspiracy theorists) agreed 
that Yigal Amir, an Orthodox Jewish law student, had pulled the trigger, there
was little agreement over what both the assassin and the assassination symbolized 
at a deeper level. For the religious right, Yigal Amir became something of “a bad 
apple,” a fringe lunatic who acted irrationally. For the secular left, however, Yigal 
Amir stood within a much broader context of a campaign of political incitement 
against the Prime Minister. 

It should come as no surprise therefore that right-wing groups, and especially 
religious right-wing groups, including groups representing Jewish settlers living 
in the occupied territories, were less than enthusiastic about commemorating 
Rabin and the assassination, while left-wing groups were more inclined to do so. 
The commemorations of Yitzhak Rabin have thus been classified as “a difficult 
past” in so far as the commemorated event is disputed and divisive (see Wagner- 
Pacifici and Schwartz 1991; Vinitzky-Seroussi 2002). Within this context, many 
of those who wished to commemorate Rabin and the assassination did so on 
their own volition, forming voluntary associations and creating, for example, the 
annual memorial ceremony held in Rabin Square in Tel Aviv. These elective or 
voluntary commemorations are not the only ones. In 1997, the Israeli Knesset 
(Parliament) passed the Yitzhak Rabin Memorial Day Law, which legislated that 
major social institutions (mainly the educational system) were required to com- 
memorate Rabin. Additionally, an annual memorial ceremony at Rabin’s gravesite 
became institutionalized through this legislation. The law, in effect, forces certain 
individuals and groups to talk about Rabin and the assassination. 

In this article, we illustrate our argument about the various forms of silence 
by drawing on data gathered by the first author over a period of close to 10 years. 
First, we make use of observations at various public commemorations for Rabin, 
including “coerced commemorations” (e.g., mnemonic activities in state schools, 
official ceremonies at Rabin’s burial site on Mount Herzl, Jerusalem) as well as 
“elective” ones (e.g., the annual memorial ceremony in Rabin Square in Tel Aviv, 
near the assassination site). Second, we draw on 24 in-depth interviews with relevant
agents of memory² including Rabin’s widow, his personal assistants, various
political activists, memorial designers and others. Third, we analyze data gathered 
from observations conducted in 30 Jewish state and state-religious schools on 
Rabin Memorial Day in 1997 (the year in which the Yitzhak Rabin Memorial Day 
Law was passed) and in 2005 (the 10th anniversary of the assassination). Finally, we
draw on content analysis of all articles published about Rabin or the assassination 
from seven Israeli newspapers in three languages (Arabic, Russian and Hebrew). 
The texts analyzed include the entire population of articles published on the topic 
around the date of the Memorial Day (i.e., the anniversary of the assassination) in 
the selected newspapers in 1996 (the first anniversary of the assassination) and in 
2000 (the fifth anniversary of the assassination).³ While focusing predominantly
on the “public transcript” of commemorative activities, our data allow us some
leverage in unpacking the “hidden transcripts” (Scott 1990) articulated in schools 
and niche newspapers. Our analysis of silence in the commemoration of Rabin 
is thus grounded in both official commemorative activities directed at broad and 
diverse audiences as well as in more localized events and pronouncements target- 
ing only “insiders.” 


Silence and Memory, Talk and Forgetting 


Forgetting is of course an inescapable element in remembering. Schudson 
(1997:348) puts it succinctly when he states that “[m]emory is distortion since 
memory is invariably and inevitably selective. A way of seeing is a way of not see- 
ing, a way of remembering is a way of forgetting, too.” As many have pointed out, 
in any recollection of the past, certain elements are always highlighted, some are 
ignored and all are interpreted (e.g., Crane 2000; Winter and Sivan 1999; Brink 
2000). In effect, one of the major characteristics of commemorative activities is that 
they serve to elevate from historical records certain events and people that would 
otherwise be socially forgotten or buried in archives and other deserted social loca- 
tions (Schwartz and Schuman 2005). Thus, memory, like narrative, is “constructed 
around its own blind spots and silences.” (Brink 2000:37) By the same token, the
ability to converse at all may be predicated upon silences, such that saying anything 
at all entails silencing the multitude of other possible speech acts available in a 
given situation (see Gurevitch 1995). In other words, the ability to remember, to 
speak of or to commemorate one thing may implicitly be predicated on the ability 
to keep silent on others. Needless to say, many of these silences and exclusions are 
far from benign and often reflect real desires to mute certain aspects of the past in 
order to (re)present its other aspects in specific ways, often more favorable to those 
in power (Yoneyama 1999; Polletta 1998; Trouillot 1995; Spillman 1994; Sturken 
1991, 1997). In this sense, the narration of certain memories and the silencing of 
others can oftentimes be conceptualized as the attempts of those with power to set 
the limits on what is speakable or unspeakable about the past. 

In their examinations of unspoken or unspeakable pasts, Zerubavel (2006) and 
Cohen (2001) have theorized on ways in which psychoanalytic theories might 
be applied in sociological contexts. Accordingly, they have pondered how
Freudian ideas of repression—whereby individuals subconsciously bury traumatic 
and painful memories below consciousness—might be said to operate at the 
inter-personal level. In moving to the social sphere, both Zerubavel and Cohen 
adopt the notion of denial over that of repression. This preference may indicate 
a sociological prioritization of voluntary, conscious and—at points—contingent 
processes of social acknowledgement or denial, over a psychological concern with 
unconscious, involuntary and inevitable processes of repression.⁴ Having framed
their concern in terms of denial rather than repression, both Zerubavel and Cohen 
proceed to identify moments of denial by analyzing the social configuration of
silence. As in intra-personal processes of repression, the types of inter-personal
processes of denial that they identify may help to avoid pain, bad memories,
trauma, embarrassment, shame and stigma (see Stein 2009). All of these denials,
however, are understood as closely associated with forgetting and silence.

Table 1: Types of Silence

              Overt Silence        Covert Silence
Memory        Moment of silence    Bland commemoration
Forgetting    Pervasive silence    Cacophonous commemoration

In this article, we suggest that silence be understood more broadly. Thus, we 
argue that silence need not necessarily be about denial and forgetting but may in cer- 
tain circumstances be the ultimate example of acknowledgement and remembrance. 
At other points, much talk may be the clearest indication of a desire for denial and
social amnesia. As such, we suggest that silences be distinguished along two dimen- 
sions: type and intention. In terms of the former, we distinguish between overt and 
covert silences. Overt silences are those types of silences that we quite normally think 
of. They are literal silences characterized by a complete absence of any narrative or 
speech and are thus usually quite easy to detect. Covert silences, on the other hand, 
are silences that inhere within speech. These are silences that are veiled by much 
mnemonic talk and as such are harder to decipher and identify. 

In terms of the second dimension of our typology, silences vary in terms of the 
purpose they are intended to serve. Here we distinguish between silences aimed at 
memory and silences aimed at forgetting. Both types of silence (overt and covert) 
can be used as mechanisms through which to enhance either memory or forgetting. 

Table 1 summarizes our typology. In our analysis, we identify, illustrate and
discuss four types of silences: (1) overt and literal silences aimed at enhancing
memory; (2) overt and pervasive silences aimed at forgetting; (3) covert silences
that inhere in the mnemonic talk of agents intending to construct and maintain
memory; and (4) covert silences used to enhance forgetting in situations where
complete and overt silences are neither possible nor desirable.


Overt Silence in the Domain of Memory 


When nations, societies, agents of memory or other individuals wish to remem- 
ber and make others remember, they often turn to total silence. That silence, 
however, is not unbounded, nor does it rest in a vacuum. On the contrary, the 
silence is intentional, purposive and planned in advance, and its raison d’être is
commemoration. 

In many social and national contexts, the most sacred ritual saved for unexpect- 
ed major tragedies, natural disasters, heroes, martyrs and deaths begins with a mo- 
ment of silence. The aim of such moments of silence is introspection and reflection 
on that which is commemorated. These moments interrupt the usual flow of time, 
of gestures and bodily movements, of speech, and of thoughts. Such moments of
silence often repeat themselves at the precise time that the commemorated event 
occurred. There is probably no text that can perform a similar commemorative 
function by inscribing itself on one’s body so powerfully. This is especially true 
if the moment of silence is enveloped by a siren heard all over the country and is 
accompanied by a normative requirement to stand still. Moments of silence that 
become part of an annual official memorial time and thus repeat themselves every 
year, are arguably the strongest manifestations of a desire to remember. 

In Israel there are three moments of mnemonic silence that are announced with 
a siren heard across the country.⁵ On Memorial Day for the Holocaust there is
one siren which is heard at 10 a.m. This siren lasts for one minute. On Memorial 
Day for the Fallen Soldiers—which falls a week later—there are two sirens. The 
first, which lasts one minute, is heard the evening preceding the Memorial Day.⁶
The second, which lasts two minutes, is sounded at 11 a.m. and marks the official 
opening of the memorial ceremonies at state military cemeteries. The normative 
imperative is that people stop whatever they are doing and stand still for the 
duration of the siren regardless of where they are when it sounds. The structured 
moments of silence constitute part of the mnemonic socialization of young chil- 
dren, who early on learn to discipline their bodies in the name of commemoration. 
Driving on the highway in Israel on one of these memorial days when the siren 
goes off, it is in no way uncommon to see people stopping their cars, stepping 
onto the road and standing still until the siren is over. The annual reenactment 
of the moment of silence is so powerful that individuals often find that when 
the siren sounds, they stand still, keep silent and contemplate the day even if the 
demarcated moment finds them alone in their homes or offices. The structured 
moment of silence thus becomes something that is difficult to ignore. Borrowing 
from Durkheim (1964) and Foucault (1977) one could say that the ritualized 
moment of silence becomes the ultimate manifestation of social control in that 
it comes to be internalized without external surveillance, creating “docile bodies” 
disciplined in the act of memory. 

In the case of Rabin commemorations, we find the structure of the moment 
of silence observed during official memorial ceremonies on Rabin Memorial Day. 
However, while individuals stand and keep silent in the context of public gather- 
ings commemorating Rabin, they do so in the absence of a national siren. The 
reasons for the lack of a siren on that day are beyond the scope of this article.
What is clear, however, is that decisions regarding the institutionalization of a 
siren ushering in a national moment of silence have served to create a hierarchy 
around national mnemonic importance in Israel (where commemorating the 
Holocaust and the fallen soldiers are of prime importance). The important point 
is that the overt literal silence of individuals and collectives, when framed within 
a commemorative context, can become the ultimate mechanism through which
to promote memory.


Overt Silence in the Domain of Forgetting


The second type of overt silence is frameless, unbounded and is not encapsulated 
within mnemonic activity as its aim is not remembering but forgetting. Unlike 
overt silences in the domain of memory that are introduced either by a verbal 
announcement or by the siren, overt silences in the domain of forgetting are not 
spoken of at all. They are characterized by a complete absence of any mention 
of the topic at hand. The passage of time may in itself increase the probability of 
finding this kind of silence as witnesses pass away or grow old, and collectives grow 
bored or tired.⁷ The type of silence we analyze in this discussion, however, is not
about collectives who have simply grown apathetic or uninterested. Instead, it is
about groups who, for a variety of reasons, actively do not wish to remember or 
commemorate a specific event or person. 

In the case of Yitzhak Rabin, overt silences in the domain of forgetting are 
exemplified by Israeli state-religious schools located in highly ideological settle- 
ments in the occupied territories where much of the opposition to Rabin’s policy 
regarding the solution of the conflict with the Palestinians was centered. Ten years 
after the assassination no ceremonies took place in these schools on Yitzhak Rabin 
Memorial Day—in spite of the legislation. Children were not asked to come to 
school in special dress (as they are on every other official memorial day) and no 
special decorations, posters or activities marked the day. Several weeks prior to the 
memorial day, in a telephone conversation with one of the authors, a principal in 
one such school initially declined to participate in the study by suggesting that the 
researcher who visited the school would not like what she saw. “She may see pupils 
spitting on and thus putting out Rabin’s candle,” she warned in October 2005.
Without analyzing the assumptions made by the principal
about what the researcher would or would not like to see, it is worth noting that, 
after permission was given to visit that school, the researcher did not see anyone 
spitting on any candles because there were no candles on which to spit. While data 
were not collected at Jewish ultra-Orthodox schools, it is probably safe to assume 
that no commemorations took place there on Rabin’s Memorial Day either because 
the same kind of total silence characterizes ultra-Orthodox schools during every 
official memorial day instituted by the State of Israel—even the most consensual of 
them, such as the Memorial Day for the Holocaust (Ebenstein 2003).⁸

Another kind of complete and overt silence that eventually enhances forget- 
ting is the one exemplified by Palestinian citizens of Israel. Following Rabin’s 
assassination, Palestinian citizens of Israel (comprising 18 percent of the Israeli
population) mourned him, expressed anger and pain, and shared, perhaps for the
first time in national history, an emotional response with the Jewish majority (see 
Al-Haj 2000). In effect, Rabin’s Memorial Day could have constituted the first and 
only official holiday observed by both Jewish and Arab citizens.⁹ Following Rabin’s
assassination, Jewish Israelis did not pay much attention to the grief of their fellow
citizens (Al-Haj 2000) and certainly did not see it as an opportunity for any
sort of shared mourning. Instead, they generally perceived Rabin’s assassination 
as an internal Jewish-Israeli affair (El-Or 1998). Indeed, the first and only time 
that a Palestinian citizen of the state was invited to speak at the annual memorial 
ceremony in Rabin Square (an event organized by left- and liberal-leaning Israeli 
parties and associations) was in 2000—a full five years after the first ceremony. 

Palestinian citizens of Israel can be said to have withdrawn from participation 
in the remembering of Rabin. Ten years after the assassination, no mention of Rabin
was made on his memorial day in Palestinian schools in Israel. Five years after the 
assassination, Israeli publications in Arabic stopped publishing anything, critical
or not, about the assassination.¹⁰ There may be many factors that contributed to
this public mnemonic withdrawal. It may be part of a more general response to the 
continual exclusion of Palestinians from formal and informal spheres in Israel, or 
it may be embedded within a more specific protest against the events of October 
2000 (when 13 Israeli Palestinians were killed by Israeli police during demonstra- 
tions following Ariel Sharon’s visit to Jerusalem’s Temple Mount), or it may reflect 
a broad disappointment at the pace and status of the peace process. Nonetheless, 
the emergence of this silence seems also to be informed by thwarted attempts by 
many Palestinians to share in this specific moment. Riad Ali, a Palestinian journal- 
ist, bitterly and painfully expressed this sentiment to his Jewish countrymen in 
Maariv, a Hebrew-language newspaper, on Nov. 5, 1998: 


“Detached, that is how I feel these days, the days in which
the Jewish-Israeli public remembers its Prime Minister...
You [Jews] excluded [us]... from mourning [Rabin]. We 
wanted to be part of the family. We cried and mourned... 
We thought that he became a friend of all of us. Apparently 
he didn’t. He was a great man... I feel sorry for you...” 


This literal silence reflects a reaction to mnemonic talk that was offered by this 
constituency but rejected by the majority. The silence is thunderous, and although 
it may be rooted in a past desire to remember, it ultimately comes to be located
in the domain of forgetting.


Covert Silence in the Domain of Memory 


Schudson (1997:354-55) has argued that in an “effort not only to report the 
past but to make it interesting, narratives simplify.” Part of this simplification 
is the result of commercial considerations which seek to “make an account of 
the past palatable to all tastes—hence, bland and uncontroversial.” Commercial 
considerations, however, are not the only motivation for inducing audiences. 
Agents of memory are often motivated by a variety of reasons to set aside certain 
troubling aspects of the past recounted to enable broader collectives to participate 
in a memory that otherwise may be hard to share. Thus, within commemorative
activities and narratives, certain issues come to be ignored and silenced in the 
aim of memory. Sometimes this silencing involves complete sidelining of aspects 
of the narrative. Other times, the silencing is more subtle and is manifest, for 
example, through issues that are hinted at but not explored. Such covert silences 
can facilitate more peaceful transitions between regimes and curtail the eruption 
of conflict over representations of shameful and contested pasts. 

Covert silence in the domain of memory is often about widening the audience 
that can share the moment at the expense of a certain depth to the narrative 
offered. This kind of silence, which is manifested in what we term a bland com- 
memoration, involves a compromise on behalf of those who wish to remember. It 
allows broader collectives to participate in commemorative activities and thus to 
remember, rather than forget, the past. 

Aside from the Israeli government and its agents and municipalities, all of 
Rabin’s agents of memory belonged to or were associated with the Israeli left and 
were more or less unified around a certain narrative of the assassination. In general 
terms, this narrative includes an emphasis on the organized political campaign 
against Rabin and his government that preceded the assassination, a campaign 
that led to the three shots fired by Yigal Amir at the Kings of Israel Square (now 
called Rabin Square). In this narrative, the assassin is placed within a broader social, 
political and ideological context; and the leaders of right-wing groups, through 
their action or inaction, are understood to bear at least some of the responsibility 
for the outcome. “The assassination did not start on November 4th [the date of
the assassination],” states one of the agents of memory, “the physical assassination
took place then, but the character assassination started before.” (Vinitzky-Seroussi, 
Jan. 13, 1998 interview) Rabin’s widow, Leah, elaborates the view shared by all of 
Rabin’s agents of memory: “On the night of [the signing of the second agreement 
with the Palestinians] in Oslo [October 1995, a month before the assassination] 
you stood there [in Zion Square, Jerusalem], Mr. Netanyahu, and you incited 
the crowd against Yitzhak Rabin. I blame primarily the leadership of the Likud 
[Party]... For God’s sake, why did you [plural] assassinate him? The circumstances 
of the assassination were horrible: It is the combination of political manipulation 
and the religious establishment.” (Vinitzky-Seroussi, Dec. 30, 1998 interview) In 
all the interviews conducted with Rabin’s agents of memory, this narrative was 
recounted. Some were more direct and harsh in their accounts. Others were gentler, 
implying rather than declaring the narrative. All, however, explained the assassina- 
tion within a political context that goes beyond the action of an individual and 
permeates deep into the social fabric of Israeli society. 

And yet, when these same agents of memory prepared a special educational kit 
that was distributed to state schools before Yitzhak Rabin Memorial Day in 1997, 
no mention was made of this political context. Thus, what was mentioned in the 
context of an interview was silenced in the context of the educational kit where the 
prime focus was instead on a selective biography of Rabin that elevated him to the
status of national hero and savior. The act of the assassination was recounted and 
dramatized but its significance was de-politicized and de-contextualized. The assas- 
sin was characterized as evil, devoid of any social, political or ideological context. The 
context of the assassination was, in many ways, silenced (Vinitzky-Seroussi 2001). 

One of the agents of memory who was heavily involved in setting the mne- 
monic agenda for the state schools admits: “I would like to turn Rabin’s memory 
into something that is shared by as many walks of society as possible. If we take 
off 10% from either side of the two extremes, we are left with most of the society. 
A merely sectoral memory is eventually lost. So if I want to work to uphold his 
memory, I would like it to be shared memory of a wide cross-section... The radical 
right is lost for us but the non-radical right has a real problem [with the assas- 
sination]. It cannot say ‘leave us alone with Rabin.’ It has to swallow it without 
vomiting. This is where we stand in this seesaw.” When asked specifically about the 
content of the material prepared for the educational system, she said: “In order to 
penetrate the educational system, we cannot present [Rabin’s assassination] on the 
background of the incitement. [Why?] Because it is interpreted at least partially 
as a bill of indictment against one side [i.e., the right]... We have to make some 
effort and look for the widest common denominator short of lying, without cutting
the corners, but sometimes without discussing the corners, you understand?”
(Vinitzky-Seroussi, Feb. 28, 1999 interview, emphasis ours) A compromise was thus made:
Certain interpretations of the past would be silenced so as to enable the acceptance 
of the narrative by a broader public. 

The desire to make Rabin commemorations shared by a large audience and thus 
to keep silent on parts of the narrative is evident in arguments among the agents 
of memory over the content of the annual memorial ceremonies at Rabin Square. 


According to one of Rabin’s personal assistants, 


“The [second] memorial ceremony [in Rabin Square], was 
indeed an amazing demonstration. There was nothing like 
that in the history of the state, but we failed. [There were] 
no religious people, no one from the development towns in 
the South, and the buses from those places did not arrive... 
If we want to make Rabin a shared national [icon] we need 
to look for wide common grounds. We need to make sure 
that Rabin belongs to the entire people. If we want to see 
people from the right [at the ceremonies], we cannot make 
it into a political event.” (Vinitzky-Seroussi, April 1, 1998 interview)


Making the ceremony into a political event means first and foremost introducing, 
rather than sidestepping, the context of the assassination. One of Rabin’s most active 
agents of memory criticizes the impulse to purge the memory of the assassination
of its political context in order to make it more palatable to broader collectives (and 
thus to allow more people to remember and commemorate). She states: 


“You cry and cry and cry and one day you have to say 
something... You can mourn for a year but then you have 
to remember... There were people... who did not want the 
ceremony to be political because Yitzhak belonged to the 
entire people. But I said ‘what does being political mean?’ 
Yitzhak Rabin was assassinated as Prime Minister, as the 
leader of the Labor Party. He was assassinated because of 
politics... If we cannot talk about it there, what can we talk
about?” (Vinitzky-Seroussi, Jan. 9, 1999 interview) 


The annual memorial ceremony in Rabin Square is probably the most important 
public mnemonic activity for Rabin in that it draws a large crowd and is broadcast 
live on state television channels. In contemplating the event, Rabin’s agents of 
memory have debated whether to offer the entire interpretation of the event as 
they understand it, or to condense the narrative so as to allow a larger public to 
remember together. In certain years the former perspective has won out, in others
the latter. Regardless, many of Rabin’s agents of memory seem to agree with one
of the most active of Rabin’s agents of memory who states that “If they [the agents
of memory who organize the memorial ceremony] won't make adjustments, only
half of the people will remember [Rabin’s assassination] and the other half will
burn every place where Rabin’s statue will be erected.” (Vinitzky-Seroussi, April
1998 interview) Such a perspective has meant that in the educational system and
in some of the memorial ceremonies, a certain contextualization and interpretation
of the event is not elaborated upon. The aim of this silencing, hidden as it
is within the ample mnemonic talk that is offered, is not forgetting. Rather it is
about the consolidation of a shared and wide memory of the event and of the man. 


Covert Silence in the Domain of Forgetting 


Complete silence about the past is one way through which collectives may try 
to forget the past. Others, however, develop sophisticated mechanisms through 
which to attempt to effect forgetting, some of which carry the appearance of 
commemoration. This is often a result of the fact that while certain groups find 
commemorating certain people or events to be unacceptable or uncomfortable 
for a variety of reasons, keeping totally silent on these issues is increasingly being 
perceived as illegitimate within the broader society. Moreover, agents of memory 
are often aware of potential criticism that may be raised against them if they fail 
to mention certain elements of the past and thus preemptively respond to these 
potential criticisms by incorporating difficult aspects of the past in ways that
minimize their impact (Teeger and Vinitzky-Seroussi 2007). Such covert silences 
are not easily identifiable and thus not easily critiqued as they are covered and 
hidden by much mnemonic talk. 

The legislation enacting Yitzhak Rabin Memorial Day did not specify that 
schools had to conduct an official memorial ceremony. And yet, following a long 
tradition of marking official state memorial days with a school ceremony, most 
Israeli state schools interpreted the new memorial law as one that required a public 
performance in front of the school community consisting of pupils, teachers 
and parents (Vinitzky-Seroussi 2001). The requirement to commemorate Rabin's 
assassination, however, did not turn it into an event that all sectors wished to 
remember and mark. While 10 years after the assassination ceremonies were still 
conducted in all state schools (except for the ideological state-religious schools 
located in the occupied territories), the ceremonies in state-religious schools had 
undergone a significant change. This change indicates that commemorating Rabin 
had moved from operating as a “mythic duty” to a “rational requirement” (Olick 
and Levy 1997) and quite clearly exemplifies the notion of covert silence. 

By 2005, Rabin and the assassination were no longer the pivotal, central and
sole foci of the ceremonies conducted in these schools but shared the time and
space (decorations at the entrances to schools and in their corridors for example)
with at least one, and sometimes two, other topics. The first topic to be integrated
into the ceremony held in these schools on Rabin Memorial Day was one commemorating
the Biblical matriarch, Rachel (Rachel Imenu). The commemoration
of Rachel in a school ceremony constitutes a new and “invented tradition”
(Hobsbawm and Ranger 1992). As narrated in the Old Testament, Rachel, Jacob’s
beloved wife, died during the birth of her second son, Benjamin. Many Orthodox
Jews traditionally visit the place where she is believed to be buried (Rachel’s Tomb)
on the supposed date of her death, Yud Alef of Cheshvan (the eleventh day of the 
Hebrew month Cheshvan). However, until recently, her death was never marked 
in state-religious school ceremonies. The fact that Rabin was assassinated one day 
after Rachel is believed to have died may explain the ease with which this new 
tradition was incorporated into the schools. 

The second topic added to the ceremony commemorating Rabin was rain or, 
more accurately, a prayer for rain that is traditionally recited by observant Jews 
around that time of the year (i.e., from autumn through winter). Needless to say, 
the link made between the three topics (Rabin, Rachel and Rain) was far from 
trivial, and indeed the organizers of the ceremonies in these schools took great 
pains to construct a connection between the three. The important point for the 
purposes of this article, however, is that mnemonic activities that combine various 
topics in time and space, serve to diminish the potency of any one event com- 
memorated as the commemorated issues blur into each other. Such processes are 
not about distracting one’s attention from the proverbial “elephant in the room” 
and thus denying its existence (Zerubavel 2006) because, as noted above, complete
denial is socially proscribed (Olick and Levy 1997). Instead, these processes
are about diminishing the size, importance and magnitude of “the elephant” by 
recognizing it but only as one part of a much larger, busier picture. 

While replacing a specific memorial day for a specific individual (or event) with 
a generalized day focused on many individuals or events may have many financial, 
commercial and logistical advantages (as is the case in the United States with 
Presidents Day), it may threaten the ability to concentrate on a specific individual 
or event. The uniqueness of the event or person may be diminished such that all 
the events and people who share the commemorative time and space may become 
interchangeable or forgotten altogether. Moreover, in such cases, the amount of 
time dedicated to remembering a specific topic within all the issues contained by 
the commemorative space and time may vary and thus hierarchies of importance
may come to be constructed. 

Not surprisingly, after Rabin’s commemorative time was shared with Rachel
and Rain, the number of songs and texts dedicated to his memory declined
significantly. Even if Rabin remains the center of the ceremony, once the
commemorative time is shared, uncomfortable content can more easily be pushed into the
margins as the silences become veiled by an abundance of commemorative
activity and speech. The incorporation of many topics into one mnemonic 
time—what we term a cacophonous commemoration —suggests that much talk 
can sometimes be related more to forgetting than to memory. As such, a mul- 
titude of commemorative activities in a single space or time can serve to create 
so much mnemonic stimulation that the uniqueness and content of any one 
commemoration can get lost. As the case of Rabin’s commemoration illustrates,
it becomes quite difficult to accuse teachers and principals of being silent
about the event because the event is in fact commemorated. And yet, forgetting
is enhanced by the mere fact that Rabin is remembered within a cacophonous 
commemoration. Covert silence—silence that is hidden in much commemo- 
rative talk—can be an extremely sophisticated mechanism through which to 
effect collective amnesia about certain issues, people or events. Amplification, 
in short, is not always about hearing better, and silence itself may be facilitated 
and escorted by much noise. 


Concluding Remarks 


The notion of silencing the past and thus burying specific events is not new. But, 
in a world that demands talk and memory even about pasts that contain embarrassing
moments, human rights violations, shameful events and little to be proud
about, silence may conquer a new position and social space. Most immediately,
silence is connected in our minds with forgetting, while talk is tied to remembrance.
In this article, however, we saw that silence is also part of the language of
remembrance, and talk can be found in the language of forgetting. 

At the extremes of memory and forgetting, we find two types of overt silence. 
The first, in the domain of memory, is heavily ritualized, bounded, short and
escorted either by a siren or by much mnemonic talk that comes before or after 
the moment of silence. This kind of silence is perhaps the highest official honor 
that can be granted to the past. The second is silence in the domain of forgetting. 
Here, the silence is not framed by memory. It does not exist in a clearly demarcated 
time or space and it is often generated by those who object to the commemoration. 
This kind of total silence can also be found among groups that withdraw from 
participating in the mnemonic activity and who have perhaps despaired at the 
possibility of being equally and fully integrated as part of a democratic community. 
The end product—forgetting—may thus look similar, but the basis for the silence 
may be very different. In effect, while silence that is generated by opposition may 
leave some hope for a shared future and understanding, silence that is generated 
by social and political despair leaves little if any such hope.

In between these extremes of literal silence, we find covert silences: silences 
that are contained within and disguised by much mnemonic talk. First, there are 
covert silences in the domain of memory. These are used by agents of memory who 
choose to give up on part of their preferred interpretation of the past (often the 
context of the event) so as to enable various collectives to participate in mnemonic 
activities, to enlarge the potential mnemonic audience, and thus to enhance mem- 
ory. The second kind of covert silence is embedded in an expectation that the past 
be commemorated coupled with little desire to do so. This type of covert silence 
thus falls within the domain of forgetting and is achieved through cacophonous 
commemorations where a mnemonic time and space are shared with many other 
issues. Thus, what looks like remembrance may, in fact, be aimed at forgetting.

Given that our analysis is located within the contemporary social challenge of 
coping with difficult pasts, it should come as no surprise that different collectives 
have different interests in how events are remembered or not. In such a context, 
the potential for social conflict is never far away. Our analysis suggests that overt 
silences may be more susceptible to enhancing social conflict than covert ones. In 
the domain of forgetting, overt silence can raise much criticism and fuel social 
conflicts and protests as various collectives object to attempts to erase the past. In 
the domain of memory, overt silences can create a context for social conflict as 
those who do not adhere to the social norms demanded by moments of silence 
generate social antagonism. Covert silences, on the other hand, may help mitigate 
social conflicts. In the domain of forgetting, covert silences that are hidden by 
much mnemonic talk are often difficult to identify, critique and therefore protest 
against. In the domain of memory, covert silences are about widening the audience 
that can share a less divisive version of the past. The power of veiled silences as a 
mechanism for coping with a difficult past lies precisely in their ability to minimize 
the potential for social conflicts. However, the price may be too high for those who 
wish to remember as the process may ultimately result in forgetting and denial. 

Each of the types of silence discussed in this article can have unintended consequences.
Moments of silence can lose their uniqueness as they become increasingly
popularized and used in a multitude of commemorative events. Overt silence in 
the domain of forgetting may result in objection, criticism and public outcry and 
may thus, ironically, become the locus of commemorative activity. Covert silence 
in the domain of memory may lead to social amnesia as more and more of the 
narrative gets lost in the attempt to appease too many audiences. Finally, covert 
silence in the domain of forgetting may end up stimulating memory as the sym- 
bolic offering of memory gets taken up by audiences who somehow paid attention. 
Silence, like memory, is unstable and unpredictable. 

In this article, we identified a typology of silences by drawing on the case of 
Israel in general and the commemoration of the assassination of Yitzhak Rabin 
in particular. We view this exercise as an opportunity to open doors for future 
research on how overt and covert silences in the domains of memory and forget- 
ting are used in different social contexts where groups are being called upon to 
deal with their difficult and shameful pasts. A good starting point would be to 
examine these issues in the contexts of European colonialism, American slavery, 
ethnic cleansing in the Balkans and apartheid in South Africa. To be more con- 
crete, scholars may want to start by unpacking the meanings behind the renaming 
of Soweto Day as Youth Day in post-apartheid South Africa. They may also want 
to examine the representations of particular aspects of the past, such as violent 
resistance in the struggle against apartheid. To take just one other case, while the 
Holocaust is probably the most researched event in studies of collective memory, 
we believe that a systematic cross-national comparison of the different forms of 
silence may synthesize and highlight crucial aspects of the mnemonic reality of 
this period. Yad Vashem:¹¹ Holocaust Martyrs’ and Heroes’ Authority in Jerusalem,
the Jewish Museum in Berlin and the United States Holocaust Memorial Museum 
in Washington, D.C. all represent and silence different aspects of the same event 
(evidenced even by their names). Moving from representations of the Holocaust 
to more general representations of World War II may be further illuminating. 
Examining continuities and changes in the representations of Pearl Harbor in
the USS Arizona Memorial in Hawaii, one may find a variety of silences around
the history of the military base,¹² the decision to join the Allied Forces, and the
atomic bombing of Hiroshima and Nagasaki. On a different level of analysis, one 
can examine issues of change and continuity in how individuals speak and keep 
silent about “cultural traumas” (Alexander 2004) as social conditions transformed
(Stein 2009). In these and other cases, attending to the configuration of overt and
covert silences may help us to understand better not only the particular cases but
also issues around the challenges inherent in dealing with difficult pasts. 

Forgetting and denial, as we have illustrated, can be achieved by silence, but 
they can also be achieved by much talk. Inversely, memory may be achieved by 
much talk, but it may also be enhanced through silence. In this article we have 
shown how silence should be understood as a socially embedded construct used 
for different ends by different collectives. Furthermore, we have suggested that silence
is a broad concept, one that includes a variety of mechanisms enabling both
forgetting and remembering. In a world that still believes that some past events 
and people should be remembered, addressing the role of silence may, in fact, be 
the key for understanding not only collective amnesia but also collective memory. 


Notes 


1. The formula “land for peace” is not only championed by Rabin’s supporters and the 
Palestinians but is also based on United Nations resolutions 242 and 338.


2. By “agents of memory” we refer to individuals who work (sometimes as their full-time 
profession) to construct, build and maintain the memory of a specific person or event. 
In Fine’s terminology (1996), these individuals may be thought of as “reputational 
entrepreneurs.”


3. The newspaper data were collected in the said years over a period of four weeks in each 
year (beginning two weeks prior to Rabin Memorial Day and ending two weeks after). 


4. Interestingly, Freud (1967[1939]) himself seemed to prefer the term denial over
repression when talking about more collective processes. 


5. These are sounded using the same state infrastructure used during times of war to 
warn citizens of imminent danger.


6. According to Jewish tradition, and adopted by the Israeli State, every holiday, 
including memorial days, lasts from sunset to sunset. 


7. It is worth noting that the relationship between time and this type of silence is not 
always linear as interested parties may rediscover buried and forgotten histories. See 
for example the recovery of the story of Masada after 2,000 years of silence (Ben Yehuda
1995; Y. Zerubavel 1995).


8. Generally speaking, ultra-Orthodox Jews do not support the State of Israel since 
they believe not only that the Jewish state should be a religious one but also that 
it should not have come into existence before the arrival of the Messiah and the 
rebuilding of the Temple in Jerusalem. As such, members of this community refrain 
from participating in state ceremonies and rituals. 


9. Other official holidays instituted by the Israeli state in no way create a platform for 
any sense of shared experience. Except for the Memorial Day for the Holocaust, all of 
the official holidays in Israel are either Jewish religious holidays or national holidays 
that mark military victories over Arab countries and thus constitute days of mourning 
for the Palestinians. The most notable of these days is of course Israel’s Independence 
Day which is the Palestinians’ Nakba (Catastrophe). 


10. One of the newspapers studied stopped publishing anything about Rabin two years
following the assassination.


11. Literally translated as “a memorial and a name.” 


12. For the detailed analysis of the films presented in this memorial site, see White (2001). 


World-System Mobility and Economic Growth, 1980-2000


Rob Clark, University of Oklahoma 


World-system scholars have traditionally emphasized the stability of the core/periphery 
hierarchy. However, prior network studies employing both categorical and continuous 
measures of world-system position reveal substantial mobility across time, whereby 
a number of developing states have become more integrated in the world economy 
over the past several decades. The developmental impact generated by such mobility 
remains unexplored. Using data on all commodities in the world trade network, I 
examine the presence and economic impact of world-system mobility across 110 
states during the 1980-2000 period. First, I show that the continued upward mobility 
of “middle-tier” states from East Asia and the semiperiphery no longer contributes 
to the long-term trend of trade convergence, but began producing divergence during 
the final decades of the 20th century. Second, using difference models that control for
lagged values, I find that world-system mobility positively affects economic growth, 
net of initial world-system position, capital and labor inputs, and other regional and 
economic characteristics. In fact, the magnitude of mobility’s effect is substantially 
larger than that of initial world-system position and second only to initial human 
capital. These findings support a range of perspectives that are collectively useful for 
understanding the rapid economic growth of emerging states. 


Introduction 


Network studies of the world-system highlight the importance of occupying an 
integrated “core” position in the world economy. These studies commonly reveal
the economic, political and social welfare problems that plague non-core states in 
the semiperiphery and periphery (Bollen 1983; Bollen and Jackman 1985; Clark 
2008; Kick 1987; Kick and Davis 2001; Nemeth and Smith 1985; Nolan 1983; 
Snyder and Kick 1979). World-system scholars readily acknowledge that some 
developing nations have experienced upward mobility along the core/periphery 
hierarchy. However, the belief that such movement occurs infrequently accounts 
for the paradigm’s focus on hierarchical stability, rather than change. Thus, a 
central theme emerging from the world-system literature is that core and non-core 
nations travel along divergent economic trajectories. 

Recent network studies of the world economy, though, reveal substantial upward 
mobility by developing states over the past several decades, which has produced 
convergence in core/periphery trade relations (Kim and Shin 2002; Mahutga 2006; 
Smith and White 1992). Moreover, there is evidence that rapid economic growth 
in some parts of the developing world has occurred with sufficient frequency and
power during the post-WWII era to generate large-scale effects, such as the leveling 
off of world income inequality (Bourguignon and Morrisson 2002). To the extent 
that these trends may be connected, a closer examination of world-system mobility, 
and its potential impact on development, is warranted. 

In this study, I use difference models with lagged values to investigate the 
effects of world-system position and mobility on economic growth across 110 
states during the 1980-2000 period. I measure world-system position using a 
continuous core/periphery indicator of a state’s level of network integration in 
international trade, while using world-system mobility to refer to a state’s change 
in world-system position across time. Reaffirming theoretical expectations, I find 
that world-system position is positively associated with economic growth. More 
importantly, I find that upward mobility also produces substantial growth, net 
of initial world-system position, capital and labor inputs, and regional and economic
characteristics. In fact, a comparison of t-ratios across predictors reveals
that world-system mobility is more likely to be associated with economic growth
than any other measure across every model featured. Furthermore, the magnitude 
of mobility’s effect is consistently larger than initial world-system position by 
substantial margins and is second only to a state’s initial level of human capital. 
I interpret the positive effects of world-system position and mobility as broadly 
consistent with a range of perspectives, including those found in economics and 
world-system theory. 


Upward Mobility in the World-System 


World-system research traditionally emphasizes the reproduction of class hierar- 
chies that results from states occupying a particular world-system position (core, 
semiperiphery or periphery). Thus, much less attention is afforded to change in 
world-system position or to the consequences that such change might produce. 
“Rather than focusing on the exceptional success stories, the upwardly mobile 
states, attention should be given to the more usual patterns revealed by the relative 
stability of the core/periphery hierarchy.” (Chase-Dunn 1998:121) 

However, network studies of the world economy reveal substantial upward 
mobility and convergence in recent decades. Examining trade data from 105 
countries during the 1959-1996 period, Kim and Shin (2002) find a notable 
increase in network density (i.e., the proportion of all country pairs with ties) 
primarily due to the emergence of developing states that have begun trading 
with a greater number of partners. As a result, the world trade network has 
become more decentralized (i.e., the extent to which ties are equally distributed 
across countries) during the post-WWII era, as trade relations have become less 
monopolized by core states. 
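
As an illustrative aside, the density measure defined above (the proportion of all country pairs with ties) can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions: ties are treated as undirected, the function name `network_density` is hypothetical, and the four-country network is invented for illustration rather than drawn from Kim and Shin (2002).

```python
from itertools import combinations

def network_density(countries, ties):
    """Proportion of all unordered country pairs that share a trade tie.

    Assumption: ties are undirected, so (A, B) and (B, A) count once.
    """
    pairs = list(combinations(sorted(countries), 2))
    if not pairs:
        return 0.0
    tie_set = {frozenset(t) for t in ties}
    linked = sum(1 for a, b in pairs if frozenset((a, b)) in tie_set)
    return linked / len(pairs)

# Hypothetical four-country network (not data from any cited study).
countries = ["A", "B", "C", "D"]
ties_early = [("A", "B"), ("A", "C")]                        # 2 of 6 pairs
ties_late = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D")]  # 4 of 6 pairs

density_early = network_density(countries, ties_early)
density_late = network_density(countries, ties_late)
```

In this toy case density rises from one third to two thirds as the non-central countries B, C and D acquire new partners, which is the kind of change the paragraph above describes.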

Figure 1. World-System Mobility in Mahutga’s Trade Network

Moreover, network studies investigating core/periphery dynamics have revealed
substantial movement along this hierarchy. Using hierarchical five-block partitions,
Smith and White (1992) and Mahutga (2006) detect considerable inter-block mobility
using data on selected commodities¹ of the international trade network during
the 1965-1980 period. In their 63-country dataset, Smith and White (1992)
find that a third of these states (21) changed blocks from 1965 through 1980—a 
remarkably high proportion of cases given the relatively short period of time under 
investigation.² Perhaps most noteworthy, 19 of these 21 states experienced upward
block mobility, implying a “growing core and shrinking peripheries” (Smith and 
White 1992:887) trend that is consistent with Kim and Shin’s (2002) finding of con- 
vergence. More recently, Mahutga (2006) finds even greater movement, as almost 
half of all states (26 total) in his 53-country dataset experienced block mobility from 
1965 through 1980.³ Moreover, 22 of these 26 states experienced upward mobility,
which again suggests considerable network convergence. 

In addition to portraying upward mobility as a relatively infrequent phenom- 
enon, world-system scholars also tend to characterize upward mobility as fairly 
limited in scope. Peripheral states may elevate themselves to the semiperiphery 
via “dependent development” (Evans 1979), but non-core states, in general, are 
“very rarely” able to gain core entry (Chase-Dunn 1998:121). However, Smith and 
White (1992) and Mahutga (2006) each find that seven of their upwardly mobile
cases (about a third of their respective totals) represent semiperipheral entry into 
the core during the 1965-1980 period. 

Mahutga (2006) also constructs a continuous indicator of world-system position 
at several points in time during the 1965-2000 period. Using his reported scores, 
I find that Mahutga’s trade network exhibits signs of significant β-convergence⁴
during the 1965-2000 period. Figure 1 depicts a scatter plot of world-system 
position (1965) by world-system mobility (the change score between 1965 and 
2000) in Mahutga’s trade network, with the fitted line indicating a clear negative 
association ( = -.320, p = .000). Notice also that states customarily located in the 
core (indicated by diamonds) are concentrated in the lower right-hand corner of 
the plot, indicating a high initial network score in 1965, coupled with downward 
mobility during the 1965-2000 period. In sum, recent network studies on the 
world trade network reveal considerable movement and convergence over the past 
several decades. Thus, an important question to consider is whether this upward
mobility has translated into economic gains for developing nations.
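
The logic of the convergence check described above, in which mobility is the change score between two time points and a negative association with initial position signals convergence, can be illustrated with a short Python sketch. The scores below are hypothetical and are not Mahutga’s (2006) reported values, and the helper `ols_slope` is an assumed name for a plain bivariate least-squares slope, not the estimator used in the study.

```python
def ols_slope(x, y):
    """Bivariate least-squares slope of y regressed on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    return sxy / sxx

# Hypothetical world-system position scores at two time points.
initial = [0.1, 0.3, 0.5, 0.9]
final = [0.3, 0.4, 0.55, 0.85]

# Mobility is the change score: later position minus initial position.
mobility = [f - i for i, f in zip(initial, final)]

# A negative slope means initially peripheral states gained the most,
# which is the pattern read as convergence.
slope = ols_slope(initial, mobility)
```

With these invented scores the slope is negative: the states that start lowest move up the most, while the state that starts highest slips slightly.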


Upward Mobility and Development 


There is little debate that cross-national income inequality skyrocketed between 
the early 1800s and WWII. However, the lack of a clear or robust trend during 
the post-WWII era (Bourguignon and Morrisson 2002; Firebaugh 1999, 2000, 
2003; Firebaugh and Goesling 2004; Korzeniewicz and Moran 1997; Milanovic 
2005) suggests that inequality may have plateaued. Several studies reveal conditional
β-convergence in economic growth, whereby developing states with high
levels of human capital (Barro 1991, 2001) and open trade policies (Sachs and 
Warner 1995, 1997) have begun to catch up to wealthier countries. Alternatively, 
the spread of industrialization to the developing world is a popular explanation 
for why some poorer countries have economically outgrown advanced nations in 
recent decades (Amsden 2001; Firebaugh 2003; Firebaugh and Goesling 2004), 
but others remain skeptical (Arrighi, Silver and Brewer 2003; Chase-Dunn 1998). 
Other studies suggest that states are clustering at both ends of the income distri- 
bution, producing a polarized world economy (Quah 1996a, 1996b, 1996c). In 
sum, the literature suggests that world income inequality has leveled off during 
the post-WWII era due to a select group of economies that have emerged from 
an otherwise stagnant developing world. 

What can account for such trends, including the leveling off, yet persistence, of 
cross-national income inequality? I draw from the work of economists and sociolo- 
gists who recognize the potential for international trade and network dynamics to 
structure cross-national variation in economic growth. I argue that the core/periphery 
hierarchy in the world trade network may help explain the persistence of cross-national 
inequality, if not also the polarization of national economies. On the other hand, 
upward/downward mobility along this hierarchy may help account for the leveling off 
of cross-national inequality due to the emergence of several poorer nations.

The potential for international trade to produce economic growth is a highly 
regarded thesis among economists. Open economies have a greater capacity to 
absorb new ideas and technological innovations that are generated in other coun- 
tries (Edwards 1992). Moreover, because open economies are exposed to the 
global stock of knowledge, trade integration reduces duplication in R&D, thereby 
enhancing efficiency (Grossman and Helpman 1991). Trade openness, via import 
liberalization and export promotion, also introduces competition from the world 
market, which can inspire innovation more so than in autarkic regimes where 
producers operate in a more protected environment (Grossman and Helpman 
1994). Overall, then, international trade diffuses technology, provides access to the 
marketplace of ideas, boosts efficiency, and introduces competition, all of which 
stimulate productivity, thereby leading to greater growth in output. This enhanced 
productivity provides a theoretical explanation for the empirical link commonly 
reported between trade and growth by economists, as indicated by trade openness 
([imports + exports]/GDP) (Barro 2001; Frankel and Romer 1999) and liberalization
policies (Collins and Bosworth 1996; Greenaway, Morgan and Wright 1998;
Sachs and Warner 1995, 1997). 

However, a network measure of trade is theoretically preferable for capturing 
the various mechanisms that economists suggest are operative. Trade openness 
considers a state’s volume of exchange, but ignores information about its trading 
partners. Thus, an open economy may exchange a high volume of goods relative 
to its GDP, but may do so with a small number of partners, thereby limiting com- 
petition as well as access to technological innovation. Worse yet, trade indicators 
that focus on liberalization policies merely reflect a state’s potential for exchange. 
In contrast, a network measure of trade more precisely reveals the extent to which 
a state has access to the accumulated knowledge that diffuses across trade partners. 
Theoretically, the core/periphery hierarchy in the world trade network reveals the 
considerable disparity that exists between states regarding access to technology 
spillovers. Thus, cross-national “spillover disparities” may help account for the 
persistent economic gap between advanced, integrated core states and stagnant, 
isolated nations in the periphery. 
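The limitation of trade openness noted above can be made concrete with a toy comparison. In this sketch, two hypothetical states have identical openness but differ in the number of trading partners; the flows, the GDP figure and the helper `summarize` are all illustrative assumptions.

```python
def trade_openness(imports, exports, gdp):
    """Trade openness as (imports + exports) / GDP."""
    return (imports + exports) / gdp


def summarize(flows, gdp):
    """Return (openness, number of partners) for one state.

    `flows` maps each partner to an (exports, imports) pair.
    """
    exports = sum(x for x, _ in flows.values())
    imports = sum(m for _, m in flows.values())
    return trade_openness(imports, exports, gdp), len(flows)


# Hypothetical bilateral flows (not real data).
state_a = {"P1": (30.0, 30.0)}                      # one large partner
state_b = {"P1": (10.0, 10.0), "P2": (10.0, 10.0),
           "P3": (10.0, 10.0)}                      # three smaller partners
gdp = 100.0

for name, flows in (("A", state_a), ("B", state_b)):
    openness, partners = summarize(flows, gdp)
    print(name, openness, partners)
```

Both states trade the same share of GDP, so an openness measure cannot tell them apart, whereas a network measure registers the difference in partners.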

World-system scholars emphasize, however, that the core/periphery hierarchy is 
primarily important for indicating the particular role that each state performs in 
the world economy. Poorer nations in the periphery serve as nodes of extraction 
for wealthy, advanced states in the core. Surplus value is channeled away from the 
periphery and into the core via “vertical” trade (i.e., the unequal exchange of raw 
materials for manufactured goods) and/or foreign investment penetration (where- 
by profits generated in the periphery are repatriated to the core). Core/periphery 
relations are generally characterized as exploitive, and they persist because isolated 
nations in the periphery are dependent on their economic ties with the core for 
receiving processed goods and/or investment capital. Peripheral economies are 
also thought to be “disarticulated” because they are organized around the extractive
logic of global capitalism. The classic symptoms of disarticulation include
(1) the reduction of social services because the economy is outward-oriented, (2)
unemployment because of excessive internal migration to productive sectors, and
(3) wage deflation because productive sectors are juxtaposed with non-productive
sectors (Stokes and Anderson 1990). In sum, a state’s level of coreness in the world 
trade network should positively affect economic growth because integrated core 
states enjoy greater access to technology spillovers, and isolated nations in the 
periphery suffer from economic dependency and disarticulation. 

At the same time, upward mobility in trade may be able to account for the 
leveling off of inequality caused by rapidly growing economies in the developing 
world. Economists suggest that there may be sizeable growth effects for develop- 
ing countries exiting autarky. While all states stand to benefit from technology 
spillovers generated by trade, developing countries may have the most to gain 
from expanding trade relations because most innovation actually occurs in advanced
nations, where knowledge is most heavily accumulated (Grossman and
Helpman 1990; Romer 1990). Consequently, economists find that the positive
effect of trade persists (Edwards 1992; Harrison 1996) and may even be stronger
(Barro 2001) among developing countries. Even if a developing economy does not
become a core trading power overnight, its upward mobility in the trade network 
may explain growth dynamics in the transition that a simple core/periphery hierar- 
chy may not appreciate. All else being equal, a peripheral state that is experiencing 
upward trade mobility should develop more quickly than a peripheral state that 
remains “relationally stagnant” or downwardly mobile. 

The idea that upward mobility may positively affect economic growth is also 
compatible with the world-system perspective. Economists suggest that the 
mechanism linking mobility to growth is enhanced exposure to competition 
and innovation. In contrast, world-system scholars suggest that upward mobility 
sparks development because it places ascending economies in a position of greater 
exploitive advantage vis-a-vis stagnant nations in the periphery. Thus, develop- 
ing countries elevating themselves to, say, the semiperiphery are able to achieve 
economic growth not necessarily as the result of technology transfers, but because 
semiperipheralization alters their relationship with those nations remaining in the 
periphery (see Wallerstein 1979). By the same token, downward mobility intensi- 
fies economic dependency on core nations. Thus, world-system scholars expect 
downwardly mobile states to experience slower economic growth as a result of 
occupying more exploited positions. 

Nevertheless, while a range of perspectives converge on the idea
that upward mobility in the world trade network has the potential to produce 
growth, the hypothesis has yet to be tested empirically. Using data from Smith 
and White (1992), we can compare GNP per capita growth rates (1965-1983) 
between those countries exhibiting either downward or no mobility between 
world-system zones during the 1965-1980 period, and those countries exhibiting
upward mobility during the 1965-1980 period. Table 1 shows the results
of this comparison, with higher growth rates enjoyed by the upwardly mobile 
states, controlling for zone of destination. These data suggest that upward mo- 
bility across world-system zones is itself positively associated with economic 
development, independent of what particular position a country occupies. Of 
course, while the pattern in Table 1 suggests that world-system mobility and 
economic growth may be correlated, multivariate analyses involving a larger 
sample and complete trade data, and covering a more recent time period, are 
warranted for investigating this relationship further. 


Methods 







Data 


Unless otherwise noted, data come from the 
World Development Indicators (International Bank 
for Reconstruction and Development 2004). 
trade relations (Kick 1987; Snyder and Kick 1979; Van Rossem 1996), and several
rely exclusively on trade to construct a state’s world-system position (Mahutga 
2006; Nemeth and Smith 1985; Smith and White 1992). Following this tradi- 
tion, I use data on all commodities in the international trade network to measure 
a state’s world-system position. Data come from the Direction of Trade Statistics 
(International Monetary Fund 2004). Given the skewed nature of trade flows, I 
use the base-10 logarithm of the raw data to measure ties. When constructing 
network data on trade, analysts can rely on either export data (trade flows from 
the reporting country to its partner) or import data (trade flows to the reporting
country from its partner). Because both reports consider trade flows to and from 
each country pair, only one is necessary to fill the entire network. I use the import 
version because it is considered more accurate than the export version (see Kim 
and Shin 2002). Thus, in the world trade network, ties are measured as the base-10
logarithm of the average value of all imports from country_i to country_j for the
1980-1990 and 1990-2000 periods.
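The tie measure described above can be sketched as follows. The import values and the helper `log_tie` are hypothetical, and the handling of zero flows (coded as no tie) is an assumption of this sketch, since the text does not say how empty cells are treated.

```python
import math


def log_tie(values):
    """Base-10 log of the average import value over a period.

    Zero flows are coded as no tie (0.0); this is an assumption of
    the sketch, because log10(0) is undefined.
    """
    avg = sum(values) / len(values)
    return math.log10(avg) if avg > 0 else 0.0


# Hypothetical annual import values for three directed country pairs.
imports_by_pair = {
    ("A", "B"): [1000.0, 1200.0, 800.0],
    ("B", "A"): [10.0, 10.0, 10.0],
    ("A", "C"): [0.0, 0.0, 0.0],
}

# One logged tie per directed pair; (A, B) and (B, A) differ,
# which is what makes the network asymmetrical.
ties = {pair: log_tie(v) for pair, v in imports_by_pair.items()}
print(ties)
```

Logging compresses the skew in raw trade values, so a hundredfold gap in average imports becomes a gap of two units in tie strength.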

Early network studies used CONCOR (Kick 1987; Nemeth and Smith 1985;
Snyder and Kick 1979), a partitioning method based on structural equivalence,
whereby actors are considered equivalent if they share identical ties to and from
all other actors in the network (Wasserman and Faust 1994). Subsequent network
studies relax this criterion by using partitioning methods based on role or regular 
equivalence (Mahutga 2006; Smith and White 1992; Van Rossem 1996), whereby 
actors are considered equivalent if they occupy the same structural role, even if the 
identities of the actors with whom they share ties are not identical (Wasserman 
and Faust 1994). These prior efforts all employ methodologies that assign coun- 
tries to discrete world-system positions (i.e., countries with similar patterns in 
their network relations occupy similar positions in the world-system). 

Theoretically, networks that obey a core/periphery structure feature a set of core 
actors who enjoy ties with all other actors, along with a set of peripheral actors 
who only share ties with the core and are isolated from one another. In the present 
study, I use the continuous coreness procedure available in the UCINET 6 soft- 
ware package (Borgatti, Everett and Freeman 2002) to calculate a state’s location 
along the core/periphery hierarchy in the international trade network. In contrast 
to previous approaches, however, the procedure does not categorically assign states 
to core and peripheral groups so much as it estimates each actor’s distance from the 
core. Moreover, the procedure considers the centrality of those with whom each 
actor shares a tie. For example, ceteris paribus, a country whose trade partners are 
central would be considered more core-like than a country whose trade partners 
are peripheral. Continuous coreness scores range from 0 to 1, with larger values 
indicating greater coreness (or network integration), while smaller values indicate 
greater peripherality (or network isolation). 
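The point that a state's score depends on the centrality of its partners can be illustrated with a simple stand-in. The sketch below is not the UCINET continuous coreness routine; it uses an eigenvector-style power iteration on a small, symmetric, binary network, and the network, the function name and the iteration count are all assumptions made for illustration.

```python
import math


def eigenvector_scores(adj, iterations=500):
    """Leading-eigenvector scores by power iteration (unit length)."""
    n = len(adj)
    c = [1.0] * n
    for _ in range(iterations):
        nxt = [sum(adj[i][j] * c[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(v * v for v in nxt))
        c = [v / norm for v in nxt]
    return c


# Hypothetical network: node 0 is a hub; nodes 4 and 5 each have
# exactly one partner, but node 4's partner is the hub while
# node 5's partner is the weakly connected node 3.
edges = [(0, 1), (0, 2), (1, 2), (0, 3), (0, 4), (3, 5)]
size = 6
adj = [[0.0] * size for _ in range(size)]
for a, b in edges:
    adj[a][b] = adj[b][a] = 1.0

scores = eigenvector_scores(adj)
print([round(s, 3) for s in scores])
```

Nodes 4 and 5 have the same number of ties, yet node 4 scores higher because its single partner is central, which mirrors the ceteris paribus example in the text.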

Because the world trade network is asymmetrical (i.e., the volume of trade from
country_i to country_j may not equal the volume of trade from country_j to country_i),
two types of “coreness” become possible: out-coreness (based on extensive export
relations) and in-coreness (based on extensive import relations). Unfortunately,
the continuous coreness procedure only considers a state’s level of out-coreness,
such that all ties are attributed to country_i (the country corresponding to the row
of the cell with the tie), while the identity of country_j (the country corresponding
to the column of the cell with the tie) is ignored (see also Boyd, Fitzgerald,
Mahutga and Smith 2006). Consequently, when I interchange the rows and columns
of the trade network (i.e., taking the transpose of the matrix), the coreness
scores based on a country’s level of in-coreness are not perfectly correlated with the
coreness scores based on a country’s level of out-coreness in either the initial 1980-1990
period (r = .976) or the latter 1990-2000 period (r = .977). Rather than
relying on only one type of coreness to calculate a state’s world-system position, I 
incorporate both, using each state’s average coreness score derived from the origi- 
nal (out-coreness) and transposed (in-coreness) matrices for both time periods. I 
standardized both versions of coreness scores (mean = 0; standard deviation = 1) 
prior to calculating the average for each time period. The lagged value of world-system
position refers to each state’s average coreness score for the 1980-1990
period, while world-system mobility is calculated as the change score (T_2 − T_1) in
world-system position between the 1980-1990 period and the 1990-2000 period.
Appendix A lists the 110 states in my sample and reports each state’s world-system
position for the 1980-1990 (WSP_1) and 1990-2000 (WSP_2) periods, as well as
each state’s world-system mobility score for the 1980-2000 period (MOB).
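The construction of world-system position and mobility can be sketched as below. The coreness values are invented, and two choices are assumptions of this sketch rather than details given in the text: standardization is done separately within each period, and the population standard deviation is used.

```python
from statistics import mean, pstdev


def zscores(xs):
    """Standardize to mean 0 and standard deviation 1."""
    m, s = mean(xs), pstdev(xs)
    return [(x - m) / s for x in xs]


def world_system_position(out_core, in_core):
    """Average of standardized out-coreness and in-coreness."""
    z_out, z_in = zscores(out_core), zscores(in_core)
    return [(a + b) / 2 for a, b in zip(z_out, z_in)]


# Hypothetical coreness scores for four states in two periods.
out_1, in_1 = [0.10, 0.20, 0.50, 0.90], [0.12, 0.18, 0.55, 0.85]
out_2, in_2 = [0.15, 0.20, 0.60, 0.88], [0.14, 0.22, 0.58, 0.86]

wsp_1 = world_system_position(out_1, in_1)   # lagged value (T_1)
wsp_2 = world_system_position(out_2, in_2)   # later period (T_2)

# Mobility is the change score T_2 - T_1 for each state.
mobility = [later - earlier for earlier, later in zip(wsp_1, wsp_2)]
print([round(m, 3) for m in mobility])
```

Averaging the two standardized scores gives export-based and import-based integration equal weight in a state's position.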


Capital and Labor Inputs 

I estimate the effect of world-system position and mobility net of capital and labor 
inputs. Gross Capital Formation. I measure physical capital using the ratio of gross
capital formation to GDP. Gross capital formation considers additions to the
fixed assets of the economy, including land improvements (e.g., fences, ditches,
drains), plant, machinery and equipment purchases, as well as the construction of
roads, railways, schools, offices, hospitals, private residential dwellings, and commercial
and industrial buildings. Secondary School Enrollment. I measure human
capital with each state’s secondary school enrollment, which refers to the proportion
of people in the age group that officially corresponds to the secondary level
who are currently enrolled. Labor Force Participation. I measure each state’s crude
labor force participation as total labor force divided by total population. Total 
labor force refers to a country’s “economically active” population, which consists 
of all people who supply labor for the production of goods and services. 


Additional Controls 

Following the main analyses, I present results from supplemental analyses that 
include the following additional controls. East Asia. East Asia is a dummy vari- 
able indicating whether or not a country is located in this region. The following 
15 East Asian states are included in my final sample of 110: Bangladesh, China,
India, Indonesia, Japan, Laos, Malaysia, Mongolia, Nepal, Papua New Guinea, 
Philippines, Singapore, South Korea, Sri Lanka and Thailand. Semiperiphery. 
Semiperiphery refers to the set of countries customarily located at an interme- 
diate position in the capitalist world economy, as indicated by the traditional 
network measure of world-system position, constructed primarily from Snyder 
and Kick’s (1979) original classification, with revisions from Bollen (1983) 
and additions from Bollen and Appold (1993). The following 23 semiperiph- 
eral states are included in my final sample of 110: Argentina, Bulgaria, Cyprus, 
Finland, Hungary, India, Iran, Ireland, Israel, Jordan, Kenya, Malaysia, Pakistan, 
Peru, Philippines, Portugal, Singapore, South Korea, Spain, Sri Lanka, Turkey, 
Uruguay and Venezuela. Trade Openness (log). Trade openness is calculated as the 
sum of exports and imports of goods and services, measured as a share of GDP. 
Industrialization. Industrialization refers to the amount of value added to industry, 
measured as a share of GDP. Industry corresponds to International Standard 
Industrial Classification divisions 10-45 and includes manufacturing (ISIC divisions
15-37), as well as mining, construction, electricity, water and gas.


Analyses 


I hypothesize that both world-system position and mobility positively affect eco- 
nomic growth. Thus, difference models that include lagged values represent an 
ideal estimation strategy for the current study. In difference models, the dependent 
variable and all predictors are calculated as change scores (T_2 − T_1), while the
lagged values represent each variable’s initial score (T_1). In addition to being able
to estimate the effects of world-system position and mobility simultaneously, this 
approach offers several other advantages. First, difference models help address 
potential heterogeneity bias (the confounding effect of unmeasured time-invariant 
variables), which can seriously affect coefficient estimates. Second, including the 
lagged values for the dependent and independent variables isolates any impact that 
initial values may have on growth, thereby preventing ceiling/floor effects from 
distorting the results (Finkel 1995).

I test the effects of world-system position and mobility against a neoclassical 
growth model, in which output (Y) is a function of capital (K) and labor (L)
inputs (Barro and Sala-i-Martin 1995). I consider both physical capital (gross
capital formation) and human capital (secondary school enrollment), along with
labor force participation. For all three measures, I include both their change scores
(T_2 − T_1) and their lagged values (T_1) in the model.

$$Y = F(K, L) \tag{1}$$


Technology (A) is sometimes included as an additional input (Collins and 
Bosworth 1996). Because economists assume that international trade exposes 
countries to technological innovation, they sometimes estimate a state’s technological
progress using trade measures (Edwards 1992; Harrison 1996). Thus, when
I add the world-system terms to the model, I introduce a trade input that could 
be said to represent a state’s level of exposure to the global stock of technology. 


$$Y = F(K, L, A) \tag{2}$$


The main analyses consist of four models that estimate the isolated effects of 
each predictor on the rate of economic growth (models 1-4, Table 2), a base model 
that estimates the effects of the capital and labor inputs simultaneously (Model 
5, Table 2), and a final model that adds the world-system terms (Model 6, Table 
2). The following equation shows the final model, with the range of years for each 
time period indicating an average value. 


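$$
\begin{aligned}
\text{GDP PC (PPP)}\;(T_{2[1995\text{-}2000]} - T_{1[1980\text{-}1985]}) ={}& \beta_0 + \beta_1\,\text{GDP PC (PPP) (Lagged Value)}\;(T_{1[1980\text{-}1985]}) \\
&+ \beta_2\,\text{Gross Capital Formation (Growth)}\;(T_{2[1995\text{-}2000]} - T_{1[1980\text{-}1985]}) \\
&+ \beta_3\,\text{Gross Capital Formation (Lagged Value)}\;(T_{1[1980\text{-}1985]}) \\
&+ \beta_4\,\text{Secondary School Enrollment (Growth)}\;(T_{2[1995\text{-}2000]} - T_{1[1980\text{-}1985]}) \\
&+ \beta_5\,\text{Secondary School Enrollment (Lagged Value)}\;(T_{1[1980\text{-}1985]}) \\
&+ \beta_6\,\text{Labor Force Participation (Growth)}\;(T_{2[1995\text{-}2000]} - T_{1[1980\text{-}1985]}) \\
&+ \beta_7\,\text{Labor Force Participation (Lagged Value)}\;(T_{1[1980\text{-}1985]}) \\
&+ \beta_8\,\text{World-System Position (Mobility)}\;(T_{2[1990\text{-}2000]} - T_{1[1980\text{-}1990]}) \\
&+ \beta_9\,\text{World-System Position (Lagged Value)}\;(T_{1[1980\text{-}1990]}) + \varepsilon
\end{aligned}
$$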
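A reduced version of this specification can be sketched as follows: the dependent variable is a change score, and each predictor enters as a change score, a lagged value, or both. The two-wave data are invented, only the GDP and world-system terms are included, and the hand-rolled solver `ols` is an assumption used to keep the example free of dependencies; it is not the estimation routine used in the study.

```python
def ols(X, y):
    """OLS coefficients via the normal equations (X'X)b = X'y.

    X holds one row per case WITHOUT the constant; an intercept
    is added as the first coefficient.
    """
    rows = [[1.0] + list(r) for r in X]
    k = len(rows[0])
    # Augmented matrix [X'X | X'y].
    A = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
         + [sum(r[i] * yi for r, yi in zip(rows, y))]
         for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        scale = A[col][col]
        A[col] = [v / scale for v in A[col]]
        for r in range(k):
            if r != col:
                factor = A[r][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][k] for i in range(k)]


# Hypothetical two-wave data: (value at T_1, value at T_2) per state.
gdp = [(1.0, 1.5), (2.0, 2.4), (3.0, 3.9), (4.0, 4.3), (5.0, 6.0), (6.0, 6.2)]
wsp = [(0.1, 0.3), (0.2, 0.2), (0.5, 0.8), (0.4, 0.3), (0.9, 1.3), (0.7, 0.6)]

growth = [t2 - t1 for t1, t2 in gdp]          # change score of the outcome
design = [[g1, w2 - w1, w1]                   # lagged GDP, mobility, lagged WSP
          for (g1, _), (w1, w2) in zip(gdp, wsp)]

b0, b_lag_gdp, b_mobility, b_lag_wsp = ols(design, growth)
print(round(b_mobility, 3), round(b_lag_wsp, 3))
```

Entering mobility and lagged position together is what allows their effects to be estimated simultaneously.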


In the main analyses (models 1-6, Table 2), I restrict the models to those states 
for which observations are available for all variables, thereby producing a standard- 
ized sample of 110 states that, together, constitute more than 85 percent of the 
world’s population. In the supplemental analyses (models 7-10, Table 3), I include 
additional controls (East Asia, semiperiphery, trade openness and industrializa- 
tion), producing a floating sample of 96-109 states. 

I formally identified several outliers in the main and supplemental analyses (mod- 
els 1-10, tables 2 and 3) using the Hadi procedure available in Stata 10.0 (Stata 
Corporation 2007). The Hadi procedure identifies multiple outliers in multivariate 
data (I use the p < .05 significance level for my outlier cutoff). In separate analyses, 
I replicated every model from the main and supplemental analyses by excluding the 
identified outliers. World-system mobility continues to feature the largest t-ratio
across all replicated models, exerting the strongest positive effect among all predic- 
tors in four of the six models in which it appears (models 4, 6, 7 and 8), and the 
second strongest effect in the remaining two models (models 9 and 10). 

I also investigated potential collinearity by producing variance inflation factor 
scores for every model in the main and supplemental analyses. The maximum VIF
scores range from 1.51 to 6.62, while the mean VIF scores range from 1.33 to
3.08. These scores indicate that collinearity may be somewhat present in several 
models. It is important to note, however, that in none of the models do the highest 
VIF scores ever exceed 10, suggesting that collinearity is never severe (Chatterjee, 
Hadi and Price 2000). Nevertheless, in separate analyses, I replicated every model 
without the lagged values, which are primarily responsible for the collinearity 
(see the correlation matrices in appendices B and C). When the lagged values are 
excluded, collinearity is no longer a concern (the maximum VIF scores range from 
1.00 to 1.88, while the mean VIF scores range from 1.00 to 1.39). More impor- 
tantly, world-system mobility remains a positive predictor of economic growth 
at the highest level of significance, carrying the largest t-ratio and standardized
coefficient in all six models in which it appears. 
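The variance inflation factor for a predictor is 1 / (1 − R²), where R² comes from regressing that predictor on the others, so a VIF of 10 corresponds to an R² of .90. With only two predictors this R² is the squared correlation between them, which the sketch below uses. The data and function names are hypothetical.

```python
from math import sqrt
from statistics import mean


def pearson(x, y):
    """Pearson correlation between two equal-length lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def vif_two_predictors(x1, x2):
    """VIF shared by both predictors in a two-predictor model."""
    r = pearson(x1, x2)
    return 1.0 / (1.0 - r ** 2)


# Hypothetical lagged values and change scores for five states;
# states that start high change least, so the two are correlated.
lagged = [1.0, 2.0, 3.0, 4.0, 5.0]
change = [0.9, 0.7, 0.6, 0.2, 0.1]

print(round(vif_two_predictors(lagged, change), 2))
```

This is why dropping the lagged values, which are correlated with the change scores, lowers the VIF scores.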

Finally, preliminary diagnostics detected heteroskedasticity in several of the 
models presented below. Therefore, I estimate models with robust standard errors
obtained from Huber and White’s “sandwich” HCCME (heteroskedasticity-consistent
covariance matrix estimator). In separate analyses, I re-ran all models
using regular standard errors and found that world-system mobility remains a
positive predictor of economic growth at the highest level of significance, carrying
the largest t-ratio across all models in which it appears.
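The contrast between regular and heteroskedasticity-consistent standard errors can be illustrated for a bivariate regression. The sketch below computes the basic (HC0) form of the sandwich estimator by hand and omits any small-sample adjustment that a statistical package may apply; the data and the function name are hypothetical.

```python
from math import sqrt
from statistics import mean


def slope_with_standard_errors(x, y):
    """Return (slope, regular SE, HC0 robust SE) for y regressed on x."""
    n = len(x)
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    b = sum((a - mx) * (c - my) for a, c in zip(x, y)) / sxx
    intercept = my - b * mx
    resid = [c - (intercept + b * a) for a, c in zip(x, y)]
    # Regular SE assumes one common error variance for every case.
    se_regular = sqrt(sum(e * e for e in resid) / (n - 2) / sxx)
    # HC0 weights each squared residual by that case's leverage on the slope.
    se_robust = sqrt(sum((a - mx) ** 2 * e * e for a, e in zip(x, resid))) / sxx
    return b, se_regular, se_robust


# Hypothetical data in which the scatter widens as x increases.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [1.1, 1.9, 3.4, 3.5, 5.9, 5.0]

b, se_regular, se_robust = slope_with_standard_errors(x, y)
print(round(b, 3), round(se_regular, 3), round(se_robust, 3))
```

The coefficient is identical under both approaches; only the standard error, and therefore the t-ratio, changes.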

I use the Bayesian Information Criterion Prime to indicate improved fitness 
across models. BIC’ suggests that the more parsimonious model is preferable 
unless fit is significantly enhanced. Smaller BIC’ values indicate better fit. Raftery 
(1995) proposes that a BIC’ decrease of 2-6 across two models suggests “positive” 
evidence of model improvement, while a BIC’ decrease of 6-10 across two models
suggests “strong” evidence, and a BIC’ decrease of more than 10 suggests “very
strong” evidence.
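For a linear regression, BIC’ can be written as N ln(1 − R²) + k ln(N), where k is the number of predictors (Raftery 1995). The sketch below applies that formula and Raftery's grades of evidence. The R² values are the rounded figures reported for Table 2, while the predictor counts (3, 7 and 9) are inferred from the model descriptions and are an assumption of this sketch; the function names are hypothetical.

```python
import math


def bic_prime(r2, n, k):
    """BIC' for a linear model: n * ln(1 - R^2) + k * ln(n)."""
    return n * math.log(1.0 - r2) + k * math.log(n)


def evidence(bic_old, bic_new):
    """Grade the improvement from bic_old to bic_new (smaller is better)."""
    drop = bic_old - bic_new
    if drop > 10:
        return "very strong"
    if drop > 6:
        return "strong"
    if drop > 2:
        return "positive"
    return "weak or none"


n = 110
model_4 = bic_prime(0.504, n, 3)   # lagged GDP plus two world-system terms
model_5 = bic_prime(0.485, n, 7)   # lagged GDP plus six capital/labor terms
model_6 = bic_prime(0.633, n, 9)   # Model 5 plus two world-system terms

print(round(model_4, 2), round(model_5, 2), round(model_6, 2))
print(evidence(model_5, model_6))
```

Because the R² inputs are rounded, the computed values only approximate the reported ones, but they fall within a tenth of a point of each.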


Results 


World-System Mobility, 1980-2000 


Figures 2a and 2b show scatter plots of world-system position by world-system 
mobility during the 1980-2000 period for the 110 states in my sample. Figure 
2a highlights the upward mobility of East Asia, whose states are indicated by plus 
signs with squares. Thirteen of the 15 East Asian countries in my sample (87%) 
experienced upward mobility during the 1980-2000 period (Japan and Papua 
New Guinea are the exceptions). Overall, these 15 East Asian states achieved 
a higher average mobility score (.096) than the remaining 95 states (.002). In 
separate analyses, when I regress world-system mobility on East Asian status, I 
find that East Asia has experienced significant upward mobility relative to the rest 
of the world-system (b = .095, p = .000).
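When the only predictor is a dummy variable, the OLS coefficient equals the difference between the two group means, which is why the reported coefficient is close to the gap between the average mobility scores (.096 − .002 = .094, allowing for rounding). A minimal sketch with invented scores; the function name is hypothetical.

```python
from statistics import mean


def dummy_coefficient(dummy, y):
    """Bivariate OLS slope of y on a 0/1 dummy variable."""
    md, my = mean(dummy), mean(y)
    sxy = sum((d - md) * (v - my) for d, v in zip(dummy, y))
    sxx = sum((d - md) ** 2 for d in dummy)
    return sxy / sxx


# Hypothetical mobility scores; 1 marks membership in the region.
in_region = [1, 1, 0, 0, 0, 0]
mobility = [0.12, 0.06, 0.01, -0.02, 0.03, 0.00]

b = dummy_coefficient(in_region, mobility)
gap = (mean(m for d, m in zip(in_region, mobility) if d == 1)
       - mean(m for d, m in zip(in_region, mobility) if d == 0))

# The slope and the difference in group means coincide.
print(round(b, 3), round(gap, 3))
```

The same logic extends to a model with two dummies, where each coefficient is the gap between that group's mean and the mean of the omitted reference group.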

Previous network studies of world trade have likewise revealed East Asia’s ascendance.
Kim and Shin (2002:457) find that, from 1959 to 1996, more than
a third of newly created trade links involved Asia, and 6 of the top 10 countries
with trade ties that increased the most during this period were in Asia. Similarly, 
Mahutga (2006) finds that two of the three countries experiencing significant 
upward rank mobility in his trade network during the 1965-2000 period were 
in East Asia (Singapore and South Korea). Moreover, six of the seven East Asian 
states in his dataset experienced upward rank mobility, and all six were ranked in 
the top 20 by the year 2000 (Mahutga 2006). In addition, since the 1960s, East 
Asia has economically outgrown the rest of the world (Collins and Bosworth 1996; 
Firebaugh 2003; Firebaugh and Goesling 2004). According to Milanovic (2005), 
five of the seven countries experiencing upward income mobility during the 1960- 
2000 period were East Asian. Consequently, in supplemental analyses, I include 
a regional dummy to investigate whether East Asia is driving any relationship 
between world-system mobility and economic growth. 

Figure 2b reproduces the same scatter plot, but highlights the upward mobility 
of the semiperiphery, as designated by customary world-system position. Twenty 
of the 23 semiperipheral countries in my sample (87%) experienced upward mo- 
bility (Iran, Kenya and Venezuela are the exceptions). Overall, the 23 states from 
the semiperiphery achieved a higher average mobility score (.066) than the 15 
states from the core (.004) and the 67 states from the periphery (.003) (five states 
lack a world-system classification). In fact, 19 of the 23 states from the semipe- 
riphery experienced higher mobility than all 15 core nations. In separate analyses, 
when I regress world-system mobility on semiperipheral and peripheral status, I 
find that the semiperiphery has experienced significant upward mobility relative to 
the core (b = .062, p = .000), but that the periphery has not (b = -.001, p = .900).

This is consistent with Kim and Shin’s (2002) finding of the “middle strata” 
driving network convergence in world trade over the past several decades. Kim 
and Shin (2002) suggest that semiperipheral states may have benefited during the 
most recent B-phase (i.e., economic downturn) of the Kondratieff wave starting 
in the early 1970s when production activities began relocating to intermediate 
zones of the world-system. In fact, they find that the 1975-1996 period repre- 
sents the middle strata’s greatest upward surge, which roughly corresponds with 
this study’s 1980-2000 sample period in which the semiperiphery experiences 
significant upward mobility. Thus, in the supplemental analyses, I also control for 
semiperipheral status in order to determine whether this intermediate zone of the 
world-system could also be driving the effects of mobility.

Both plots include a fitted line from the linear regression of world-system 
mobility on initial world-system position, which is positively sloped and statistically
significant (b = .045, p = .004). Thus, although the world trade network has
converged substantially during the post-WWII era, the 1980-2000 period marks 
a dramatic reversal in which the network actually experienced significant diver- 
gence. This “convergence halt” is consistent with Mahutga (2006:1880-81), who 
finds that most of the trade mobility in his network occurred prior to 1980. Two
factors appear to be driving this trend. First, focusing on the cluster of non-core
states in the top right of figures 2a and 2b (where most of the semiperiphery and
much of East Asia can be found), we notice that the continued upward mobility of
these non-core states during the 1980-2000 period is now beginning to produce
divergence in world trade. Second, while the semiperiphery has been upwardly
mobile and the core has been relatively stagnant, the majority of downwardly
mobile states come from the world-system’s traditional periphery. Taken together,
these two factors, (1) upward mobility by non-core states in the top tier and (2)
downward mobility by non-core states in the lower tier, have halted the convergence
revealed by other studies investigating broader historical periods (Kim and
Shin 2002; Mahutga 2006).

Figure 2a. World-System Mobility (All Trade Commodities)
Notes: Plot includes a fitted line from the linear regression of world-system mobility on
world-system position. Plus signs with squares indicate states located in East Asia.


World-System Mobility and Economic Growth 


Table 2 shows results from the OLS regression of economic growth rate on the 
independent variables among 110 states during the 1980-2000 period. The analyses
feature difference models with lagged values. Each cell reports the t-ratio with
the standardized coefficient in bold. Models 1-4 estimate each of the predictors
separately, Model 5 estimates the capital and labor inputs simultaneously, and
Model 6 adds the world-system terms. In models 1-3, the capital and labor inputs
are positive, significant predictors of economic growth. In fact, both the difference
and lagged values for all three variables reach the highest level of significance
except for the lagged value of labor force participation (p < .05). In Model 2
(and in all subsequent models where they are included), the human capital measures
produce conditional convergence in economic growth, as indicated by
the significant, negative effect of the lagged GDP PC value (β = -.724, p < .001),
thereby replicating past studies (Barro 1991, 2001). Moreover, the human capital
measures jointly explain more than 30 percent of the variation in economic
growth (R² = .308) and provide the best fit among the capital and labor models
(BIC’ = -26.334).

Figure 2b. World-System Mobility (All Trade Commodities)
N = 110
Notes: Plot includes a fitted line from the linear regression of world-system mobility
on world-system position. Separate symbols distinguish states customarily located in the
core from states customarily located in the semiperiphery (the remaining states belong to
the periphery).

Model 4 reveals that the world-system measures both reach the highest level of
significance as positive predictors, jointly explaining more than 50 percent of the
variation in economic growth (R² = .504) and achieving a BIC’ score of -63.087,
easily surpassing all previous models. Also, note that the standardized coefficient
for world-system mobility (β = .581) is substantially larger than that of world-system
position (β = .378), indicating that mobility exerts a stronger effect on
growth than initial position. Moreover, mobility features a much larger t-ratio
(7.628) than initial position (3.386), suggesting stronger statistical evidence for
mobility’s association with economic growth than for that of initial position.

Model 5 estimates the capital and labor inputs simultaneously. All three differ- 
ence measures remain significant. Growth in gross capital formation features the 
strongest effect (β = .536, p < .001), followed by secondary school enrollment (β =
.352, p < .001) and labor force participation (β = .245, p < .05). Among the lagged
measures, labor force participation drops out of significance, while secondary school
enrollment exerts the strongest effect among all predictors in Model 5 (β = .597, p
< .001). Overall, the capital and labor inputs explain almost half of the variation in
economic growth (R² = .485), which represents less explanatory power than that
generated by the world-system measures by themselves in the previous model. 

When the world-system measures are added in Model 6, almost two-thirds of the
variation in economic growth is explained (R² = .633), and there is “very strong”
evidence that fit is significantly enhanced (BIC’ = -68.012) relative to Model 5
(BIC’ = -40.043). World-system mobility carries the highest t-ratio (5.965) in
the model and is the only predictor that remains significant at the highest level. 
Thus, world-system mobility is more likely associated with economic growth than 
any other predictor in Model 6. Moreover, when controlling for the capital and 
labor inputs, mobility exerts an effect on growth that is almost twice as large (B
= .410) as initial position (B = .228). In fact, the effect of mobility is larger than
all other difference measures and is second only to the lagged value of secondary 
school enrollment (B = .426). 
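
The BIC’ comparison between Models 5 and 6 can be sketched as follows. This assumes the statistic follows Raftery's (1995) definition for linear models, BIC’ = n ln(1 - R²) + p ln(n), and his conventional grades of evidence for a difference between two models (0-2 weak, 2-6 positive, 6-10 strong, above 10 very strong). Neither the formula nor the thresholds are stated in the text above, and the function names are hypothetical.

```python
import math

def bic_prime(n, r_squared, n_predictors):
    """Raftery's BIC' for a linear model (assumed definition; lower is better)."""
    return n * math.log(1.0 - r_squared) + n_predictors * math.log(n)

def evidence_grade(bic_difference):
    """Grade the absolute BIC' difference between two models."""
    d = abs(bic_difference)
    if d > 10:
        return "very strong"
    if d > 6:
        return "strong"
    if d > 2:
        return "positive"
    return "weak"

# BIC' values reported in the text for Model 6 and Model 5.
diff = -68.012 - (-40.043)
grade = evidence_grade(diff)
print(round(diff, 3), grade)
```

Under these assumptions, a gap of roughly 28 points falls well beyond the cutoff of 10, which matches the “very strong” label used above.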

In sum, the results from Table 2 suggest that world-system mobility is strongly 
associated with economic growth, displaying effects that are larger than growth 
in capital investment, secondary school enrollment and labor force participation. 
And while world-system position is also a significant predictor of economic growth 
in the above models, being upwardly mobile in the world-system appears to exert 
more powerful effects than a state’s initial position. 

Finally, I performed a set of supplementary analyses that investigate whether 
the inclusion of additional controls influences the main results. I present these 
results in Table 3, which reports models that control for East Asia, the semipe- 
riphery, trade openness and industrialization. I control for East Asia and the 
semiperiphery to investigate whether upward mobility by either group of states is 
driving the results. I control for trade openness to investigate whether trade flows 
(rather than network relations in trade) are driving the results. And I control for 
industrialization to account for the “Firebaugh thesis” regarding the economic im- 
pact of industrial growth among developing countries (Firebaugh 2003; Firebaugh 
and Goesling 2004). In my sample, all nations customarily located in the core
experienced negative industrial growth.

Across models 7-10, world-system mobility is the only predictor that reaches 
the highest level of significance in every model. Mobility carries the largest t-ratio
among all predictors across every model, with a coefficient-to-standard error ratio 
that ranges from 4.872 (Model 9) to 6.381 (Model 8). Moreover, mobility con- 
tinues to exert the second largest effect among all predictors across all four models 
(the lagged value of secondary school enrollment remains the strongest predictor 
in each model), and the magnitude of mobility’s effect remains substantially larger 
than that of initial world-system position in every model specification. 

Consistent with the “Firebaugh thesis,” industrial growth positively affects 
economic growth in models 9 and 10. Thus, above and beyond the impact of 
world-system mobility, the spread of industrialization to the developing world 
may also have contributed to the inequality plateau in the cross-national distribution
of income. However, the effect of world-system mobility is considerably larger
than that of industrial growth in both models 9 and 10. None of the other additional
controls affect the strong performance of mobility. The development miracles 
in East Asia and the upward mobility of the semiperiphery do not appear to be 
driving the growth effects of mobility. Trade openness is also unable to account 
for the relationship between world-system mobility and economic growth. In 
fact, growth in trade openness consistently exerts a weak negative effect that even 
reaches marginal significance (p < .10) in Model 10. 


Discussion 


World-system mobility positively affects economic growth in my sample of 110 
states during the 1980-2000 period. Upward mobility in the world trade network 
is significantly associated with growth, net of capital and labor inputs, trade and 
industrial characteristics, and the rise of East Asia and the semiperiphery. The 
likelihood of association between mobility and economic growth is greater than 
that achieved by all other predictors in every model featured above. And the 
magnitude of mobility’s effect is the second largest among all predictors (next to 
the lagged value of human capital), exceeding that of initial world-system posi- 
tion by wide margins across every model. Moreover, the strong, positive effect of 
world-system mobility on growth is robust to alternative specifications (e.g., the 
deletion of outliers and the removal of lagged values). 

Revisiting the theoretical predictions set forth in this study, world-system posi- 
tion and mobility may be able to account for recent trends in the distribution of 
world income over the past several decades. On the one hand, the results show 
that a state’s initial world-system position is an important predictor of economic 
growth, with advanced, core-like states developing more quickly than isolated 
economies occupying the periphery of the world trade network. This finding lends 
support to theoretical predictions generated by world-system network scholars 
and is consistent with past studies (Kick and Davis 2001; Nemeth and Smith 
1985; Nolan 1983; Snyder and Kick 1979). Moreover, it may help account for the 
persistence of global inequality, and perhaps polarization in the world economy. 

On the other hand, upward mobility in the world-system has been an impor- 
tant route for achieving rapid economic growth by developing nations, producing 
effects that are much larger than a state’s initial position. This may help explain 
why world income inequality has leveled off in recent decades, and why a select 
group of developing economies has begun to catch up with advanced nations. 
Given the relatively short period of time under investigation, the amount of up- 
ward mobility by East Asian and semiperipheral economies into more core-like 
positions in world trade suggests that the core/periphery hierarchy in the world 
economy may not be as stable as some theorists commonly presume. More im- 
portantly, the widespread impact of mobility across the developing world suggests 
that stagnation is not an inevitable trajectory for developing states traditionally
located in non-core zones of the world-system.

Nevertheless, one potential reason why world income inequality has only
leveled off, and not declined significantly, is the downward mobility experienced 
by many peripheral nations in the world trade network. When coupled with the 
continued upward mobility by almost every developing state from East Asia and 
the semiperiphery, the result has been a “convergence halt” in world trade. In fact, 
the ascendance of this “middle tier” has been so extensive that its continued up- 
ward mobility into the upper echelon of world trade began producing divergence 
during the 1980-2000 period, marking a dramatic reversal from the post-WWII 
era of trade convergence. The middle tier’s ascendance into more core-like posi- 
tions in world trade implies that traditionally core states no longer monopolize the 
exploitive mechanism that produces economic growth. However, the middle tier's 
upward mobility also suggests that the world economy contains additional exploit- 
ers in core-like positions. Consequently, if the trade ascendance and economic 
growth of East Asia and the semiperiphery continues indefinitely, their divergence 
away from exploited states in the periphery may begin to outweigh their conver- 
gence towards other exploiter states in the advanced core, thereby increasing (rather 
than decreasing) world income inequality. For now, it appears as if world income
inequality has stopped growing in recent decades, cresting along a fragile plateau. 

I interpret the positive effects of world-system position and mobility as broadly
consistent with a range of perspectives, including those found in economics and 
the world-system literature. First, the results largely support a long-held proposi- 
tion by economists regarding the relationship between trade and growth. However, 
while this relationship has typically been found by using trade flows and policy 
indicators, a network-based measure of trade integration bolsters confidence 
in theoretical arguments linking economic exchange to increased productivity. 
Moreover, a state's position in the world trade network is a more theoretically 
satisfying measure than flow or policy indicators because network dynamics bet- 
ter capture the mechanisms that economists suggest are operative in producing 
economic growth. Second, the findings not only reveal the growth effects of trade 
integration, but they also highlight the importance of occupying a core position 
in the world economy. Indeed, while upward mobility by developing economies 
tends to erase traditional core/periphery boundaries, the growth effects stemming
from mobility also reaffirm the importance of a state’s position and movement 
along this hierarchy. In short, the presence and impact of upward mobility does 
not imply that core/periphery relations have vanished. Rather, upwardly mobile 
states have simply increased the extent to which they exploit others. In this way,
the stratifying effects of core/periphery relations remain an active component of
the world economy.


Additional Diagnostics 


While I find that world-system mobility positively affects economic growth, it
is possible that the relationship suffers from endogeneity (i.e., reverse causality).
Theoretically, however, there is strong reason to suspect that the relationship
between the two measures runs in the hypothesized direction. There is, by now, 
a large body of research produced by world-system and dependency scholars dem- 
onstrating that a state's position in the capitalist world economy affects economic 
performance. Similarly, economists have long suggested that trade integration 
produces economic growth. In my sample of 110 states, the correlation between
(1) the initial value of world-system position and GDP PC growth (r = .420) is
substantially higher than the correlation between (2) the initial value of GDP
PC and world-system mobility (r = .185) (see Appendix B). This suggests that
structural position in the world-system is causally prior to economic development.

Moreover, in separate analyses, I corrected for potential endogeneity via in- 
strumental variables regression using two-stage least squares estimation with my 
difference measures. In the first stage, the suspected endogenous measure (i.e., 
world-system mobility) is regressed on the exogenous predictors (i.e., capital and 
labor inputs) and my selected instruments (i.e., trade openness and industrialization).
In the second stage, the dependent variable (i.e., economic growth
rate) is regressed on the exogenous and endogenous predictors except that the 
endogenous measure (i.e., world-system mobility) is replaced with the predicted 
values from the first equation. Instrumental variables must be correlated with the 
endogenous measure (i.e., world-system mobility), but uncorrelated with the error 
term from the second-stage equation. I use two instruments for my 2SLS model: 
trade openness and industrialization. Theoretically, I suspect that countries with 
increased trade flows have also become more integrated in the world trade network, 
and countries that are industrializing are also upwardly mobile. 
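
The two-stage procedure described above can be sketched on synthetic data. This is a minimal illustration, not the study's Stata estimation. The variable names, the data-generating process and the true coefficient of 0.5 are all assumptions, chosen so that an unobserved confounder biases OLS upward.

```python
import numpy as np

def ols_fit(X, y):
    """Least-squares coefficients for y ~ X (X already includes a constant)."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef

def two_stage_least_squares(y, endog, exog, instruments):
    """Manual 2SLS; the last returned coefficient belongs to the endogenous regressor."""
    const = np.ones((len(y), 1))
    # Stage 1: regress the endogenous measure on exogenous predictors and instruments.
    first_stage_X = np.column_stack([const, exog, instruments])
    endog_hat = first_stage_X @ ols_fit(first_stage_X, endog)
    # Stage 2: replace the endogenous measure with its first-stage predicted values.
    second_stage_X = np.column_stack([const, exog, endog_hat])
    return ols_fit(second_stage_X, y)

# Synthetic data: a confounder drives both the endogenous regressor and the outcome.
rng = np.random.default_rng(0)
n = 5000
exog = rng.normal(size=(n, 2))
instruments = rng.normal(size=(n, 2))
confounder = rng.normal(size=n)
endog = (0.8 * instruments[:, 0] + 0.5 * instruments[:, 1]
         + 0.3 * exog[:, 0] + confounder + rng.normal(size=n))
y = 1.0 + 0.5 * endog + 0.2 * exog[:, 1] + confounder + rng.normal(size=n)

beta_2sls = two_stage_least_squares(y, endog, exog, instruments)
beta_ols = ols_fit(np.column_stack([np.ones(n), exog, endog]), y)
print("2SLS:", round(beta_2sls[-1], 3), "OLS:", round(beta_ols[-1], 3))
```

In this setup the 2SLS estimate should land near the assumed value of 0.5, while OLS overstates it. The coefficients from a manual second stage are valid, but its naive standard errors are not, because they ignore the first-stage estimation; dedicated instrumental-variables routines correct for this.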

Diagnostics reveal that the instruments are both strong (i.e., correlated with 
world-system mobility) and valid (i.e., uncorrelated with the error term). The first-stage
results show that trade openness (p < .001) and industrialization (p = .068)
are both positively related to world-system mobility. The F-statistic (15.823) is well 
above the customary threshold of 10 (see Stock, Wright and Yogo 2002) and is 
highly significant (p < .001), indicating that the two instruments have significant
explanatory power after controlling for the other covariates in the first-stage model
(R² = .365; partial R² = .206). In addition, we cannot reject the joint hypothesis
that the instruments are valid (i.e., uncorrelated with the error term) and that the
equation is specified correctly (i.e., the instruments should not be included in the
second-stage equation as exogenous predictors of economic growth) (Hansen's
J-statistic = .822, p = .365). Most importantly, the second-stage results show that
world-system mobility is a significant, positive predictor of economic growth even 
when using the predicted values from the first-stage model (p = .002).⁸ Finally, several
endogeneity tests are available in the Stata 10.0 software package (Wooldridge’s
robust score, a regression-based score and the Sargan-Hansen difference score) that
determine whether world-system mobility can be treated as exogenous (the null
hypothesis) or whether 2SLS estimation is required. The tests uniformly reveal that
we cannot reject the null hypothesis (p = .683, p = .695 and p = .711, respectively),
suggesting that we can rely on the OLS estimates.
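
The instrument-strength check reported above (a first-stage F-statistic compared against the customary threshold of 10) can be sketched as a joint F-test on the excluded instruments. This is a minimal illustration with synthetic data and hypothetical names. It uses the classical (homoskedastic) F formula, which may differ from the exact statistic Stata reports.

```python
import numpy as np

def residual_sum_of_squares(X, y):
    """RSS from an OLS fit of y on X (X already includes a constant)."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    return float(resid @ resid)

def first_stage_f(endog, exog, instruments):
    """F-test that all instrument coefficients are zero in the first stage."""
    n = len(endog)
    restricted = np.column_stack([np.ones(n), exog])    # exogenous predictors only
    full = np.column_stack([restricted, instruments])   # plus the instruments
    q = full.shape[1] - restricted.shape[1]             # number of instruments
    df_resid = n - full.shape[1]
    rss_r = residual_sum_of_squares(restricted, endog)
    rss_u = residual_sum_of_squares(full, endog)
    return ((rss_r - rss_u) / q) / (rss_u / df_resid)

# Synthetic illustration with 110 cases, matching only the sample size in the text.
rng = np.random.default_rng(1)
n = 110
exog = rng.normal(size=(n, 2))
instruments = rng.normal(size=(n, 2))
strong_endog = 0.8 * instruments[:, 0] + 0.5 * instruments[:, 1] + rng.normal(size=n)
unrelated_endog = rng.normal(size=n)

f_strong = first_stage_f(strong_endog, exog, instruments)
f_unrelated = first_stage_f(unrelated_endog, exog, instruments)
print(f_strong > 10, f_unrelated > 10)
```

Instruments that genuinely predict the endogenous measure clear the threshold of 10 comfortably, while instruments unrelated to it do not.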


Conclusion 


Overall, the predictive power that world-system position and mobility share is 
especially interesting in light of the fact that these two measures are not highly correlated
with one another (r = .271) (see Appendix B). They each capture different
features of the world economy (dominance and ascendance), but they share util- 
ity in predicting development, and these effects occur simultaneously. Moreover, 
this lack of correlation between world-system position and mobility suggests 
that, while few countries can claim it, those that are both core-like and upwardly 
mobile in world trade have been in a rare position to achieve rapid development 
over the past quarter century. This refers, principally, to several economies in 
East Asia, such as China, South Korea, Singapore, India, Thailand, Malaysia and
Indonesia. Not only did these economies experience substantial upward mobility,
but all seven were ranked in the top 25 in world-system position during the
1990s (see Figure 2a). East Asia is not only more advanced than other developing
regions, but its states are also upwardly mobile relative to traditional core states 
in the world economy. 

Perhaps the most remarkable aspect of East Asia’s ascent has been its lack of 
regional integration due to distrust among its major powers (Rozman 2004). In 
contrast to the formal multilateralism that typifies Europe, East Asian regionalism 
is characterized more by a series of bilateral relations among its states. Regional 
organizations tend to be weaker and more informal within East Asia, as state 
sovereignty is more highly guarded (Katzenstein 2005). Indeed, a free trade agreement
in East Asia is perhaps the largest remaining organizational gap in an increasingly
integrated world economy. However, the creation of APEC (Asia-Pacific
Economic Cooperation) in 1989 and the formation of ASEAN (Association of 
Southeast Asian Nations) + 3 in 1996 suggest formative steps towards finally 
bringing China, Japan and South Korea together in an organizational framework 
that can more fully produce regional integration. Ultimately, despite East Asia’s in- 
choate regionalism, its economies have experienced significant upward mobility in 
world trade, assisted notably by its dense inter-regional ties (Kim and Shin 2002), 
depending, in particular, on relations with the United States (Katzenstein 2005). 

In contrast to East Asia, the semiperiphery has not yet established itself in the 
top tier of the world trade network. If we ignore semiperipheral nations from 
East Asia, only three semiperipheral nations were ranked in the top 25 during the 
1990s (Spain, Turkey and Finland). This suggests that the non-East Asia semipe- 
riphery has achieved economic growth primarily from the zone’s network mobility 
rather than its network position. However, almost every semiperipheral state was 
ranked in the top half of the network in the 1990s (see Figure 2b), suggesting
that this traditional world-system zone is poised to penetrate the top tier of the
network in greater numbers in the near future. These non-East Asian states from 
the semiperiphery are primarily located along the perimeter of Europe (Bulgaria, 
Cyprus, Finland, Hungary, Ireland, Portugal and Spain), the Middle East (Iran, 
Israel, Jordan, Pakistan and Turkey), and Latin America (Argentina, Peru, Uruguay 
and Venezuela). Each of these three regions is located close to a different core hub 
in the world economy (Western Europe, East Asia and North America, respec- 
tively). Thus, following East Asia's blueprint for ascendance to the top tier of the 
world trade network, the semiperiphery could gain greater trade prominence by 
establishing deeper inter-regional ties with these core hubs. 

Ultimately, the results from this study demonstrate that the effect of world- 
system mobility persists outside of East Asia and the semiperiphery, suggesting 
that nation-states in all developing regions should consider the merits of pursuing
global economic integration. To be sure, this does not imply deepening economic 
dependency on core trading partners via increased trade flows. As the models in 
Table 3 reveal, trade openness does not produce economic growth net of the other 
predictors, and it may even contribute slightly to underdevelopment. Rather, the 
findings suggest that developing nations should establish trade relations with 
a greater number of partners in order to enhance their world-system position. 
In short, it is important to distinguish between the harmful effects of resource 
dependency (i.e., volume of trade relative to GDP) and partner dependency (i.e., 
network location along the core/periphery hierarchy) (Clark 2008). 

Interestingly, then, the findings from this study are consistent with world- 
system/dependency scholars who claim that core/periphery relations are exploitive, 
but inconsistent with those who claim that developing states should therefore avoid 
economic relations with the core. Establishing ties with a greater number of trade 
partners actually reduces dependency and indicates upward mobility along the 
core/periphery hierarchy. In particular, trade agreements with powerful, core states 
expose developing economies to spillovers from the most advanced markets. At 
the same time, developing states should also embrace regional trade agreements 
with other developing partners, as they increase the overall diversity of knowledge 
capital to which states have access. Furthermore, establishing trade links with a 
number of other developing nations in the periphery can be seen as an attempt 
to generate upward mobility and occupy a more core-like position in the world 
trade network. Thus, developing states may wish to pursue trade agreements with 
both core and peripheral partners simultaneously in order to maximize their gains 
from connectivity. 


Notes 


1. Of the 55 SITC (Standard International Trade Classification) two-digit commodity
types, Smith and White (1992) collect data on 15, while Mahutga (2006) collects
data on 12.
2. See also Babones (2005), who detects substantial world-system mobility between
1975 and 2002 using attribute (rather than network) data. Of the 103 countries in 
his sample, 30 countries experienced movement across zonal boundaries, with 17 of 
these countries experiencing “permanent” mobility. 


3. Mahutga (2006) also reports block assignments for the 1990-2000 period. However, 
they are based on four-block partitions, instead of five, thereby preventing an 
assessment of block mobility across the 1965-2000 period. 


4. β-convergence refers to a negative relationship between initial level and subsequent
change. 


5. GDP PC (PPP), world-system position, and trade openness are logged prior to 
differencing in order to reduce skew. Thus, the resulting difference scores for these 
three measures should be interpreted as growth rates. 


6. See Firebaugh and Beck (1994), who similarly use difference models that include 
lagged values. 


7. I excluded the following outliers in the following replicated models: Albania (Model
10), Angola (Model 9), Bulgaria (Model 10), China (models 1, 3, 4 and 10), Congo
(R) (models 5, 9 and 10), Mongolia (models 1, 5, 6, 7, 8, 9 and 10), Oman (models
5 and 6), and United Arab Emirates (models 5, 6, 7 and 8).


8. World-system mobility remains a significant, positive predictor of economic growth 
when I conduct instrumental variables regression via limited-information maximum 
likelihood (p = .003) or generalized method of moments (p = .002) estimation.


Globalization and Industrialization in 64 Developing
Countries, 1980-2003 


Yunus Kaya, University of North Carolina, Wilmington 


This study investigates the effect of the latest wave of economic globalization on 
manufacturing employment in developing countries. It revisits the classic debate on 
the effect of internal and external influences on industrialization, and extends this de- 
bate to contemporary developing countries. In the process, it assesses the evidence for 
development/productivity, world systems/dependency and globalization explanations,
and uses a comprehensive dataset on 64 developing countries from 1980 through 
2003. The results generally show that manufacturing employment increased in most 
developing countries. First, this study finds that the level of economic development 
measured by gross domestic product per capita is the most important factor influ- 
encing the size of manufacturing employment. Second, economic globalization also 
influences manufacturing employment in developing countries, but mainly through 
trade. The size of exports and low-technology exports have a significant positive effect 
on manufacturing employment in developing countries. Finally, the analysis provides 
limited support for world systems/dependency theories. Raw materials exports do not 
significantly influence manufacturing employment while foreign direct investment 
has a negative impact in some models. This study concludes that the latest wave of 
economic globalization contributed to the expansion of manufacturing employment 
in developing countries, although it is not the most significant factor shaping the size 
of manufacturing employment in these countries. 





This study explores the effect of the latest wave of economic globalization on 
manufacturing employment in developing countries. The ongoing integration of
national economies since the 18th and 19th centuries has intensified in the past two
or three decades (Held et al. 1999). Some scholars argue that economic exchanges 
between relatively independent parties have been replaced by complex and highly 
interdependent systems of industrial production and economic exchange orga- 
nized on a global scale (e.g., Dicken 2003; Gereffi 2005). In recent decades, de- 
veloped countries saw a significant portion of their consumption made possible 
by industrial production in developing countries, while developing countries 
became exporters of manufacturing products.¹ Concurrently, there were substantial
declines in manufacturing employment in developed countries and significant
increases in developing countries. The effect of the latest wave of globalization in
this process is subject to heated debates both in the social sciences and the public
sphere. Some see the latest wave of globalization as a significant force on manufacturing
employment in developing countries (e.g., Wood 1994; Dicken 2003).
However, most of the debate and empirical analyses are confined to developed
countries, and comparatively little research has been done on developing countries
(e.g., Alderson 1999; Brady and Denniston 2006).


An earlier version of this article was presented at the 2006 Annual Meeting of the American Sociological
Association in Montreal, Quebec. I am grateful to David Brady for constructive comments and suggestions
on various drafts, and thank Gary Gereffi, Emilio Parrado, Suzanne Shanahan and Ken Spenner
for suggestions. Direct correspondence to Yunus Kaya, Department of Sociology and Criminology, UNC
Wilmington, 601 South College Road, Wilmington, NC 28403-5978. E-mail: kayay@uncw.edu.


© The University of North Carolina Press Social Forces 88(3) 1153-1182, March 2010

Industrialization has always been a central topic in sociology. Most of the founding 
theories of sociology were built on the industrialization that took place in Western 
Europe and the United States in the 19th and early 20th centuries. The spread of
industrialization throughout the developing world has also been an important con- 
cern for scholars. The effect of external factors on the industrialization of developing 
countries has long been a contentious topic in these debates. Many scholars, ranging
from early social theorists to the convergence and modernization theorists of the 1950s
and 1960s and contemporary economists, perceived industrialization as a process
associated with economic development and productivity within a country. They
predicted that manufacturing employment would rise and decline across countries
as they pass through the stages of economic development. However, many others
argued that economic development and industrialization in developing countries
differ from those of the early-industrialized countries. Dependency and world-systems
theories offered an account of the world economy as a whole, in which the developing
economies are dependent on early-industrialized countries.


Past Research 


A strong line of thinking in the social sciences perceives industrialization as a 
process associated with the level of economic development and productivity in 
a country where external factors play a lesser role. One theory holds that the
relationship of development and productivity to industrialization should
be similar across countries. In contrast, world-systems and dependency theorists
argue that there is an exploitative relationship between developing and early- 
industrialized countries, which limits the industrialization of the former. Recently, 
the rapid changes in the world economy led many to argue that economic globalization
became a powerful influence in expanding manufacturing employment in
developing countries.


Economic Development, Productivity and Manufacturing Employment 


Many scholars, explicitly or implicitly, predict that when conditions are met, the 
process of industrialization in Western countries will replicate itself in other countries.²
Among earlier theorists, Marx saw a uniform path of change for every society,
in which feudalism, capitalism and socialism emerge consecutively. Saint-Simon 
thought that all societies would transform into industrial societies (Kerr 1983). 
In the early 1940s, Clark (1957[1940]) proposed an influential theory of industrial
employment in a country. He claimed that as an economy develops, the price of
agricultural products and the relative demand for them decrease, while the demand
for manufacturing increases. As a result, manufacturing employment increases. In
the later stages of economic development, however, the demand for manufacturing
also decreases while the demand for services increases. This shifts employment from
manufacturing to services. Thus, Clark projected an inverted U-shaped relationship
between economic development and manufacturing employment. In the recent
debate on deindustrialization in developed countries, most economists adopt Clark’s
view, but add productivity to the story. Krugman and Lawrence (1994) argue that
spending on manufacturing goods and their prices decline due to higher productivity
in manufacturing. Rowthorn and Wells (1987) and Rowthorn and Ramaswamy
(1997) claim that deindustrialization emerges with increasing productivity and has
little to do with foreign direct investment or trade.³

The question of industrialization in countries other than the early-industrialized
countries came under greater scrutiny by both Western and non-Western
intellectuals with the emergence of colonies as independent nation-states after the
Second World War.⁴ In this period, two relatively optimistic theories envisioned
the realization of modernization and industrialization in developing countries. 
Modernization theorists argued that industrialization in developing countries is 
possible through the replication of the conditions that created modern economies 
in early-industrialized countries (Levy 1966). Inkeles and Smith (1974) predicted 
the emergence of “modern man” in developing countries as a building block for 
modern industrial societies. In similar fashion, convergence theorists predicted 
that institutional structures and labor relations similar to those of industrialized
countries would emerge in developing countries (Kerr, Harbison, Dunlop and Myers
1960; Moore and Feldman 1960; Moore 1965). 

In sum, both earlier and contemporary proponents of this account expect 
similar patterns of manufacturing employment across countries, in which the size 
of manufacturing employment can be predicted by the level of productivity and 
economic development. 


World Systems and Dependency 


In the 1960s and 1970s, dependency and world systems theories challenged modernization
theories. These theories advanced the position that there is a relationship
of dependency and exploitation between developing and industrialized countries
(Cardoso and Faletto 1979).⁵ Wallerstein (1974, 1980 and 1989) argues that a
European-centered world system was established in 1450-1640, which he calls the
long 16th century, and was continued in subsequent cycles. This system is organized
around two sets of countries: the core and the periphery. Powerful industrialized
countries constitute the core, and the rest of the world constitutes the periphery.⁶
Wallerstein proposes that a country’s mode of integration into the world economy 
determines development outcomes. Core countries do most of the industrial production
and use peripheral countries as markets and the source for raw materials.
Chase-Dunn and Grimes (1995) explain that, in the post-colonial world, the
maintenance of the world system is achieved through market mechanisms rather 
than the use of direct threat by military might as in earlier times. Dependency 
and world systems theorists perceive trade and foreign capital as the main market 
mechanisms. Foreign trade composition, in which a peripheral country exports raw 
materials and semi-processed goods and imports manufactured and/or capital goods, 
is perceived as an indicator of dependence on core countries. Therefore, dependency 
and world systems theorists perceive open trade as detrimental to the development 
of peripheral countries and favor protectionism and disengagement as necessary for 
industrialization (Cardoso and Faletto 1979). For example, Schwartzman (1995) 
argues that trade dependence hurt industrialization in Brazil and Mexico. 

In the 1980s, FDI in developing countries became a focus of inquiry for world
systems and dependency theorists (Firebaugh 1997). Bornschier, Chase-Dunn
and Rubinson (1978) claim that FDI in peripheral countries creates investment
dependence. They and others argue that this dependence on foreign capital impedes
development in third world countries (Bornschier and Chase-Dunn
1985; London and Smith 1988; Boswell and Dixon 1990).⁷ Despite providing a
rigorous operationalization and test of world systems and dependency theories,
these scholars focused on economic growth and development, not industrialization.
However, if the dependency theorists are correct, perhaps it is reasonable to
expect FDI in developing countries to be a negative influence on industrialization
as well. For example, Cardoso and Faletto (1979) initially suggested that the dependent
industrialization in Latin America limited employment in manufacturing.⁸
Bornschier, Chase-Dunn and Rubinson (1978) and Bornschier and Chase-Dunn 
(1985) argue that while FDI flows might enhance economic growth in developing 
countries in the short run, the long-term dependence on foreign capital, indicated 
by FDI stock, hurts economic growth. Although it is crucial to acknowledge that 
dependency scholars have mainly focused on development, perhaps an extension 
of their beliefs would conclude that long-term dependence on foreign capital im- 
pedes the development of domestic capabilities for industrialization and prevents 
a country from industrializing fully. 


Globalization and Manufacturing Employment 


The term globalization was not widely used in academic and public discussion 
until the early 1990s (Robertson 1992). By the end of the decade there were 
countless articles and books on globalization. Although it has many dimensions 
and the theories of globalization have long historical trajectories, the locus of
globalization is the change in the world economy over the past two or three decades.9

The global reorganization of manufacturing, which is referred to as the new
international division of labor, is considered by some to be the defining characteristic
of the latest wave of globalization (Fröbel, Heinrichs and Kreye 1980;
Castells 1996; Hoogvelt 1997). Scholars think that manufacturing is now organized
through complex networks of firms, which are sometimes labeled as
Global Commodity Chains or Global Value Chains (e.g., Gereffi 2005; Dicken
2003). Within these networks, firms from developed economies relocate their
manufacturing activities to developing countries through investment or subcontracting,
while they focus on higher value activities such as research and development, and
marketing (Gereffi 2001; Kaplinsky 2000). This, then, translates into increasing
manufacturing exports and FDI, and expanding manufacturing employment in
developing countries.

Using Heckscher-Ohlin theory,10 Wood (1994) claims that manufacturing exports
increase the number and earnings of low-skilled workers in developing countries.
He insists that the abundance of low-skilled labor in developing countries creates
a comparative advantage, and argues that trade lessens the manufacturing autarky
of developed countries in which they produce most of the products for domestic
consumption (Wood 1995).11 Dodzin and Vamvakidis (1999) show that increasing
levels of trade led to greater levels of industrialization, with the emergence of
labor-intensive industries in predominantly agricultural economies from 1970
to 1995. Similarly, Bollen and Appold (1993) demonstrate that manufacturing
exports significantly increase the size of manufacturing employment.

Many now believe that the low production costs in developing countries at- 
tracted foreign investment in recent years, even more so than domestic markets 
and natural resources. A U.N. report (2000:2) describes Bangladesh as follows: “Besides
representing a potential market in itself (and potential access to the much larger
South Asian market), Bangladesh also offers considerable potential as a base for
labour-intensive manufacturing.” Some use the concept export-oriented FDI to refer
to this phenomenon (see Zhang 2005). According to this idea, firms relocate their
manufacturing facilities in places such as China and Bangladesh to take advantage 
of low costs and ship their products all over the world rather than selling in the 
domestic markets of these countries. To the extent that this new type of FDI exists, 
it may positively influence manufacturing employment in all developing countries. 

In recent years, some developing countries have become outward international investors,
especially in other low-cost developing countries, as the costs of production
increased in their domestic economies (Gereffi 2005; Gereffi and Memedovic
2003).12 The investments by Taiwanese and Korean firms in mainland China
exemplify this trend (Zhang 2005). This FDI outflow may halt the increase in 
manufacturing employment and cause a decline. For example, Chen and Ku 
(2003) found that Taiwanese firms that invest abroad significantly decrease the 
size of their manufacturing labor force within Taiwan. Any recent declines in 
manufacturing employment in these countries might be due to such outward FDI. 

One of the main features of the latest wave of economic globalization has been 
the globalization and dispersion of manufacturing. This process has allowed firms 
to utilize low labor costs around the world, as well as looser tax and environmental 
regulations. This, in turn, has contributed to the industrialization of developing
countries and the deindustrialization of richer countries. To my knowledge, no study
has fully scrutinized the various dimensions of globalization’s impact on manufactur- 
ing employment in developing countries during this most recent period. 


Data and Methods 


To explore the influence of globalization on manufacturing employment in developing
countries, this study analyzes an unbalanced panel of 64 developing
countries from 1980 to 2003.13 The sample includes all developing countries
that met three criteria: a population of half a million or higher, less than $5,000
GDP per capita in 1980, and data on the labor distribution for at least two time
points.14 All countries fitting these criteria have been included in order to increase
the representativeness of the sample.15 Ex-communist countries are excluded from
the sample.16 The changes in manufacturing employment in these countries may
be the result of the transition to capitalism because they experienced massive
industrial restructuring and privatization following the fall of communism (Nee
1989; Stark 1992). In Wallerstein’s terminology, my sample is composed of peripheral
and semi-peripheral countries. Data have been collected for years 1980, 1985,
1990, 1995, 2000 and 2003. Because of missing data for some countries, countries
have between two and six observations. The unit of analysis is the country-year
and the total number of observations is 234.


The Dependent Variable 


Table 1 presents descriptive statistics and sources for all variables. The dependent 
variable is the manufacturing share of employment, calculated as the percentage 
of total employment in a country (World Bank 2007). The manufacturing share 
of employment is the standard indicator of industrialization in a country and is 
used commonly in similar studies (Alderson 1999, 1997; Brady and Denniston 
2006; Rowthorn and Wells 1987). 

Figure 1 shows the trend in manufacturing employment (the averages of country
percentages) in the countries in the sample between 1980 and 2003. The
average manufacturing employment increased from 15 percent in 1980 to 22
percent in 2003.17 This suggests a significant surge of industrialization in developing
countries after 1980. By comparison, manufacturing employment declined to
an average of 25 percent by 2001 in developed economies (Brady and Denniston
2006). However, there is also great variation across countries. Manufacturing
employment was 15 percent in the 5th percentile countries in 2003, while it was 32.5
percent in the 95th percentile. A close analysis of individual cases demonstrates
that some countries are now experiencing a decline in manufacturing employment
after an earlier surge of industrialization. In South Korea, manufacturing
employment peaked around 36 percent in the early 1990s and gradually decreased
to 27 percent in 2003. Similarly, it rose to 33 percent in Malaysia in the late
1990s and started to decline gradually to 30 percent in
2003. Interestingly, manufacturing employment exceeds
service employment in only one of the 234 observations
(Mauritius in 1990). Unlike in early-industrialized
countries, manufacturing employment does not precede
or exceed service sector employment in developing
countries, as has been documented by previous research
(Browning and Roberts 1980; Koo 1990).


Independent Variables 


Control Variables 

The first control variable is real GDP per capita (logged)
measured with purchasing power parity (World Bank
2007).18 GDP is an indicator of economic development
or the level of productivity in a country (Alderson 1999;
Brady 2003). Clark (1957[1940]) and other scholars argue
that manufacturing employment follows an inverse-U
shape with economic development. As mentioned above,
some countries experienced a downturn in manufacturing 
employment in recent years. Thus, to explore if economic 
development has a curvilinear effect on manufacturing 
employment in developing countries, I add the square 
of GDP per capita (logged) into the model. To reduce 
collinearity, I centered GDP per capita before squaring. 
According to the development/productivity account, 
these two variables should predict the level of manufac- 
turing employment in developing countries regardless of 
the level of integration into the global economy. 
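
The reason for centering before squaring can be illustrated with a minimal sketch. The GDP figures below are hypothetical and are not taken from the study's data; the point is only that a logged variable and its square are almost perfectly correlated until the variable is centered on its mean.

```python
import math
from statistics import mean


def pearson(x, y):
    """Pearson correlation between two equal-length lists."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


# Hypothetical GDP per capita values (PPP dollars); illustration only.
gdp_per_capita = [600, 900, 1500, 2400, 3800, 6000, 9500]
log_gdp = [math.log(v) for v in gdp_per_capita]

# Squaring the raw logged variable.
raw_square = [v ** 2 for v in log_gdp]

# Centering on the sample mean first, then squaring.
centered = [v - mean(log_gdp) for v in log_gdp]
centered_square = [v ** 2 for v in centered]

r_raw = pearson(log_gdp, raw_square)
r_centered = pearson(centered, centered_square)
print(f"uncentered: {r_raw:.3f}, centered: {r_centered:.3f}")
```

With these made-up values the uncentered correlation is close to 1, while the centered one is close to 0, so the linear and squared terms can be estimated with less overlap.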

This study also controls for demographic variables that
may account for changes in the labor force composition.
Urbanization is measured as the percent of the population
living in urban areas; and population growth is measured as
the annual percentage change in population from one year
to the next (World Bank 2007).19 The level of urbanization
in a country may have an impact on employment by
creating relatively better educated “excess labor” in the
urban areas, especially in the so-called global cities (Portes
and Roberts 2005). Secondary school education, measured
by the enrollment rate, controls for the effect of the skill
level within a country.20 Education might have an impact
on manufacturing employment as it provides the basic
skills required for manufacturing jobs (Wood 1995).21

Figure 1. Trends in Manufacturing Employment

Gross domestic investment as a percentage of GDP is also included in the models
as a control. It is measured as gains in fixed assets plus the net change in inventory 
levels (World Bank 2007). Many scholars argue that domestic investment is more 
beneficial for the domestic economies because profits from domestic investments 
are more likely to be reinvested, so it is commonly used as a control in similar 
studies (Bornschier and Chase-Dunn 1985; Kentor 2001; Kentor and Boswell 
2003; Jorgenson 2006). 

Geographical regions are also added as controls in some models. The regions
are East Asia, South Asia, East Europe and Central Asia, Middle East and North
Africa, Sub-Saharan Africa, and Latin America and the Caribbean. Many have argued
that there are significant regional differences in industrialization, and recent
export-oriented industrialization is mainly an East Asian phenomenon with little
implication for other parts of the world (see Gereffi and Fonda 1992). To test for
this, geographical regions are included in the models as dummy variables with
East Asia as the reference region.


Globalization Variables

Economic globalization operates through the international flows of goods and services,
and the stocks of capital. Globalization measures aim to capture both the size
and the structure of trade and foreign direct investment in developing countries.22
The first globalization variable is exports measured as a percentage of GDP
(World Bank 2007). This variable is expected to have a positive effect on manufacturing
employment. Second, low-technology exports are measured using Lall’s
(2000) classification,23 as a percent of all exports. This variable is expected to have
a positive effect (Dicken 2003; Wood 1994). Third, raw material exports are
calculated with Lall’s (2000) classification by combining primary products and
resource-based products minus oil (both crude and refined) and natural gas exports
[(PP+RB)-(oil and gas exports)].24 This variable is also a percent of all exports.
Raw material exports are seen as one of the main mechanisms of dependency and
should have a negative influence on industrialization. Similarly, some scholars
argue that dependence on raw material exports can impede industrialization, a
phenomenon that they refer to as the “resource curse” (Auty 2000).
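
As a minimal sketch of the raw material exports measure, the function below applies the bracketed expression (PP+RB)-(oil and gas exports) and divides by all exports. The function name, argument names and the example figures are hypothetical; only the formula comes from the text.

```python
def raw_material_share(primary, resource_based, oil_and_gas, total_exports):
    """Raw material exports as a percent of all exports.

    Implements (PP + RB) - (oil and gas exports), divided by total exports.
    All arguments must be in the same currency units.
    """
    if total_exports <= 0:
        raise ValueError("total_exports must be positive")
    raw_materials = primary + resource_based - oil_and_gas
    return 100.0 * raw_materials / total_exports


# Hypothetical export values; illustration only.
share = raw_material_share(
    primary=40.0, resource_based=25.0, oil_and_gas=15.0, total_exports=100.0
)
print(share)
```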

The fourth globalization variable is inward FDI. FDI can be measured both 
as a stock and flow, and this study examines both. The FDI stock variable enters 
into the models as the percent of GDP. The FDI flow variable is calculated by 
taking the three-year average (t, t-1 and t-2), because of fluctuations in annual 
FDI flows, and dividing it by the current GDP in t. The globalization account 
expects that inward FDI will positively affect manufacturing employment (Zhang 
2005), although this account rarely differentiates between stock and flow. By
contrast, dependency theorists argue that FDI flows may have a short-term positive
effect on economic growth in developing countries, while FDI stock should
have a long-term negative effect (Bornschier, Chase-Dunn and Rubinson 1978;
Bornschier and Chase-Dunn 1985). Thus, FDI flows should boost manufacturing 
employment, while FDI stock should undermine it.”° 
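
The FDI flow measure described above can be sketched as follows. This is an illustration under stated assumptions, not the study's code: the function and variable names are hypothetical, the figures are made up, and expressing the result as a percent (to match the stock variable) is an assumption.

```python
def fdi_flow_share(flows_by_year, gdp_by_year, year):
    """Three-year average of FDI flows (t, t-1, t-2) relative to GDP in t.

    Returned as a percent of current GDP (the percent scaling is an assumption).
    """
    window = [flows_by_year[year - lag] for lag in range(3)]
    average_flow = sum(window) / 3
    return 100.0 * average_flow / gdp_by_year[year]


# Hypothetical annual FDI inflows and GDP, in the same currency units.
flows = {1998: 120.0, 1999: 300.0, 2000: 180.0}
gdp = {2000: 10_000.0}

flow_share = fdi_flow_share(flows, gdp, 2000)
print(flow_share)
```

Averaging over three years smooths a spike such as the hypothetical 1999 inflow, which would otherwise dominate a single-year measure.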

In addition, outward FDI is included in the models. Many developing countries
are now outward foreign investors (Gereffi 2005; Chen and Ku 2003).”
Outward FDI, which is also measured in terms of both stock and flow, should 
have a negative impact on manufacturing employment. 

Finally, some models include inward FDI in primary and secondary sectors, both 
stock and flow, as a percent of total FDI. The primary sector comprises agricultural 
and mining activities, while the secondary sector mainly comprises FDI in manu- 
facturing (UN 1992, 1994, 1996, 2000, 2003, 2004; OECD 2001; UNCTAD 
2008). The sectoral composition of FDI, especially FDI in the secondary sector,
may have a significant impact on manufacturing employment. For example, some
recent studies showed a significant impact of FDI in the manufacturing sector
on environmental outcomes in developing countries (Jorgenson 2007a, 2007b;
Jorgenson, Dick and Mahutga 2007).

To show that there is not a collinearity problem among the independent vari- 
ables, the effects of the independent variables on each other are also estimated.28 
In these models, the fits indicated by the R-squared are modest. Thus, only small 
portions of the variance in the independent variables are accounted for by other 
independent variables, and the estimation of unique effects is not problematic.29
Appendix 1 shows that most correlation coefficients among independent variables
are below .5, and only two are over .7.


Models


Ordinary least squares regression is not appropriate for the cross-sectional, unbal- 
anced panel data used in this analysis. Pooled cross-sections of different time points 
violate the assumption of zero correlation between error terms in ordinary least 
squares regression (Wooldridge 2003). Facing the possible existence of unmea- 
sured time-constant, cross-national heterogeneity in this type of data, researchers 
often use fixed-effects or random-effects models (Hsiao 2003; Wooldridge 2003). 

There has been a heated debate on the choice between random-effects and 
fixed-effects models. Halaby (2004) states that fixed-effects models more effec- 
tively control for unobserved unit effects, and strongly advocates for their use 
against possible bias and inconsistency due to unobserved unit effects. Fixed- 
effects models may also be more stringent against reverse endogeneity (Winship 
and Morgan 1999). However, a problem some researchers see in these models is 
that they do not allow the independent variables to explain between-country variation
(e.g., Nielsen and Alderson 1995). Fixed-effects models run ordinary least
squares regression after differencing variables from their country-specific means. 
Therefore, they dispose of all cross-national variation, and only model the histori- 
cal variation within countries. Random-effects models take into account cross- 
national variation by using a general error term in addition to country-specific 
error terms and differencing variables from only a portion of the country-specific 
means (Wooldridge 2003). Fixed-effects models also cannot handle time-invariant
variables (e.g., region dummy variables) (Beck 2001; Beck and Katz 2001).
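
The difference between the two transformations described above can be sketched in a few lines. This is a simplified illustration with hypothetical data, not the estimation code used in the study: `theta = 1` subtracts the full country mean (the fixed-effects or within transformation), while a value between 0 and 1 subtracts only a portion of it (the random-effects transformation). The formula in `re_theta` is the standard textbook expression; in an unbalanced panel it would be computed separately for each country's number of observations.

```python
import math
from collections import defaultdict
from statistics import mean


def re_theta(sigma_e2, sigma_u2, n_periods):
    """Share of the country mean removed under random effects.

    sigma_e2: variance of the idiosyncratic error
    sigma_u2: variance of the country-specific error
    n_periods: number of observations for the country
    """
    return 1.0 - math.sqrt(sigma_e2 / (sigma_e2 + n_periods * sigma_u2))


def quasi_demean(rows, theta):
    """Subtract theta times the country mean from each observation."""
    by_country = defaultdict(list)
    for country, value in rows:
        by_country[country].append(value)
    country_means = {c: mean(vals) for c, vals in by_country.items()}
    return [value - theta * country_means[country] for country, value in rows]


# Hypothetical manufacturing employment shares for two countries.
rows = [("A", 10.0), ("A", 14.0), ("B", 30.0), ("B", 34.0)]

within = quasi_demean(rows, theta=1.0)   # fixed effects: only within-country change
partial = quasi_demean(rows, theta=0.5)  # random effects: some cross-country variation kept
print(within, partial)
```

After the within transformation, countries A and B look identical, which is why a time-invariant variable such as a region dummy drops out; the partial transformation keeps part of the gap between them.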

Some similar studies used fixed-effects models with first-order autocorrelation 
correction (Alderson 1999; Rowthorn and Ramaswamy 1997). However, fixed- 
effects models with AR1 use one degree of freedom for each country, causing a 
loss of precious information, and they are difficult (or impossible) to estimate and 
interpret when some countries have only two observations. This is not a problem 
for random-effects models with AR1. These models are generalized least squares 
estimators, which can accommodate unbalanced and unequally spaced panel data 
(Baltagi and Wu 1999). 

Taking into account the properties of these models and the objectives of this 
research, the results will be presented with all four techniques. Both historical
and cross-national variation are important to this research, and the use of dummy
variables is necessary to control for the effect of regional differences. Presenting all 
four techniques together will provide a more rigorous test of the three accounts of 
industrialization in developing countries, and allow different model specifications. 

The tables present results for these models.30 Statistical significance is reported
beginning at the .10 level with two-tailed tests. Tables present unstandardized
coefficients and t-scores. Statistical significance, fit statistics and the Bayesian
Information Criterion Prime (BIC’) are used for model selection. BIC’ selects the more
parsimonious model unless model fit is substantially improved (Raftery 1995).

Models 1 through 4 in Table 2 present analyses with control and globalization
variables without FDI sector, while models 5 and 6 display the RE and RE-AR1 
analyses with the inclusion of region dummies (again without FDI sector). In 
analyses available upon request, each independent variable is added one at a time. 
The results were wholly consistent, so simply the combined models are presented 
here. In these models, FDI is measured as stock; and in the subsequent table FDI 
is measured as flow. 

All six models consistently show a very significant positive effect of GDP, 
while the square of GDP has a significant negative effect in the random-effects 
model without region dummies at the .1 level. This demonstrates that the level 
of economic development is strongly associated with the share of manufacturing 
employment, and suggests that the relationship between economic development 
and manufacturing employment in developing countries can be predicted by an 
inverse-U shaped curve.31 The insignificance of the square of GDP in the fixed-effects and
fixed-effects with AR1 models can be attributed to the fact that only a few countries
in the sample are in the advanced stages of industrialization and experiencing
the peaking and downturn of manufacturing employment.
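
To make the inverse-U interpretation concrete, the sketch below computes where a quadratic in centered log GDP peaks. The coefficients and the sample mean are hypothetical placeholders, not estimates from the tables; the only fact used is that a curve b1*x + b2*x^2 with b2 < 0 reaches its maximum at x = -b1 / (2*b2).

```python
import math


def turning_point(b_linear, b_square):
    """Value of centered log GDP at which predicted employment peaks."""
    if b_square >= 0:
        raise ValueError("an inverse-U requires a negative squared term")
    return -b_linear / (2.0 * b_square)


# Hypothetical coefficients and sample mean of log GDP per capita.
b_gdp, b_gdp_sq = 4.0, -1.0
mean_log_gdp = 7.5

x_star = turning_point(b_gdp, b_gdp_sq)          # in centered log units
gdp_at_peak = math.exp(mean_log_gdp + x_star)    # back on the dollar scale
print(x_star, round(gdp_at_peak))
```

If the implied peak lies above the GDP of most countries in a sample, few observations fall on the downward slope, which is consistent with the weak significance of the squared term noted above.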

Secondary school enrollment has a significant negative effect on manufacturing 
employment in the fixed-effects model at the .05 level and in the random-effects 
model with region dummies at the .1 level, although it does not have any effect 
in other models. This provides suggestive evidence that as human capital increases, 
employment in manufacturing decreases. Urbanization has a positive effect in the 
random-effects and random-effects with AR1 models, while it has no effect in the 
fixed-effects and fixed-effects with AR1 models. This suggests that more urban- 
ized countries have a larger portion of their labor force in manufacturing, while 
changes in urbanization within countries do not significantly affect manufacturing 
employment. Population growth has no effect in any model. Gross domestic investment
has a significant positive impact in fixed-effects with AR1 at the .05 and
.1 levels. This suggests that domestic investment enhances the industrialization of 
developing countries. 

Among the geographical regions, only Sub-Saharan Africa differed significantly. 
Even controlling for all variables, Sub-Saharan Africa has significantly less manu- 
facturing employment compared to other regions. Other regions did not have any 
significant differences in either model. 

Among globalization variables, the results for the trade variables are noteworthy. 
Both exports and low-technology exports have a significant positive impact on 
manufacturing employment. There is some variation in the size of the effects of 
these variables depending on the technique. The effect of low-technology exports
is generally more pronounced than that of total exports. This suggests that integration
into the global economy through trade as exporters, particularly exporters of
low-technology products, has increased the level of manufacturing employment in
developing countries. This provides support for globalization theories (e.g., Wood
1994; Dicken 2003), which claim that integration into the global economy can
lead to industrialization. Moreover, this effect remains unchanged when controls for
regional differences are added in models 5 and 6, and when all cross-country differences
are controlled for in fixed-effects models. This suggests that export-oriented
industrialization was not merely an East Asian phenomenon. Last, but not least, raw
material exports have no effect in any model. The industrialization of developing
countries does not appear to be constrained by an emphasis on raw material exports.

Next, inward FDI stock has a significant negative effect in the random-effects
model at the .05 level and in the random-effects with AR1 model at the .1 level.
This suggests that countries with higher levels of inward FDI stock tend to have 
less manufacturing employment. This provides support for dependency theorists. 
The results do not provide support for the export-oriented FDI thesis raised by 
some globalization scholars. Finally, FDI originating from developing countries 
has no effect on manufacturing employment in developing countries. This fails 
to provide support for concerns with deindustrialization in developing countries 
due to outward FDI. 

Models 7 through 10 present models with all four techniques with the inclusion 
of FDI in primary and secondary sectors. The effects of the control variables are generally
the same, while the effects of the other globalization variables are less pronounced. FDI
in the primary sector (mining and agriculture) has a significant negative effect on
manufacturing employment at the .1 level in fixed-effects with AR1 models, while
FDI in the secondary sector (manufacturing) has a significant positive impact at 
the .05 level in fixed-effects, random-effects and fixed-effects with AR1 models, 
and at the .1 level in random-effects with AR1 model. Because of the decline in 
degrees of freedom these results may not be comparable to the previous six models, 
but they do provide suggestive evidence that FDI in secondary sector helps the 
industrialization of developing countries, while FDI in the primary sector, which 
includes agriculture and mining, has a modest negative effect. These results provide 
some support for globalization theorists who see manufacturing foreign investment 
as a positive influence in developing countries; and some support for dependency 
theorists who see primary sector investment and dependence as constraining
industrialization. Finally, BIC’ prefers random-effects with AR1 without region
dummies and FDI sector (Model 4) over the other models.
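
The BIC’ comparison used for model selection can be sketched as follows. The formula is the regression form given by Raftery (1995), BIC’ = n ln(1 - R²) + p ln(n), where lower values are preferred. The R-squared values and predictor counts are hypothetical, and how the fit statistic was obtained for the panel models in this study is not specified here, so this is an illustration of the criterion rather than a reproduction of the reported values.

```python
import math


def bic_prime(r_squared, n_predictors, n_obs):
    """Raftery's BIC' for a regression model; lower values are preferred."""
    return n_obs * math.log(1.0 - r_squared) + n_predictors * math.log(n_obs)


n_obs = 234  # number of country-year observations reported in the text

# Hypothetical fit statistics for two competing specifications.
bic_small = bic_prime(r_squared=0.50, n_predictors=10, n_obs=n_obs)
bic_large = bic_prime(r_squared=0.51, n_predictors=12, n_obs=n_obs)

preferred = "smaller model" if bic_small < bic_large else "larger model"
print(round(bic_small, 1), round(bic_large, 1), preferred)
```

In this made-up case two extra predictors raise R-squared by only .01, so the penalty term outweighs the gain in fit and the more parsimonious model is preferred.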


Models with FDI Flow 


Table 3 presents the models in which the FDI stock variables in Table 2 are replaced
by FDI flow variables. The effects of GDP and other control variables do
not change with the inclusion of FDI flow variables, except for gross domestic
investment, which is now significant in all models.32 Among trade variables, the effect
of total exports is less significant, while low-technology exports are still significant
in most models (both still positive). Raw material exports again have no effect.
There is also no change in the effect of geographic regions. Sub-Saharan Africa 
is again the only region with significantly different manufacturing employment. 

Inward FDI flows do not have any effect on manufacturing employment. 
Interestingly, and distinct from the analyses of stock in Table 2, FDI outflow 
has a significant negative effect at the .1 level in random-effects models, and at the .05
level in random-effects with AR1 models, without the inclusion of the FDI sector
variables. This suggests that investment originating from developing countries
may have a negative influence on manufacturing employment, but the evidence
is not strong. Surprisingly, FDI flow in primary and secondary sectors does not
have any impact on manufacturing employment in developing countries. This does
not provide support for either globalization or dependency theories. BIC’ again 
prefers random effects with AR1 without region dummies and FDI sector (Model 
4) over the other models. 


Discussion 


The emergence and conditions of industrialization in developing countries are a very
important and contested topic in the social sciences. Many scholars think that
industrialization in developing countries is unlikely because of factors such as constant
political conflict and violence, dependence on natural resources in resource-rich 
countries, and exploitation by early-industrialized countries (e.g., Auty 2000; 
Collier et al. 2003; Wallerstein 1974, 1980). Other scholars view industrialization 
as possible only under certain conditions such as the existence of a past industrial 
heritage or the ability to implement selective controls on international trade and 
FDI (Amsden 2001). The increasing integration of the world economy during 
the most recent wave of economic globalization has added to the controversy. 
While many scholars such as Wood (1994), Gereffi (2005) and Dicken (2003) see 
increasing international trade and FDI as positive influences on industrialization 
in developing countries, many remain skeptical and argue that trade and FDI are 
either irrelevant or even harmful to industrialization and development of devel- 
oping countries (Bornschier and Chase-Dunn 1985; Arrighi, Silver and Brewer 
2003). This study addressed the debate by exploring the effect of the latest wave
of economic globalization on manufacturing employment in developing countries 
from 1980 to 2003 with a comprehensive sample. 

This study assessed the evidence for three explanations: economic develop- 
ment/productivity, dependency/world systems, and economic globalization. 
In the past two and a half decades, developing countries have experienced significant
increases in manufacturing employment. However, unlike in early-industrialized
countries, manufacturing employment never became the dominant
form of employment. When manufacturing employment peaked in successful
industrializers such as South Korea, its share of all employment was lower than
the share of service sector employment.

Despite enormous changes in the global economy, the relationship between
economic development and manufacturing employment has some similarities 
to early-industrialized countries. Although many developing countries are con- 
tinuing to industrialize, some such as South Korea and Malaysia are actually 
deindustrializing as they continue to develop economically. GDP
has a highly significant effect in nearly all models, while the square of GDP has a
significant negative impact in some models. This indicates that manufacturing employment
increases at earlier levels of economic development and starts to decline at
higher levels of economic development. The models and the analysis of individual
cases support the arguments made by Clark (1957[1940]), Rowthorn and Wells
(1987), Rowthorn and Ramaswamy (1997) and Krugman and Lawrence (1994).
This finding is important because it indicates that industrialization in developing
countries can be partially understood by looking to the industrialization process 
in early-industrialized countries, and predictions can be made based on the experi- 
ences of these countries. However, the level of economic development does not 
explain all the variance in manufacturing employment in developing countries. 

Economic globalization was a salient influence on the share of manufacturing 
employment in developing countries from 1980 to 2003. The size of exports,
measured by the ratio of all exports to GDP, and the proportion of low-technology
exports among all exports have a strong positive effect on manufacturing employment
in the models. The effect of globalization on manufacturing employment
operates mainly through international trade. Inward FDI stock has a negative impact
on industrialization in some models, while FDI flow has no effect. FDI stock in the
secondary sector has a significant positive effect on industrialization in developing
countries when added to the models, while FDI flows in this sector have no effect.
These results suggest that direct investment in developing countries by foreign companies
may be less important in the global reorganization of manufacturing than the 
trade between these companies and the companies from developing countries. 
Dicken (2003) and Kaplinsky (2000) explain that most of global manufacturing 
is facilitated through subcontracting, in which firms from rich countries transfer 
their manufacturing activities to firms in developing countries. Moreover, these 
companies do not necessarily “own” their subcontractors and regularly shift be- 
tween different companies in different countries (Gereffi 2005). Finally, there is 
also not much evidence for deindustrialization in developing countries as a result 
of outward FDI from these countries. 

Overall, the analyses provide limited support for dependency and world-sys- 
tems theories. Raw materials exports do not have a significant effect on manufac- 
turing employment. FDI stock has a negative effect in some models while FDI 
flows have no effect. Thus, integration into the world economy does not appear to 
have an overwhelming negative effect on industrialization in developing countries. 
Nevertheless, there are important ways in which the sectoral composition of trade 
and FDI constrain the industrialization of developing countries. 

One argument is about the quality of the manufacturing jobs created with
integration into the contemporary global economy. For example, Mander and 
Goldsmith (2001) argue that export-oriented industries exploit the workers in 
developing countries and do not provide a living wage. The images of third-world 
sweatshops that attracted the attention of American university campuses in the 
late 1990s are still very real (Gereffi 2005). In fact, many scholars of globalization
(e.g., Gereffi 2001; Kaplinsky 2000) cite the low cost of production in developing
countries, originating from low labor compensation and loose labor, tax and
environmental regulations in these countries, among the main reasons behind
the global reorganization of manufacturing and the global mobility of capital.
In this manner, Arrighi, Silver and Brewer (2003) argue that although the gap 
in industrialization between first- and third-world countries has diminished, the 
income gap still exists. However, some scholars argue that globalization might 
actually have beneficial effects on workers. For example, Neumayer and de Soysa 
(2006) show that countries more open to international trade have fewer labor 
rights violations, while FDI does not seem to have any impact, be it positive or
negative. In addition, they argue that developing countries that have higher
levels of trade openness and FDI stock have lower levels of child labor force 
participation (Neumayer and de Soysa 2005). Although significant, this debate
is beyond the focus of this article and warrants future research. Researchers can
expand this debate to assess a broader question of the relationships between glo- 
balization, industrialization and well-being in developing countries. 

There are also other issues that the current study could not address. One impor- 
tant question emerges from the near absence of industrialization in some countries, 
especially in Sub-Saharan Africa. These countries and the almost complete absence 
of industrialization should be addressed separately in more detail. In addition, 
more research needs to be done on the role of the service sector in economic
development in developing countries. This and previous studies showed that the 
service sector, rather than manufacturing, became the dominant form of employ- 
ment following the decline of agricultural employment in nearly all develop- 
ing countries. Finally, it would be valuable to examine partner concentration in 
international trade and investment. As suggested by some dependency scholars, 
dependence on a single or a few trade and investment partners can have significant 
consequences for development and industrialization in developing countries. 

Overall, this study contributes to current debates on globalization and to the
longstanding debate on industrialization in non-Western societies, particularly
the effect of external factors in this process of industrialization. It shows that
the globalization of the world economy has emerged as a significant influence on 
industrialization in developing countries in the past two and a half decades. This 
implies that a comprehensive understanding of contemporary economic and social 
structures in developing countries requires a thorough understanding of the latest 
wave of globalization. At the same time, globalization is not the most important 
factor, and domestic factors still shape industrialization and development. Finally,
although economic globalization seems to result in spreading industrialization 
around the world, the question of possible exploitation of developing countries 
by rich countries still merits scholarly attention. 


Notes 


1. Lall (2000) shows that from 1985 to 1998, the manufacturing exports of developing 
countries increased 12.5 percent on average annually. He also shows that developing 
country exports constituted 25 percent of all manufacturing exports in 1998, up from 
16 percent in 1985. My calculations from the World Trade Analyzer (Statistics Canada
2003) using Lall’s (2000) classification show that it was about 35 percent in 2003.


2. I do not mean to imply here that there is a consensus in social theory on the 
emergence of industrialization in developed countries. In fact, there have been serious 
disagreements and debate among scholars. For example, Mendels (1972) and Tilly 
(1983) argue that in continental Europe, unlike the factory-based industrialization 
in Britain, industrialization started in rural areas by the multiplication of small 
production units, which they label as proto-industrialization. Sewell (1986) and Perrot 
(1986) explain that in France, industrialization took place gradually with artisans 
playing an important role well into the 20th century. Robinson and Briggs (1991)
found that 19th century industrialization in Indianapolis, Indiana emerged from
artisan production. However, whatever path they followed, in all early-industrialized
countries the agriculture-based economies gave way to the industrial economies in
which manufacturing became the main source of employment, except for the United
States in which manufacturing and service employment went hand-in-hand during
industrialization (Carter 1997). Then, in time, service sector employment increased
and overtook manufacturing to become the main source of employment (Clark
1957[1940]).


3. Rowthorn and Wells (1987), and Rowthorn and Ramaswamy (1997) warn that
analyzing deindustrialization in terms of the share of manufacturing in national
income can be misleading. They offer the concept of “positive deindustrialization”
where the decline in manufacturing occurs when productivity grows at a rate higher
than the growth in output.


4. There is a rich literature on the internal factors leading to economic development
and high productivity in developing countries. Most prominent are the studies
of institutional structures. Some scholars argued that central planning, control of
workers and educational expansion by the developmental states led to increasing
productivity and economic growth in some countries (e.g., Evans 1995; Wade 1990).
Some argued that culture played a crucial role in the successful industrialization and
development of some developing countries by contributing to labor productivity and
discipline (see Gereffi and Fonda 1992). The role of the colonial past was also put under
scrutiny by development scholars (e.g., Kohli 2004). Although very important, this 
literature is beyond the scope of the present study, which concentrates on the effect 
of economic globalization. 

5. See Gereffi (1983) and Evans (1979) for comprehensive reviews of dependency and 
world systems schools. 


6. Wallerstein also theorized a semi-periphery which engages in limited industrial 
production. 


Nevertheless, Firebaugh (1996, 1992), Szymanski (1983), and de Soysa and Oneal
(1999) contest the claims of the dependency school. 


Cardoso and Faletto (1979:xxii) state that “...peripheral industrialization is based 
on products which in the center are mass consumed, but which are typically luxurious 
consumption in dependent societies. Industrialization in dependent economies 
enhances income concentration as it increases sharp differences in productivity 
without generalizing this trend to the whole of the economy.” Then they argue 
that “The type of technology adopted by the most modern sectors used little labor 
and thus increased what seemed to be an overt incapacity to solve occupational 
demand through industrialization. The creation of new industrial sectors dismantled 
handicraft ones, destroying more jobs than creating new ones.” (Cardoso and Faletto 
1979: 4-5) Although Cardoso and Faletto’s views do not fully represent dependency 
and world system scholars after them, their argument indicates a possible negative 
influence of dependency on manufacturing employment. 


Skeptics of globalization reject the idea that the world economy in the late 20th century
and afterwards is different from earlier periods, specifically the late 19th century.
Therefore, they argue against building any theoretical and empirical endeavor on
this assumption. Hirst and Thompson (1996) claim that developments labeled as
globalization are not historically unprecedented. They argue that integration and
interdependence among countries were higher in the late 19th century, with higher
trade-to-national-output ratios and more open economies. Bairoch (1996) claims that the
trends in the volume of trade and FDI flows do not back the idea of globalization.
Obstfeld and Taylor (2003) argue that foreign capital is now discouraged more than
it was a century ago. Following Robertson (1992), Held and colleagues (1999) and
Therborn (2000), I perceive globalization as a long historical process, accepting the
fact that the integration of the world economy started before the late 20th century
but maintaining the idea that the current era has distinctive properties. Baldwin and
Martin (1999) argue that trade/GDP, foreign direct investment/GDP and liberalism
are quite similar between the two periods, but the current period is different because
of the dramatic reduction in transportation and communication costs. Held and
colleagues (1999) think that the current era is distinct in its extensiveness and its
real-time, round-the-clock economic transactions. Dicken (2003) says that unlike the
shallow integration of the earlier periods, the current global economy (post-WWII) is
characterized by the functional integration of economic activities.


HO basically contends that the relative abundance of factors of production (land, 
labor and capital) creates the comparative advantage of a country. An extension of 
this model is the Heckscher-Ohlin-Samuelson Theorem, which theorizes factor price 
equalization as a result of trade between two countries (Rodrik 1997). 


Wood (1995), however, stresses that the developing countries need to provide at least
basic levels of education to their labor forces in order to utilize their comparative
advantage.


These countries could now be classified as developed or high-income countries. For 
example, Singapore now has a relatively high GDP per capita. But my contention
is that such countries should be included in a sample of developing countries in
1980. These countries fit the developing country “criteria” in 1980 (see Note 14).
Any sample of developing countries should include both the persistently poor
countries and the few that were considered poor but have succeeded in improving
their status significantly. Also, it would be interesting to see if FDI outflow caused
any contraction in manufacturing employment in these countries.


Developed countries are not included in the sample because the changes in these
countries are well documented. 


Argentina had a GDP per capita of more than $5,000 in 1980, but that figure fell in
following years. Though Singapore later rose to a very high level of economic development,
at the beginning of 1980 its GDP per capita was below $5,000. Hence, it is essential
to include in any sample the countries that were able to pull themselves out of the
developing country category. To my knowledge, there is not a standard definition of
“developing countries.” I used a GDP per capita of $5,000, assuming that countries
over this benchmark had achieved significant economic development. The countries
with populations of less than 500,000 were not included in the sample because they
may not be comparable as units of analysis.


The countries in the sample are Algeria, Angola, Argentina, Bangladesh, Benin, 
Bolivia, Brazil, Burkina Faso, Cameroon, Central African Republic, Chad, Chile, 
Colombia, Congo Democratic Republic, Congo Republic, Costa Rica, Côte d’Ivoire,
Dominican Republic, Ecuador, Egypt, El Salvador, Fiji, Gabon, Gambia, Ghana, 
Guatemala, Honduras, India, Indonesia, Iran, Jamaica, Kenya, Madagascar, Malaysia, 
Mali, Mauritania, Mauritius, Mexico, Mozambique, Myanmar, Nepal, Nicaragua, 
Pakistan, Panama, Papua New Guinea, Paraguay, Peru, Philippines, Rwanda, Senegal, 
Sierra Leone, Singapore, South Korea, Sri Lanka, Sudan, Thailand, Togo, Trinidad 
and Tobago, Turkey, Uganda, Uruguay, Venezuela, Yemen and Zambia. 


Countries that are still governed by communist parties, China and Vietnam, are also 
treated as ex-communist countries. The results and conclusions are robust if China and 
Vietnam are excluded from the ex-communist category and included in the sample. 


The set of countries with available data changes from year to year because of missing data.
When I restrict the calculation to countries with available data for all years, the mean is
19.8 percent in 1980 and 23 percent in 2003.


I ran the models both with GDP PPP per capita and GDP per capita (World Bank 
2007) and the results were basically identical. Although PPP is not an ideal measure, 
it may have some reliability advantage for cross-country comparisons because it takes 
into account the differences in cost of living. 


One might argue that the age structure instead of population growth may be more 
important here. The results were robust when I substituted population growth with 
population under age 15, between ages 15 and 65, and the ratio of population under 
15 to population between 15 and 65. 

Enrollment rate is calculated by the World Bank (2007) as the ratio of all the students 
in secondary schools, regardless of age, to the total number of people at secondary 
school age. 
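
The ratio described in this note can be illustrated with a minimal sketch. The function name and the figures below are hypothetical and are not taken from the World Bank data or from the analysis; the sketch only shows that, because students of any age are counted in the numerator, the ratio can exceed 100 percent.

```python
def gross_enrollment_ratio(enrolled_all_ages: int, school_age_population: int) -> float:
    """Return secondary enrollment as a percentage of the secondary-school-age population.

    The numerator counts every enrolled student regardless of age, so the
    result is not capped at 100.
    """
    if school_age_population <= 0:
        raise ValueError("school_age_population must be positive")
    return 100.0 * enrolled_all_ages / school_age_population


# Hypothetical figures, for illustration only.
typical = gross_enrollment_ratio(enrolled_all_ages=450_000, school_age_population=1_000_000)
with_overage_students = gross_enrollment_ratio(1_050_000, 1_000_000)

print(f"{typical:.1f} percent")
print(f"{with_overage_students:.1f} percent")
```

A value above 100 in the second case reflects over-age or under-age students, not an error in the data.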


I also used the portion of the labor force with secondary education as provided by the
World Bank (2007) to measure education. However, because of missing data the
number of observations declined to 57 and the number of countries included in the
analysis declined to 31.

I also considered net migration as a measure of labor globalization. However, this 
variable seems prone to simultaneity bias where manufacturing employment drives 
migration. 


The Lall (2000) classification is composed of five groups ranging from primary 
products (PP), resource-based products (RB) and low-technology products (LT) to 
medium-technology (MT) and high-technology (HT) products. The trade data are
classified into Lall’s (2000) groups at the three-digit level of the Standard International
Trade Classification (SITC) using UN international trade data through the World Trade
Analyzer (WTA) (Statistics Canada 2003). The UN’s COMTRADE Database is also used
to supplement WTA when necessary (United Nations 2005).


In analyses available upon request, I also included raw material exports in the
models without subtracting oil and gas exports. The results were very similar. 


Foreign direct investment is measured as the ownership of 10 percent or more of an
economic establishment by a foreign entity (UNCTAD 2005). 


By contrast, Firebaugh (1992) critiques the conclusions of dependency scholars 
regarding FDI. It is important to emphasize that Firebaugh’s critique should not 
necessarily be applied to manufacturing employment. Because growth and GDP are 
not dependent variables here (and are controlled for), the debate over denominator 
effects should be less of a concern. 


Thirty-one countries in the sample have an outward FDI stock larger than 1 percent 
of their GDP in the latest year with available data, while 11 countries have more than
5 percent. 


The results of these analyses can be found at http://www.uncw.edu/soccrj/about- 
faculty-kaya.html. In these models, I also present the variance inflation factor values. 
The VIF values range from 1.373 to 4.425 with an average of 2.334. These suggest
that there is not a significant collinearity problem among the independent variables.
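
For readers unfamiliar with the statistic, the variance inflation factor for a predictor is 1 / (1 - R²), where R² comes from regressing that predictor on the other predictors. The sketch below is not the code used for the analyses; it covers only the special case of two predictors, where R² reduces to the squared Pearson correlation, and the function names and data are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("sequences must have equal length of at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def vif_two_predictors(x1, x2):
    """VIF when the model has exactly two predictors: 1 / (1 - r^2)."""
    r_squared = pearson_r(x1, x2) ** 2
    if r_squared >= 1.0:
        raise ValueError("predictors are perfectly collinear")
    return 1.0 / (1.0 - r_squared)


# Hypothetical predictor values, for illustration only.
x1 = [1, 2, 3, 4, 5]
x2 = [2, 1, 4, 3, 5]
print(round(vif_two_predictors(x1, x2), 3))
```

With more than two predictors, R² would have to come from a full auxiliary regression, which this sketch does not attempt.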


Another issue is the possible existence of endogeneity and simultaneity among the
variables, which is always a concern with this kind of research. Although it is not
possible to fully address these concerns, a few things can be said. First, I utilize several
statistical techniques to estimate the models. In particular, FE models (and FE-AR1
models) are considered a more stringent approach against reverse endogeneity
(Winship and Morgan 1999). As for the relationship between economic globalization 
variables and industrialization, there is enough evidence in the literature to suspect 
that economic globalization (FDI and trade) may precede and affect the level of 
manufacturing employment (e.g., Wood 1995; Brady and Denniston 2006; Villareal 
and Yu 2007) rather than the opposite. 


I also ran OLS with Panel Corrected Standard Error models as suggested by Beck 
and Katz (1995). The results of the PCSE models are generally similar to the FE and 
RE models. However, the unbalanced nature of my data constitutes a problem for 
running PCSE models. Because of missing data, many countries in the analyses have
data for only a few of the six time points, and there is not a time point common to all
countries. In addition, Beck and Katz (1995) warn against using their model with
fewer than 10 observations.


If I run only the linear term, the model fit is significantly lower and the effect of GDP,
indicated by standardized coefficient and statistical significance, is less pronounced. 


This does not change when stock and flows are included in the model.


However, in a recent study, Martin and Brady (2007) found no relationship between 
unionization and globalization.


For Export Only: Diffusion Professionals
and Population Policy 


Deborah Barrett, University of North Carolina 
Charles Kurzman, University of North Carolina 
Suzanne Shanahan, Duke University 


Export-only diffusion occurs when innovators do not adopt an innovation themselves,
but rather promote it to others for adoption. Potential adopters do not take their cues
from early adopters, but rather from diffusion professionals who make it their job
to spread a practice or institution. The global spread of national-level, population-control
policies during the Cold War is one such instance: developed and promoted
by wealthy countries that did not themselves adopt such policies, they came to be
widespread among poorer countries, thanks in large part to the mobilization of diffusion
professionals. This article offers an analytical account of this diffusion, as well as
an event-history analysis of 163 countries over the period 1950-1990 demonstrating
the importance of linkages between policy adopters and the non-adopting institutions
of diffusion professionals.





Diffusion processes are among the most widely cited phenomena in the social sci- 
ences. They are central to the study of organizational behavior, social movements, 
demographic transition, health-related behaviors, national policies and numerous 
other subjects (Casterline 2001; Elkins and Simmons 2005; Evans 2004; Levi-Faur
and Jordana 2005; Lopes and Durfee 1999; Meseguer 2005; Rogers 2003; Strang 
and Soule 1998; Wejnert 2002). In general, these analyses identify three stages of 
diffusion: the emergence of innovation, the first acceptance of the innovation by 
early adopters, and the spread of the innovation—whether through parallel situations, 
imitation, learning, persuasion, coercion or other mechanisms—to late adopters. 

Recent work on institutional diffusion, however, tends to collapse the first two 
stages into a single process. In these studies, early adoption is treated as equivalent 
to innovation. A prominent recent study by Simmons and Elkins (2004), for 
example, examines the timing of neo-liberal economic policies around the world, 
without a separate discussion of the emergence of the neo-liberal innovation. In 
other important recent studies, the Watts area in Los Angeles is both the innovator
and first adopter of the urban racial riot (Myers 2000), the Geological Society of 
London is both innovator and first adopter of the geological professional associa- 
tion (Schofer 2003), and so on. 


We thank Nitza Berkovitch, Gili Drori, Laura Gómez, John Meyer, Layna Mosley, Ijlal Naqvi,
Manuela Romero and the Social Forces editor and reviewers for their suggestions and encouragement 
throughout this long project. We thank Mark Crescenzi and Catherine Zimmer for their statistical 
assistance, and Gary Thompson and Amanda Henley for producing the world map that appears in 
Figure 2. Direct correspondence to Deborah Barrett, School of Social Work, University of North 
Carolina, Chapel Hill, NC 27599-3550. E-mail: dbarrett@unc.edu. 

Figure 1. Information Diffusion

Figure 1a. Regular Diffusion
Early adopter gets information from innovator; later
adopters get information from early adopters.


Individual-level studies, by contrast, more frequently maintain a distinction 
between innovation and early adoption. The classic work by Everett Rogers 
(2003:282-83), first published in 1962, associates each stage with contrasting per- 
sonality types: innovators are “cosmopolites” with “substantial financial resources” 
and an obsession with venturesomeness; early adopters, by contrast, are “localites” 
in a position of “opinion leadership” within a social system. Even in Rogers’ treatment,
however, the terminology overlaps. Rogers’ book includes a diagram from a
1950s study of pesticide use in an Iowa farm community, in which the “innovator”
is identified as the first farmer in the neighborhood to adopt the new weed spray,
and the “early adopter” is the second farmer in the neighborhood to adopt the
spray. The agricultural scientists who developed the pesticide and marketed it to
the farmers are represented by a cartoon character in a lab coat holding a vial and
standing outside of the neighborhood diagram (see Figure 1a).

Figure 1b. Export-Only Diffusion
Most adopters get information directly from
innovator, not from other adopters.

Note: Adapted from Rogers (2003:293)

Several recent studies have brought the lab coat innovators, or their equivalents,
into institutional-level analyses. For example, Abrahamson and Fairchild (1999) 
analyze the role of “management-knowledge entrepreneurs” —academics, journal- 
ists and management gurus—in the diffusion of “quality circle” techniques among 
business leaders, regardless of whether the entrepreneurs adopted these techniques 
themselves. Van den Bulte and Lilien (2001) emphasize the influence of drug- 
company marketing campaigns in the diffusion of the drug tetracycline. Kaufman 
and Patterson (2005) discuss the role of elites and “cultural entrepreneurs,” who 
may or may not be athletes, in the spread of cricket playing. Van Dyke, Dixon and 
Carlon (2007) focus on the role of union recruitment programs in diffusing labor 
activism to non-conventional constituents. In each of these studies, innovators try 
to promote their innovations to potential adopters. In some cases, innovators may 
overlap with adopters. In others, innovators may be blocked from adopting the 
innovation themselves—they may be too old to play cricket, for example, or not 
institutionally placed to implement a corporate or governmental policy. We pro- 
pose that the distinction is of considerable theoretical and empirical significance, 
beyond these particular case studies. 


Export-Only Diffusion 


At airport shops in various poor countries one may find goods with the label 
“export only” or a similar designation. The label seems intended to signal that the 
product is of higher quality than one would have found outside of the airport, 
and to justify the premium charged in the airport. Conversely, in rich countries 
“export only” labels may be slapped on goods that are inferior—too dangerous to 
meet local standards, for example, in the case of certain pesticides. One class of 
export-only goods in rich countries is policy advice that involves an element of 
“do as we say, not as we do.” A considerable industry of specialized consultants 
has developed to serve the export-only policy market, traveling the world to offer 
advice that is not applicable, for one reason or another, at home. 

Diffusion professionals, whom Rogers (2003:365-401) calls “change
agents,” are not limited to the international arena, but can be found at all levels
of analysis: management consultants for businesses, for example, and a host of 
parallel industries for individuals, including financial consultants, psychothera- 
pists and salespeople. What these roles have in common is a professional interest 
in diffusing behaviors that they do not necessarily adopt themselves. This distinc- 
tion does not have to involve deception or exploitation; it can just as easily be 
humanitarian, such as public health activists who work to promote safe behaviors 
by governments, corporations and individuals.

The existence of diffusion professionals is not particularly novel, although the 
scale of these industries may be broader than is commonly acknowledged in dif- 
fusion studies. What is novel is the recognition that the pattern of export-only 
diffusion differs from the usual image of diffusion, in that some of the key agents 
of change remain outside the network of adopters. In an extreme case where diffusion
professionals dominate the diffusion process, adopters would be attuned
very little to early and potential adopters, and would be entirely oriented toward 
the message of the outside innovators (see Figure 1b). The center of these networks 
of adoption is occupied not by influential early adopters but rather by innovators, 
promoters, educators and other professionals. 

In addition to differences in pattern, the linkages in export-only diffusion 
are potentially quite different from the linkages between early adopters and later 
adopters. Diffusion studies generally emphasize the acceleration effect of similari- 
ties between early and later adopters. Indeed, the logic of these studies is typically 
to introduce a puzzle of seeming heterogeneity among adopters, and then to 
resolve the puzzle through the statistically significant coefficient of some element 
of homogeneity. Export-only linkages, by contrast, can be analyzed in terms of 
fundamental dissimilarity between innovators and adopters. These dissimilarities 
may involve: 


a. Scale. The innovator may be an individual or organization 
that is not on the right scale to implement the innovation, 
for example a policy advocacy group that is not itself a 
government, and therefore cannot adopt the policy that it 
is advocating. 


b. Location. The innovator may be situated in a location 
where adoption seems unnecessary, for example public 
health activists who promote the boiling of water before 
drinking it, but who live in places that offer potable tap water. 


c. Class. Innovators may be blocked by poverty from adopting 
the behaviors they recommend, for example high-end real- 
estate agents or stereo salespeople who cannot afford the 
latest fashions that they peddle to their customers. 


d. Identity. The innovator may simply not be the sort of
person or institution for whom adoption makes sense, for
example eugenicists who encourage the “less fit” to limit
their reproduction, but who invariably consider themselves
among the genetically fit.


We follow Strang and Meyer (1993:487, 490) in emphasizing “theorized accounts
of actors and practices” that construct categories of potential adopters as
particularly appropriate, so that “where actors are seen as falling into the same
category, diffusion should be rapid.” Elsewhere, Meyer (1994:31) discusses the
rise of “rationalized others,” professionals whose claim to authority rests in their
disinterestedness and expertise, and who “give advice to nation-state and other 
actors about their true and responsible natures, purposes, technologies, and so on. 
Rationalized others are now everywhere.” (Meyer et al. 1997:162) We propose that 
export-only diffusion combines these two insights, in which potential adopters are 
conceptualized as both different from the innovators and similar to one another. 

Export-only diffusion is an ideal type, but we argue that it can be observed in 
action in the diffusion of national population policies during the Cold War period. 


Population Policy for Export 


The first fertility-control policy of the modern era was enacted in India in 1952
(Planning Commission 2003). But India didn’t invent the population policies that 
it adopted. These were the product of an intellectual movement in the United 
States and Great Britain called “demographic transition theory.” This theory noted 
that population numbers in Western Europe had increased significantly over time 
as improved national wealth and public health lowered mortality rates. Eventually, 
fertility rates declined and population growth stabilized, or even reversed course.
However, the lag between reduced mortality rates and reduced fertility rates posed 
a troubling specter for poorer regions of the world that were beginning to experi- 
ence—or were expected to experience—lower mortality rates. Demographers set 
out to explore how long high fertility rates might last, how large these populations 
might get before fertility rates matched mortality rates, and what effect population 
growth might have on per capita income (Szreter 1993). 

Demographic transition theory was a striking departure from earlier theories of 
population growth, which had equated national welfare with larger populations 
(Barrett 1995; Barrett and Frank 1999; Overbeek 1974). Governments through- 
out history had almost universally sought to encourage reproduction, not restrain 
it. There were no examples of successful state-led fertility-control programs in 
core countries for diffusion professionals to point to as models for other countries 
to imitate. Indeed, the main Western experience with fertility control—eugenic 
sterilization—came to be associated with Nazi Germany and was treated as an 
embarrassment to be hidden, not a model to be adopted by other parts of the 
world (Barrett and Kurzman 2004). Instead, fertility-control policies were to be 
pioneered in “underdeveloped areas,” as defined by the new focus on annual na- 
tional income per capita (Arndt 1987; Cooper and Packard 1997; Escobar 1995). 
Sites for adoption were identified according to the West’s Cold War priorities 
(Greenhalgh 1996; Hodgson 1983, 1988; Szreter 1993). During World War II, 
the U.S. State Department funded the Office of Population Research at Princeton 
University to expand its focus to include Asia. This shift was not decided on ex- 
clusively demographic grounds, since population growth was on average higher in 
Latin America (United Nations 1952). Rather, the most populous countries of the 
world were in Asia, and these promised to be the most prominent battlegrounds 
for global competition with the Soviet Union. Strategic thinkers in the West
considered mass poverty to be fuel for communist organizing, and saw population 
control as a means of reducing this potential threat. 

Within Asia, India was identified as having particularly thorough population 
data and as typical of other underdeveloped countries (Notestein 1945, 1951; see 
also Mamdani 1972). Kingsley Davis’ influential study of the population of India 
and Pakistan, for example, remarked that America and the Indian subcontinent 
were fundamentally “opposites,” while South Asia was representative of
“comparable areas” in “the crowded agricultural regions of the world.” (Davis 1951:vii)
In 1949, the Population Commission of the United Nations asked the Secretary- 
General to select a country to approach for an initial study whose results would 
be generalizable to “under-developed countries.” (United Nations 1950:650) The
Secretary-General selected India (United Nations 1951). 

Indian advocates of population control—themselves frequently trained in the 
West—also emphasized India’s difference from the West. Lady Dhanvanthi Rama
Rau, founder of the Family Planning Association of India, said she became in- 
volved in family planning because of the contrast in living conditions that she 
witnessed when she returned from Washington, D.C., to India in 1949, and 
on the advice of “demographers from abroad as well as Indian experts.” (Rama
Rau 1977:241-50) A few Indian intellectuals and activists had supported fam- 
ily planning since the 1920s, and had even gotten several measures included in 
the pre-independence platforms of the Congress movement (Connelly 2003; 
Srinivasan 1995). In 1952, Indian government officials consulted “dog-eared” 
copies of Davis's (1951) book (Van der Tak 1991). 

At the same time, population control in India faced opposition from two sig- 
nificant movements. First, the leader of the independence movement, Mahatma 
Gandhi, was absolutely opposed to the use of contraceptives, which he believed 
would encourage people’s base urges: “Increase in population is not and ought not 
to be regarded as a calamity to be avoided. Its regulation or restriction by artificial 
methods is a calamity of the first grade whether we know it or not. It is bound to 
degrade the race, if it becomes universal, which, thank God, it is never likely to 
be.” Rather, “the curse of unwanted children” should be solved “by the sovereign 
remedy of self-control.” (Gandhi 1947:173-74) This ethical objection paralleled 
the religious objections of some Catholics, including the French and Belgian 
governments (Symonds and Carder 1973). 

A second source of opposition lay in socialist ideologies that considered popu- 
lation control to be a poor substitute for the redistribution of wealth and rapid 
industrialization. Prime Minister Jawaharlal Nehru expressed this reservation in a 
message to an international family planning conference that was held in India in 
1952, on the initiative of the renowned American birth-control activist, Margaret 
Sanger (Rama Rau 1977; Wadia 2001). “Some people imagine that almost all the 
ills of India are due to over-population and therefore the basic remedy for those 
ills is to try to limit the growth of this population. This approach, to some extent, 
diverts attention from important social problems to the population aspect of them. 
I think this is wrong.” (International Planned Parenthood Federation 1952:2) 
This objection echoed the opposition of the Soviet Union and other state socialist 
governments to population control (Besemeres 1980; Symonds and Carder 1973). 
Both forms of opposition maintained the distinction between population policy 
innovators and potential adopters: indeed, they accused the United States of trying 
to foist these policies on other countries. 


Population Policy Diffusion 


Notwithstanding these objections, the Indian government approved family plan- 
ning’s inclusion in the first Five Year Plan (Srinivasan 1995). Western population 
experts continued to focus on India as its example was expected to have significant 
implications for other poor countries. A major study of India by the Princeton 
Office of Population Research noted that “it was natural to see whether our analysis
of India’s prospects applied elsewhere.” The study examined the situation in Mexico,
selected for its contrast with India—more unused arable land, deeper international 
economic linkages—and found that “even when we deliberately search for differ- 
ences between the Mexican setting and that of India, we still come to the conclu- 
sion that a reduction in fertility would add substantially to total output.” The book 
then extrapolated to all countries with low per capita incomes, high birth rates and 
increasing economic productivity (Coale and Hoover 1958:4, 319-20). 

In the typical diffusion models, other poor countries should have looked to 
India when considering their own adoption of a population policy. The countries 
that adopted soonest would be those with the closest ties to India, or the most 
similarities with India, or the most respect for India, or the greatest ability to 
learn from India’s experience. For several early adopters, we can trace one or 
more of these pathways: Pakistan, the second adopter in 1960, shared a common 
colonial heritage with India. Fiji, which adopted a population policy in 1962, had 
a significant minority population of Indians. Other adopters are harder to link to 
India: South Korea, for example, which adopted a population policy in 1961, and 
African countries that adopted policies in the late 1960s and early 1970s. There 
is little evidence that these countries learned of the benefits of population policies 
directly from India, and it is not clear what they would have learned, since India’s 
population growth was not appreciably lower than that of countries without population
policies: both India and Asia as a whole grew at 1.9 percent per year during the
1950s (United Nations 1961). Over the subsequent decades as well, national 
population-control policies were not statistically associated with lowered fertility 
rates (Schultz 1994; Tsui 2001).

At the same time, many countries were clearly influenced directly by diffusion 
professionals in the United States, Great Britain and other Western countries. For 
example, diffusion professionals sought to expand their own cadre by training
experts from the potential adopter countries. In one program, the Population Council
granted approximately 1,500 fellowships for graduate training in the half century 
after 1952. Through the auspices of the United Nations and other international 
organizations, and funded by development agencies in wealthy countries, demo- 
graphic programs were established in the developing world, beginning with centers 
in India and Chile in 1957 (Donaldson and Tsui 1990; Menken, Blanc and Lloyd 
2002). Diffusion professionals continually urged the expansion of such programs, as 
in a meeting in 1989, when “a very large number of delegates stressed the need for 
training of demographers, population specialists, programme managers and others 
involved in the population field.” (United Nations Population Fund 1989:73) 

To promote adoption among other countries, diffusion professionals organized 
themselves into an array of agencies such as the International Planned Parenthood 
Federation and the International Union for the Scientific Study of Population. 
These groups began in wealthy countries but increasingly sought partners in the 
developing world; by 1990, both organizations had members in more than 100 
countries (Barrett 1995). Several Indians joined core institutions of diffusion 
professionals: Rama Rau became co-president of the IPPF in 1952; India joined 
the Population Commission at the United Nations in 1955, though its representative
was not particularly influential (Symonds and Carder 1973); and many Indian
specialists in population studies and related fields took positions in the United 
States and Western Europe. But few countries encountered population-control 
ideas via contact with Indians. In South Korea, for example, an Anglo-Canadian 
representative of the IPPF met with local family planning advocates in 1960; a 
Korean chapter of the federation was formed in early 1961; and the Korean gov- 
ernment adopted a population policy later in 1961 (Kim, Ross and Worth 1972). 

In addition, diffusion professionals in the core of the world system lobbied private 
funders, their own governments and intergovernmental agencies to support poten- 
tial adoption abroad (Connelly 2008). A network of American foundations joined 
early supporters like the Carnegie Corporation and the Rockefeller Foundation in 
donating millions of dollars to the effort (Caldwell and Caldwell 1986; Connelly 
2008; Harkavy 1995). In addition, a series of official bureaucracies emerged for the 
study, promotion and implementation of population policies, including various 
units of the United Nations (Symonds and Carder 1973; Johnson 1987), the World 
Bank (Sai and Chester 1990), and development agencies in the United States and 
other wealthy countries that did not have population-control policies themselves 
(Baron 1981; Donaldson 1990; Ethelston 2004; Wolfson 1983). 

Diffusion professionals organized a series of international meetings on popula- 
tion problems, to which potential adopter countries were invited to send repre- 
sentatives. Most prominent among these were the world population conferences, 
which attracted delegates from many countries: 74 countries were represented at 
Rome in 1954, 88 at Belgrade in 1965, 136 at Bucharest in 1974, 146 at Mexico 
City in 1984, and 179 at Cairo in 1994. Despite significant opposition by
socialist governments, the Bucharest conference ratified a World Population Plan of
Action to “promote the development and implementation of population poli- 
cies where necessary... Countries which consider that their present or expected 
rates of population growth hamper their goals of promoting human welfare are 
invited, if they have not yet done so, to consider adopting population policies 
within the framework of socio-economic development.” (Singh 1979:27-28) These 
conferences allowed government officials from various countries to learn the latest 
models developed by scientific experts and to be impressed by the large numbers 
of other countries that had also expressed an interest in population policy by 
sending their own delegations. Conference attendance was both an indicator of a 
government's interest in adoption and a spur to adoption. 

It should come as no surprise, then, that quite similar population policies were 
adopted by governments that agreed to view themselves as part of the developing 
world. These policies converged on the global model of population control being 
promoted by diffusion professionals (Crane 1993; Kirby and Kirby 1996). The
preambles to the policies, for example, typically echoed one another:


Ghana: Unless birth rates can be brought down to parallel falling death rates, 
Ghana's population will climb at a rate dangerous to continuing prosperity, and 
the children of the next few generations will be born into a world where their very 
numbers may condemn them to life-long poverty. (Ghana 1969:3) 


El Salvador: The high density and high rate of increase of the population, as compared
with the limitation of natural and financial resources[,] create the necessity for
taking action with regard to both aspects of the problem: from the economic point
of view, we must seek an increase in productivity of those elements which participate
in the economic process and from the point of view of population we must seek
simultaneously to reduce the rate of increase, to rationalize its geographic distribution
and to give our people a greater capacity in the struggle for life. (El Salvador 1974:7)


Philippines: Whereas, the United Nations General Assembly has declared and des- 
ignated 1974 as World Population Year; Whereas, such a declaration is envisaged as 
an important milestone in a continuing global effort to generate awareness, under- 
standing and concern about the problems of a rapidly growing population and the 
need to apply and direct a concerted action to achieve a rational balance between 
population and resources and improve the quality and dignity of human existence; 
... all departments, agencies and instrumentalities of government [shall] assist the 
National Committee for World Population Year in carrying out the programme
and objectives of the observance of World Population year in 1974. (Philippines 1974:13)


Diffusion professionals became increasingly sophisticated and self-reflexive 
about their promotional activities. They developed score cards to publicize the 
spread of population policies and to put laggards on notice, beginning with the 
United Nations Inquiry on national population policies in 1963 (United Nations 
1988). This information was formalized into a data bank at the United Nations
Population Division in the 1970s (Singh 1979), and then published in book form 
every other year beginning in 1987 (United Nations 1987). In 1988, the United 
Nations convened an Expert Group Meeting on International Transmission of 
Population Policy Experience, which evaluated the success of diffusion and offered 
recommendations for greater effectiveness (United Nations 1990). 

Within a few years of that meeting, however, the population policy environ- 
ment changed radically. In the early 1990s, the international campaign for popu- 
lation policy largely shifted its emphasis from neo-Malthusian considerations 
of population and economic development to feminist-derived considerations of 
women’s reproductive rights (Barrett 1995; Eager 2004). The term “population 
control” itself dropped out of usage among diffusion professionals. This shift in 
ideology, coinciding with the end of the Cold War, marked a new era that we 
leave for separate analysis. 


Diffusion Data 


Which countries adopted policies to reduce fertility? Figure 2 maps the 60 coun- 
tries that adopted such policies across the globe in the years 1945-1990, as col- 
lected from the United Nations Global Population Policy Database and other 
sources. The map illustrates the finding that population-control policies are all but 
absent among the innovators of demographic transition theory, notwithstanding 
the efforts of the Zero Population Growth movement and other organizations 
that defined population problems in terms of consumption rather than in the 
dominant terms of fertility rates, life expectancy and per capita wealth. By 1990, 
more than half of the countries categorized as low or lower-middle income by the 
World Bank in 1990 had adopted such policies, as compared with six of 41 upper-middle
or high-income countries (World Bank 1992). The map also reveals that
there is little regional clustering of policy adoptions by period. Early adopters do 
not appear to be concentrated in any one region, nor do later adopters. 

Our objective is to explain this pattern of policy adoption. Accordingly, we 
model the adoption rate of national population-control policies using event- 
history analysis, a dynamic method that takes into account both the timing and 
sequence of events. Our dependent variable is the rate at which countries adopt 
their first national policy to reduce fertility. Based on an analysis of the base 
hazard rates over time, we employ a Cox proportional hazards model, presenting 
exponentiated coefficients. The analysis contains information on 163 countries.
These countries may have several missing spells due to unavailable variables, but
some information about each remains in the model. Countries enter the risk
set (units at risk of adopting a population-control policy) in 1950 or the year of 
independence, whichever comes later. They leave the dataset once they experience 
an event (the adoption of a population-control policy). The data are censored to 
the right, stopping in 1990. The data are censored to the left in one case, where a 
country adopted a policy before independence (Fiji). In the multivariate analyses, 
12 countries are dropped for lack of consistent data. Figure 3 depicts the cumu- 
lative count of population-control policy adoption worldwide. All models are 
estimated in STATA 10.0. 
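
The risk-set rules described above can be illustrated with a short sketch. This is not the authors' code: the function name, the row layout and the handling of the left-censored case are assumptions made for illustration only. The example call uses Ghana, which became independent in 1957 and adopted its policy in 1969.

```python
from typing import List, Optional, Tuple

START_YEAR = 1950  # first year of observation
END_YEAR = 1990    # right-censoring year


def country_spells(independence: int,
                   adoption: Optional[int]) -> List[Tuple[int, int]]:
    """Return (year, event) rows for one country (hypothetical helper).

    A country enters the risk set in 1950 or its year of independence,
    whichever is later, and leaves after the year in which it adopts.
    A country that has not adopted by 1990 is right-censored, so every
    one of its rows has event = 0.
    """
    entry = max(START_YEAR, independence)
    # Assumption: a country that adopted before entering the risk set
    # (the left-censored case) contributes no at-risk years here.
    if adoption is not None and adoption < entry:
        return []
    last = END_YEAR if adoption is None else min(adoption, END_YEAR)
    rows = []
    for year in range(entry, last + 1):
        event = 1 if adoption is not None and year == adoption else 0
        rows.append((year, event))
    return rows


ghana = country_spells(independence=1957, adoption=1969)
never = country_spells(independence=1900, adoption=None)
```

Each country contributes one row per year at risk, which is the layout that event-history software typically expects.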

We examine the effects of three sets of independent variables on the rate of 
population policy adoption. All variables are measured annually. The first set of 
independent variables represents the main national-level demographic variables 
that diffusion professionals focused on during the Cold War. Policy makers and 
scholars alike expect that these factors will have a strong positive effect on the rate 
of policy adoption: 


Total fertility rate was the most widely cited indicator in the literature on popula- 
tion growth during the period under study. This variable, drawn from the United 
Nations (2001) dataset on World Population Prospects, gives one data-point for 
every five years in each country. We also analyzed annual population growth, but 
its effects were not statistically significant and it covaried with total fertility rate. 


Population density, another factor frequently cited in the population-growth literature, 
is population (in thousands) per square kilometer of territory, drawn primarily from
the Penn World Table, Version 5.6 (described in Summers and Heston 1991) and the 
World Development Indicators datasets. 


A second set of control variables represents national-level economic, political and 
social variables that have been proposed by various scholars as potentially affecting 
the adoption of population policies: 


Economic development, using the standard indicator of logged real GDP per capita 
(in constant 1985 dollars), is drawn from the Penn World Table, Version 5.6. Later 
versions of the Penn World Table, as well as more recent datasets such as the World 
Bank’s World Development Indicators, offer no data for the 1950s and are therefore 
less valuable for present purposes. We maintain that low-income countries are more
apt to adopt population-control policies.


Religious tradition is one of the key concepts associated with domestic sources of 
opposition to population-control policies (Connelly 2008; Field 1983; Wasserman 
1983). The percentage of the population that is nominally Catholic or Muslim, 
circa 1970 (Center for the Study of Global Christianity 2005; see the analysis 
of this source in Hsu et al. 2008), examines the impact of the Vatican's opposi- 
tion to birth control (Keely 1994) and the controversies among Islamic scholars 
on the topic (Jones and Karim 2005; Winckler 2005). Thirty-three percent of
the Catholic-majority countries and 36 percent of Muslim-majority countries
adopted population-control policies by 1990, compared with 38 and 37 percent
of Catholic-minority and Muslim-minority countries, respectively. Despite this
small difference, we maintain that Catholic countries may be less likely to adopt
population-control policies.

Figure 3. Cumulative Number of Countries Adopting First National Population-Control
Policy, 1950-1990


Francophone heritage, a dummy variable, explores the possible effects of French 
or Belgian colonial control (Boute 1989), given these countries’ longstanding 
opposition to population control (André and Pereira-Roque 1974; Armengaud 
1975). Thirty-nine percent of Francophone countries adopted population-control 
policies by 1990, compared with 36 percent of other countries. 


Communism, a time-varying dummy variable for state socialist regimes, examines the 
effect of membership in, or alliance with, the Communist bloc countries, as indicated 
by attendance at major international events such as the celebration of the 50th an- 
niversary of the October Revolution in Moscow in 1967 (Conference of Communist 
and Workers’ Parties 1964; Spaulding 1983, 1998). Sixteen percent of state-socialist 
countries adopted population-control policies by 1990, compared with 43 percent 
of other countries. In keeping with communist objections to Western demographic 
reasoning, we do not expect these countries to embrace population-control policies. 

A third set of independent variables represents international diffusion mechanisms 
proposed for population policies (Lee and Walt 1995; Lee et al. 1995; Warwick 1982): 


World policy adoption, the annual number of countries adopting population poli- 
cies in the previous year, examines possible bandwagon effects. We also examined 
other global variables that mark off particular periods, such as the years after the 
United Nations Declaration on Population, which was ratified in 1966, or the 
years of the United Nations Plan of Action, which called in 1974 for countries to 
examine their demographic situation and adopt suitable policy. These variables 
were significant at the bivariate level but covaried strongly with other variables. 
We chose not to include a cumulative count as it was too highly correlated with 
our regional adoption variable. 


Regional policy adoption, the cumulative number of countries within each country’s 
region that adopted population policies up through the previous year, is included 
because recent work has noted regional effects for the diffusion of other govern- 
mental regimes (Brooks 2005; Wejnert 2005). The regions are defined for pres- 
ent purposes as East Asia, Europe and its settler colonies in North America and 
Australasia, Latin America, the Middle East, South Asia and Sub-Saharan Africa. 


International conferences, the cumulative number of years since 1952 that each 
independent country sent participants to international population conferences, 
is derived from published proceedings of the 17 most important international 
conferences on population during this period (Barrett PO) 


Population Council fellowships, the log of the cumulative number of fellowships 
in demography that the Population Council sponsored annually for each country 
outside of Europe, is coded from the Council’s annual reports. The Population 
Council, one of the most prominent bases for diffusion professionals in this field, 
began funding fellowships in demography in the 1950s in the hope of creating 
a critical mass of experts in countries around the world; by the early 1990s, it 
had subsidized scholars from 160 countries to study in prominent demography 
programs in the United States (Menken, Blanc and Lloyd 2002). The variable is 
logged to account for skew. 


USAID population funding, the log of the cumulative amount (in thousands of cur- 
rent U.S. dollars) received by each country outside of Europe for population projects 
from the United States Agency for International Development, is drawn from AID’s 
annual reports to the United States Congress. AID was not the only governmental 
agency sponsoring work in the population area, but complete data for other agencies 
are not available, and AID was arguably the most important single source (Warwick 
1982:45; see also Barrett and Tsui 1999). Assistance began in the late 1960s and 
grew to include 114 countries by the end of the Cold War, with the largest annual 
allocation being $44 million to Bangladesh in 1990. The variable is logged to ac- 
count for skew. AID funding may also be a vehicle of coercive pressure for diffusion 
because adoption of a population-control policy resulted in a country receiving an 
average of $1.7 million more in AID population assistance (Barrett and Tsui 1999). 
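
The lagged diffusion covariates defined above can be sketched as follows. This is a minimal illustration rather than the authors' procedure: the function names and the three-country toy input (adoption years taken from the text) are assumptions. The source does not say how zero counts were handled before logging, so the log(1 + x) form is likewise an assumption.

```python
import math
from typing import Dict, Iterable, Tuple


def diffusion_covariates(adoptions: Dict[str, Tuple[str, int]],
                         years: Iterable[int]):
    """Build lagged diffusion covariates (hypothetical helper).

    adoptions maps country -> (region, adoption year); years must be
    ascending and cover every adoption year. Returns two dicts:
      world_lagged[year]              adoptions worldwide in year - 1
      regional_cum_lagged[(reg, yr)]  adoptions in reg through yr - 1
    """
    years = list(years)
    per_year: Dict[int, int] = {}
    per_region_year: Dict[Tuple[str, int], int] = {}
    for region, year in adoptions.values():
        per_year[year] = per_year.get(year, 0) + 1
        key = (region, year)
        per_region_year[key] = per_region_year.get(key, 0) + 1

    # Annual count of adoptions, lagged by one year.
    world_lagged = {y: per_year.get(y - 1, 0) for y in years}

    # Cumulative count within each region, up through the previous year.
    regional_cum_lagged = {}
    for region in {r for r, _ in adoptions.values()}:
        running = 0
        for y in years:
            regional_cum_lagged[(region, y)] = running
            running += per_region_year.get((region, y), 0)
    return world_lagged, regional_cum_lagged


def log_cumulative(total: float) -> float:
    """Log transform to reduce skew; the +1 offset is an assumption."""
    return math.log(1.0 + total)


# Toy subset of adopters named in the text, not the full dataset.
toy = {
    "Pakistan": ("South Asia", 1960),
    "South Korea": ("East Asia", 1961),
    "Ghana": ("Sub-Saharan Africa", 1969),
}
world, regional = diffusion_covariates(toy, range(1950, 1991))
```

Lagging both counts by one year keeps a country's own adoption from counting as an influence on itself.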


Findings 


The results of the event-history analysis are summarized in Table 1. First, Model 1 
shows the demographic basis for population policy adoption. Both total fertility 
rate and population density are positively and significantly correlated with the 
likelihood of policy adoption. Model 2 explores the effects of national contex- 
tual variables. When these variables are added, the coefficient for total fertility 
rate loses statistical significance. Though not consistently statistically significant 
across all of our models, national income, Communism and Francophone heritage
sometimes decrease the rate of policy adoption (that is, their hazard ratios are less
than 1). The coefficients for Catholicism and Islam are not statistically significant
in any of our models.

Models 3-5 incorporate various sets of international diffusion mechanisms. 
These are displayed in three permutations because there is significant covariance 
among several of the international factors. All of these models display better fit 
than the national-level models, offering prima facie evidence that international 
diffusion is at work. All of the transnational factors have statistically significant 
effects on population policy adoption: world policy adoption, regional policy 
adoption, attendance at international conferences, Population Council fellow- 
ships and population funding from AID. 

In Model 3, for example, we find in particular that both the annual count of 
policy adoptions worldwide and the cumulative regional count positively and 
significantly affect the likelihood of policy adoption. With each additional policy 
adoption worldwide, the likelihood of policy adoption in the following year in- 
creases by more than 30 percent. Regional adoptions increase the likelihood of 
additional policy adoptions somewhat less. With each additional regional adop- 
tion, the likelihood increases by approximately 14 percent. 
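
The percentages quoted here follow from the exponentiated coefficients. The sketch below shows the conversion for the regional effect of approximately 14 percent reported in the text. The function name is hypothetical, and the compounding over several units assumes the proportional-hazards form of the model.

```python
def percent_change(hazard_ratio: float, units: int = 1) -> float:
    """Percent change in the adoption rate for `units` additional
    units of a covariate, given its exponentiated coefficient."""
    return (hazard_ratio ** units - 1.0) * 100.0


one_regional_adoption = percent_change(1.14)       # about 14 percent
three_regional_adoptions = percent_change(1.14, 3)  # about 48 percent
```

Because the effect is multiplicative, three additional regional adoptions raise the rate by about 48 percent, not 42 percent.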

World policy adoption and regional policy adoption do not, by themselves, 
demonstrate export-only diffusion. In both cases, one country's adoption might be 
due to the direct observation or influence of peer countries’ actions. In particular, 
the regional policy adoption variable might seem to run counter to the export- 
only model because it implies that late adopters were affected by their neighbors’ 
pattern of policy adoption. However, the significance of international conference 
attendance in Model 4 suggests that export-only diffusion operated alongside 
neighbor-to-neighbor diffusion. Export-only diffusion is further demonstrated by 
the significance of Population Council fellowships and AID funding in Model 5; 
each of these mechanisms involves a direct linkage between potential adopters and 
diffusion professionals in core countries, especially the United States, that did not 
themselves adopt population policies. In Model 4, which reports a strong positive 
effect of international conference attendance, increasing attendance by a single 
conference signals a better than 25 percent increase in the likelihood of policy 
adoption. In Model 5, which reports significant effects of aid and fellowships, 
every 1 percent increase in the amount of aid increases the likelihood of policy
adoption by approximately 13 percent, and every 1 percent increase in the number 
of population fellows increases the likelihood of policy adoption by 46 percent. 


Table 1: Adoption of Population Control Policies, 1950-1990

Independent Variables   Model 1   Model 2   Model 3   Model 4   Model 5

Demographic Conditions 

Total fertility rate 1.49*** 1.30 1.25 1.36 eS
(.15) (.19) (.19) (.22) (.21) 

Population density 13.48" 12.86" 13.74" 18.17* 13.92* 


(12.35) (12.88) (14.14) (20.62) (14.89) 
Domestic Context 


Log GDP per capita 62" .64 .61* 16 
(.14) (.15) (.15) (.20) 
Percent Catholic .54 .60 .84 .66
(.29) (.33) (.48) (.36) 
Percent Muslim 1.02 1.04 1.49 1.10 
(.61) (.62) (.91) (.68) 
Communist 42 .34* (Sor, ‘53 
(.19) (.16) (.16) (.26) 
Francophone 44 fog 00. ra 


International Diffusion 


World policy adoptions hoz, Woe 1.40*** 
(.09) (.09) (.10) 
Within region adoptions 12s 1.14" 
(.04) (.05) 
International conferences 13) 
Log USAID population funding ( : 7 
Log population council fellows (at) 
Log likelihood -200.44 -194.37 -182.10 -179.26 -178.42
Chi squared AOO7 te OU 2B mos. bas yn O0b2"* 1) 62.18" 
Number of observations 3,205 3,176 3,176 3,176 3,146 
Number of adoptions 47 47 47 47 47 
Note: Cox proportional hazard models, exponentiated coefficients with standard errors in 
parentheses. 


*p < .05  **p < .01  ***p < .001
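
As a reading aid for Table 1, the sketch below recovers an approximate z-statistic from an exponentiated coefficient and its standard error. It assumes the standard errors in parentheses are on the exponentiated scale (the delta-method convention) and uses a two-sided normal test. Neither assumption is stated in the source, and the function name is hypothetical. The example uses the total fertility rate entry from Model 1, 1.49 with a standard error of .15.

```python
import math
from typing import Tuple


def z_and_p(hazard_ratio: float, se_hazard_ratio: float) -> Tuple[float, float]:
    """Approximate z-statistic and two-sided p-value for a hazard ratio."""
    b = math.log(hazard_ratio)               # coefficient on the log scale
    se_b = se_hazard_ratio / hazard_ratio    # delta method, run in reverse
    z = b / se_b
    p = math.erfc(abs(z) / math.sqrt(2.0))   # two-sided normal p-value
    return z, p


z, p = z_and_p(1.49, 0.15)
```

Under these assumptions the z-statistic is a little under 4 and the p-value falls below .001, which agrees with the three asterisks on that entry.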
Conclusion 


Export-only diffusion is the separation of innovation from adoption: those who 
invent and promote a phenomenon do not themselves adopt it. This model has 
existed in the diffusion literature for decades, but is not frequently distinguished 
from the usual conceptualization of diffusion, according to which late adopters 
learn from influential early adopters. We propose that both processes exist in the 
world; our challenge in this study has been to demonstrate the importance of the 
neglected, export-only form. 

Export-only diffusion may take place at any scale, just like other forms of 
diffusion. At the individual scale, for example, product development and sales 
personnel may invent and promote products that they cannot afford to buy. At 
the institutional level, management consulting firms may recommend business 
practices that they are not themselves in a position to pursue. At the national level, 
states may encourage policies abroad that they do not consider appropriate for 
themselves. The case study in this article, the diffusion of population-control poli- 
cies during the Cold War, ranges across all of these scales. “Diffusion professionals” 
promoted these policies to countries they considered to be at risk of population
problems; these professionals, and the private foundations that sponsored them,
organized international associations to rationalize policy promotion; and states 
and international governmental organizations joined the campaign. 

Unlike most diffusion models, where network linkages are based on likeness, 
population policies diffused along channels of dissimilarity. In the categories of 
dissimilarity proposed in this research, potential adopters differed from policy 
promoters primarily in location. The cadre of diffusion professionals in Western 
countries did not consider their home societies to be appropriate contexts for 
population-control policies because they defined population problems in terms 
that excluded Western societies. Alternative definitions of population prob- 
lems—such as resource consumption—might have led to a different pattern of 
diffusion. Countries that objected to the diffusion professionals’ definitions and 
solutions—such as communist regimes and Francophone societies—were less likely 
to adopt population-control policies. The countries that did adopt such policies 
were different in key respects from the countries where the policies were devised 
and promoted. They were more likely to be poorer and more densely populated 
countries with relatively high fertility rates, as documented in Table 1, as compared 
with the countries exporting population policies. Forty-three of the 60 adopting 
countries had been decolonized since the middle of the 20th century, while the
exporting countries were colonizers. The one characteristic that adopters and pro- 
moters shared was their active participation in international population activities 
such as conferences, fellowships and aid programs, all of which are significantly 
associated with policy adoption. The countries that designed population policies 
explicitly used international activities such as these to promote them abroad, with 
heightened reflexivity exemplified by the 1988 United Nations Expert Group 
Meeting on International Transmission of Population Policy Experience. 

These international activities raise a question of causality: Did participation in 
such programs cause countries to adopt population policies, or did the intent to 
adopt a policy cause countries to participate in these programs? The former would be 
the most convenient argument to make for export-only diffusion, but the latter also 
works. Even if countries decided to participate because of a prior commitment to 
population control, owing to factors that are not included in this article's empirical 
models, the decision to participate suggests that they viewed export-type activities 
as the proper preparation for policy adoption. Mechanisms of diffusion such as 
conferences, fellowships and funding may be viewed as training for policy adoption 
and reflective of an underlying, unobserved initial desire for policy adoption. 

The export-only pattern is demonstrated in the statistical significance of mecha- 
nisms that linked adopters directly to diffusion professionals in non-adopting 
core countries: attendance at international conferences, Population Council fel- 
lowships and participation in USAID population programs. Moreover, there is 
qualitative evidence that even peer-to-peer diffusion, suggested by the significance 
of the regional adoption variable, was mediated by diffusion professionals in core 
countries. The example of Ghana illustrates this process. Ghana was the first coun- 
try in Sub-Saharan Africa to sign the World Leaders’ Declaration on Population 
in 1967. The country undertook a National Demographic Sample Survey in 1968 
and instituted a population-control policy in 1969 (Ghana 1969; United Nations 
1979). Ghana’s population policy noted at length the international factors that 
justified action at the national level (Ghana 1969). This information was not the 
product of interactions between Ghana and other peers, but was drawn from 
research produced and disseminated at the core (possibly drawn from Bogue 
1968). Ghana then hosted the second African Regional Population Conference in 
1971, which was proposed and organized by diffusion professionals and funded 
by core-country governments and foundations seeking to influence public policy 
in non-core countries (Cantrelle et al. 1974). The conference included a dramatic 
impromptu session to air grievances by a group of leftist participants who objected 
to the “imported and ill-adapted concepts” of Western demography and “the bombardment
of data to which we are helplessly submitted.” (Cantrelle et al. 1974)
The crowded session concluded with resolutions acknowledging the complexity of 
the African context and the need for economic development measures in addition 
to family planning—and re-confirming the United Nations statement on the need 
for population control in poor countries (Cantrelle et al. 1974). In the following 
four years, four African countries adopted population policies. 

As this example suggests, export-only diffusion often tries to hide or transcend 
its export origins. That is, diffusion professionals may try to leverage their efforts 
by increasing the influence of early adopters on potential later adopters. To the 
extent that this influence increases and the proportion of adopters at the heart of
the diffusion network increases, the diffusion pattern becomes less export-only.
This research has not measured influence directly, and therefore cannot map the 
network of diffusion explicitly, but future studies might be able to demonstrate 
the prevalence and trajectory of export-only diffusion. 


Note 


1. The baseline hazards for our full models are unevenly curvilinear over time, suggesting 
that the Cox specification, which makes no assumptions about the hazard rate, is the 
most appropriate form. Only one of our independent variables (percent Catholic) 
violates the proportional hazards assumption of the Cox model, and the results 
are similar with this variable removed. The findings are robust with exponential, 
Gompertz and Weibull distributions as well.


Appendix. Summary Statistics


Variable                          Observations    Mean      SD     Min     Max
Population policy adoption            4487         .01      At       0       1
Total fertility rate                  4021        5.00    2.05    1.18    8.50
Population density                    3455         .07     .12    .001    2.82
Log GDP per capita                    3442        7.80    1.06    5.40   10.43
Percent Catholic                      4487         .34     .38       0     .99
Percent Muslim                        4487          et     230       0    1.00
Communist                             4487         .22     .42       0       1
Francophone                           4487          a)     .34       0       1
World policy adoptions                4487       23.17   19.05       0      59
Within region adoptions               4487           |     .66       0       5
International conferences             4487        1.96    2.50       0      15
Log USAID population funding          4487          fe    L295       0   10.09
Log population council fellows        4487          ie     .65       0    4.26
Social Change and Anomie: A Cross-National Study


Ruohui Zhao, University of Macau 
Liqun Cao, University of Ontario Institute of Technology 


We apply Durkheim’s social transitional theory to explain the variation of anomie 
in 30 nations in the world. Combining data from two sources—the 1995 World Val- 
ues Survey and the United Nations University’s World Income Inequality Database or 
WIID—we test the hypothesis that rapid sociopolitical change at the structural level 
disrupts social integration and regulation, and increases the level of anomie among 
individuals in a society. Using the multilevel approach that permits the decomposi- 
tion of variance within and between nations, the results of the analyses confirm that 
rapid sociopolitical change at the macro level, such as the political transition from 
totalitarianism to democracy, produces a higher level of anomie among individuals in 
a society. In addition, we find a cross-level effect of confidence in authority on anomie. 
Findings at the individual level are largely consistent with Merton's theory of anomie 
and with the extant literature that anomie is inversely related to an individual's social 
and economic position in a society. 





One of the major changes that happened in the last quarter of the 20th century 
was the third wave of democratization (Huntington 1991). The high point of 
this wave occurred when the Soviet Empire collapsed in the early 1990s. Eastern 
European nations broke away from the former Soviet Union’s dominance and/or
control and became democracies. By the mid-1990s, the number of democratic 
nations had surpassed the number of non-democratic societies in the world for 
the first time in human history (LaFree 2007). What effects would such political 
change have on citizens of these societies? The current article attempts to evaluate 
one of the consequences of democratic transition: the spread of anomie. 
Anomie is one of the central concepts of sociology. Since Emile Durkheim, 
anomie theory has experienced a series of developmental stages from its dormant 
period (1940s) to a golden age (1950s and 1960s), then a decline (1970s and 
1980s) and a revival (from the late 1980s to the present) (Cullen and Agnew 2003).
Numerous studies, both qualitative and quantitative, have been conducted to 
examine the theory as a whole. Normally, anomie is treated as an independent 
variable in these studies. We argue, however, that anomie itself needs explana- 
tion. There is a reason why Merton (1938) entitled his article “Social Structure 
and Anomie,” not “Social Structure and Crime.” The current study, therefore, 
examines anomie as a dependent variable and advances a linkage between rapid
sociopolitical change and anomie. We hypothesize that rapid sociopolitical
change, such as democratic transition, is the structural factor that exerts a pow- 
erful influence on anomie of individuals. We test this hypothesis with data from 
both micro and macro levels. 


Theoretical Background 


Durkheim (1984[1893], 1966[1897]) advances his theory of social transition,
in which he argues that social order is maintained through social integration and
regulations in a social equilibrium. All nations develop normative behavior pat- 
terns and belief systems in the evolutionary change process. During the transi- 
tional period, such as the transition from a rural society to an urban society, the 
diffusion of new norms and values disrupts the equilibrium of traditional societies 
and breaks down sacred-religious institutions, traditional beliefs and ascribed 
status relationships. The new organic social relationships enable individuals to 
challenge old cultural values and social rules, resulting in the rapid increase of
anomie. Durkheim is specifically concerned with the social transition from a rural
society with mechanical solidarity to an urban society with organic solidarity and
the spread of anomie during this unique period. 

Durkheim did not study the democratic transition, but his theory can be read- 
ily applied to explain anomie during this situation. Desirable as it may be, the 
process of democratization is never smooth (Fukuyama 1992; Huntington 1991) 
because it polarizes the polity, brings out the historical wounds of ethnic conflicts, 
and provokes civil strife. In addition, the economic wellbeing of a nation deterio- 
rates, to different degrees, during the transition (Duch 1995; King and Sznajder 
2006). The transition from a communist authoritarian system to a democracy 
creates a temporary disequilibrium when new values and norms come into contact 
with and/or disrupt older social patterns, weakening formal as well as informal 
social control. Transitional societies, therefore, are associated with growing cultural 
heterogeneity, or the state of anomie. 

Under the communist authoritarian regime, the social order was maintained 
through draconian measures, and the legal system was built on fear, not on con- 
sensus. With its demise, people have become the masters of their fate, and have, 
for the first time, really enjoyed freedom of choice. Ideally, a liberal democracy 
is defined by “the extent to which a political system allows political liberties and 
democratic rule.” (Bollen 1993:1208) The enforcement of contracts, regulations, 
protection of freedoms, and sole claim on the legitimate use of violence allow 
the state to protect its citizens and guard their freedoms. Via these functions, 
the state is able to create the space for each citizen to live a “good” life. In reality,
however, ethnic intolerance and political conflicts have resurfaced
in Eastern European nations, and a new social order based on the principles
of civil society has not yet been built to replace the old mechanisms of social
control. As a result, confusion regarding particular or specific norms is generated
by the abundance of new rules and new ways of doing things. Durkheim calls this
normlessness, or confusion of rules, anomie.

Robert Merton in 1938 borrowed the concept of anomie in his theory of social 
order. He intends to explain why some societies exhibit higher levels of anomie 
than do others, and he poses his explanation of anomie in terms of social structure. 
He is more specific about the nature of anomie as a product of utilitarian behavior 
(Cullen 1984). He proposes that anomie is a product of the imbalance between the 
two fundamental components of society—cultural structure and social structure. 
In a democratic society like the United States, the cultural barrier for upward 
mobility is removed, but the structural barrier for upward mobility for people in 
poverty remains. People at the bottom of the class structure, therefore, are more 
likely to suffer from strain and anomie because the new culture of unlimited economic
success pressures them to take the most efficient means to achieve
monetary success. Three themes in the theory are distinctively Merton’s own:
democracy is a precondition for the prevalence of anomie (Cao 2004); anomie is 
not natural, but it is socially induced (Cullen 1984); and inequality is a source of 
anomie. Poverty per se is not a cause of strain and deviance. Merton (1938:681) 
states specifically that “poverty is less highly correlated with crime in southeastern 
Europe than in the United States” because the probabilities of vertical mobility
in those undemocratic societies “would seem to be fewer than in this country.” 

Anomie, therefore, is a byproduct of rapid social change (Durkheim 1897) and 
the adaptive response of an individual in an open stratification system (Merton 
1938). Previous tests of the theory, in general, regard anomie as an independent 
variable predicting such other deviant behavior as suicide and, later, crime and 
deviance (Bjarnason, Thorlindsson, Sigfusdottir and Welch 2005; Clinard 1964; 
Durkheim 1897; Merton 1938; Thorlindsson and Bernburg 2004). We hypothesize
that rapid sociopolitical change, such as the transition from authoritarian
regimes to democracy, and social inequality are both important sources of anomie.
We capitalize here on a multilevel study of 38,845 respondents from 30 nations in 
order to highlight sociodemographic and national sources of variation in anomie. 

Despite anomie’s popularity and its importance in sociology, its meaning is 
very elusive and varies depending on its different developmental stages and on 
elaborations by different theorists. Durkheim (1893, 1897) uses the concept, 
but does not define it. Later theorists (Clinard 1964; Coser 1971; Hilbert 1989) 
draw basic ideas about anomie from his work. They generally agree that anomie 
for Durkheim refers to “normlessness.” The concept is broad and includes pow- 
erlessness, alienation and confusion regarding rules (Fischer 1973; Martin 2000; 
McClosky and Shaar 1965; Thorlindsson and Bernburg 2004). 

Although Merton writes often about anomie, he never provides an explicit 
definition of this concept either (Messner 1988). In addition, his usage of this 
concept is inconsistent (Featherstone and Deflem 2003). For example, Levine 
(1985) finds that Merton employs at least 10 definitions of anomie from 1938
to 1964 and fails to clarify the ambiguities of anomie in his own work. One of
the clearer definitions of anomie from Merton appears in his 1964 article, in
which he (1964:227) states, “In a word, the degree of anomie in a social system
is indicated by the extent to which there is a lack of consensus on norms judged 
to be legitimate, with its attendant uncertainty and insecurity in social relations.” 

A detailed contrast between Durkheim’s and Merton's conceptualizations 
of anomie is summarized in Table 1. Although there are differences between 
Durkheim and Merton in their conceptualization of anomie, there also seems 
to be enough convergence between the two scholars in its core meaning. That is, 
anomie refers to normlessness. 


Empirical Tests of Anomie Theory 


Through decades of evolution, the concept of anomie has been expanded, span- 
ning both the macro and micro levels. At the macro-societal level, anomie is 
described as a breakdown of social norms regulating individual behavior and
social interaction. It is a social structural phenomenon of society in which norms
are rapidly changing or where there are conflicting norms of what is right, good,
proper and so on. Thus anomie is a property, not of the individual, but of the social 
system (Bernard 1987). Although most scholars agree that anomie is a systemic 
condition and phenomenon, they have not been able to identify a measure to cap- 
ture the concept at the macro-societal level. After so many years, no agreed-upon 
measure of anomie at the macro-level has emerged while there are many attempts 
to create a measure of anomie at the individual level. 

Most previous scales of anomie capture a broad sociological meaning of anomie 
rather than a narrow meaning in the Mertonian sense (Cao 2004). For example, 
Thorlindsson and Bernburg (2004) advance two measures of anomie for juveniles. 
One emphasizes the rule-breaking dimension of anomie among adolescents, and 
the other emphasizes the meaninglessness dimension of anomie. Both measures 
represent more of Durkheim's conceptual scheme than Merton's. Menard’s (1995) 
operationalization of anomie is also specifically designed for juveniles, but his 
measure represents more of Merton’s conceptual scheme than Durkheim’s because 
it emphasizes the goals-means disjuncture. 

For the general population, Sampson and Bartusch (1998) proposed an index of 
anomie (legal cynicism). This measure focuses on the respondents’ agreement with 
five items that tap issues related to laws and their violation. Although variation in 
respondents’ ratification of acting “outside” of law and social norms is tapped,
Merton's original sense that anomie is the discrepancy between the goal and the 
means and that anomie is the product of utilitarian behavior is not obvious. 

Cao (2004) offered another index of anomie based on data from the World
Values Survey. This is a six-item index of acceptance of the legitimacy of instrumental
criminal/deviant scenarios. The six scenarios include profit-motivated
behavior regarding claiming government benefits, avoiding a fare on public transport,
cheating on tax, buying stolen goods, accepting a bribe and failing to report
property damage. This index emphasizes the normlessness dimension of anomie 
and better represents Merton’s original conceptualization in three ways. First, the 
measure reflects Merton’s point that a lack of consensus on norms judged to be 
legitimate is anomie, and it captures Merton’s argument that anomie is the extent 
of delegitimation regarding “particular” or “specific” norms resulting from the 
rules and legal codes of a society. Second, the index covers instrumental
crimes ranging from white-collar to street crimes. Third, the index is consistent
with Merton’s (1938:675) belief that the rules are known to the rule violators, but 
“the emotional supports of these rules are largely vitiated by cultural exaggeration 
of the success-goal.” It captures the extent of the individual’s readiness to disregard 
the rules or to use the technically most efficient and available means for monetary 
success. One limitation of this measure is that it focuses on hypothetical rule 
breaking or cheating. 

The current study intends to explore this definition of anomie among individuals
across the world. We argue that anomie itself needs explanation. This idea is not
entirely new. Menard (1995) used his measure of anomie as an intervening variable
in his study. Sampson and Bartusch (1998) conducted the best study of anomie 
(legal cynicism) using Chicago neighborhoods data with multilevel analysis. Cao 
(2004) treated anomie as the dependent variable in his cross-national study. Other 
scholars (Baumer 2007; Martin 2000; Thorlindsson and Bernburg 2004) argue 
that a full understanding of anomie must come from multiple levels. In this
study, we intend to extend Sampson and Bartusch’s (1998) effort and examine 
anomie and its determinants with cross-national data. We are particularly inter- 
ested in the social structural effect, such as rapid social changes, on anomie. Stack, 
Cao and Adamczyk (2007) argue that international research has the advantage of 
testing the generalizability of theories developed in one society. 

Previous cross-national studies (Form 1975; Cao 2004) focus primarily on indi- 
vidual and demographic attributes rather than situating anomie in a larger structural 
context. Form’s study examines workers’ anomie in four nations with cross-tabular
tables. Cao’s (2004) regression analysis finds that older, female, employed
and married respondents have less anomie than younger, male, unemployed and
unmarried respondents. With the exception of gender, Cao proposes that these characteristics
represent the social and economic position of an individual in a society. This inter- 
pretation is consistent with Agnew’s (1997) finding that individuals with a lower 
level of social integration are more likely to experience feelings of normlessness and 
meaninglessness. For gender differentiation, studies have found that females have 
stronger bonds to their families and social institutions, and these attachments lead 
to lower rates of female offending (Jensen and Eve 1976; Paternoster 1989), and by
extension, lower levels of anomie. Other studies have found similar effects of age,
gender, employment and marriage (Sampson and Bartusch 1998). Building on this
literature, the current study tests these effects again.

In addition, we include a few new variables in our model. They are family savings,
subjective social class and confidence in authority. Family savings and social
class are two different aspects of socio-economic status of individuals in a society. 
Confidence in authority captures the public’s general attitudes toward the legiti- 
macy as well as the effectiveness of a regime (Cao and Hou 2001). From different 
dimensions, these variables all tap various aspects of socio-economic status of an 
individual in a society. It is expected that those with social and political power
are likely to have lower levels of anomie.

In the study of anomie, Sampson and Bartusch (1998) and Thorlindsson and 
Bernburg (2004) examine both individual and contextual effects on anomie within
a single society. The unit of analysis for Sampson and Bartusch is the neighborhood
in Chicago and for Thorlindsson and Bernburg the school district in Iceland. Both
have found significant effects of the structural variables on anomie. Thorlindsson 
and Bernburg (2004) find that community political integration influences ano- 
mie of juveniles while Sampson and Bartusch (1998) show structural effects of 
concentrated disadvantage on legal cynicism, their measure of anomie. Sampson
and Bartusch’s (1998) research results support Merton's argument that anomie is 
related to social stratification in the United States. Their model, however, has not 
been extended to include cross-national data. 


Rapid Sociopolitical Change 


In this article, rapid sociopolitical change is captured by the variable called nations 
of Eastern Europe under democratic transition. We understand that there are 
other nations that also changed into democratic regimes during the third wave of 
democratization (Huntington 1991), such as the Philippines, Brazil and Chile, 
but these nations are different from the nations in Eastern Europe. We are particu- 
larly interested in the unique experiences of democratization of Eastern Europe 
for several reasons. First, Eastern Europe is a product of the Cold War (Fukuyama 
1992). Many nations did not choose to be communist states, but were forced to 
be Soviet satellite states or Soviet states within the Soviet Union. Given that the 
West had won the economic and ideological war over the forces of communism, 
it is interesting to see the effect of change in these societies. Second, these societies 
had been under the communist regimes for almost half a century (Bunce 1995). 
In contrast, many societies in Latin America, such as Brazil and Chile, have a 
history of alternation between authoritarian and democratic rule. Third, these
communist states developed a far more penetrative party system than the autocratic
regimes based on individual charisma in Asia and Latin America. Fourth, Eastern
European societies are industrialized, and their general educational level is significantly
higher than that in Latin American societies. Fifth, there is an absence of
military intervention in Eastern European history. Sixth, the course of history
changed in these societies in the short period between 1989 and 1991, and
social stratification increased in all of them, to different degrees (see Appendix).
These conditions make Eastern Europe more suitable for study as a group or bloc
than nations elsewhere. We follow the practice of the comparative literature and 
treat these nations as a group (Bozoki 2003; Duch 1995; Hanley 2003; King and 
Sznajder 2006), fully aware of many economic, historical, linguistic, political and 
social differences among them (see Walder 2003 for a detailed discussion of these 
differences and their consequences). 

In addition to the rapid sociopolitical change, we measure inequality. To dif- 
ferent degrees, all nations under transition have experienced enlarged inequality 
(see Appendix). As mentioned, Merton (1938) and many others (Blau and Blau 
1982; Messner 1989; Sampson and Bartusch 1998) argue that greater economic 
inequalities foster conflict and spawn anomie. 


The Research Hypotheses 


Based on the theoretical deduction of anomie theory and the previous studies, the 
current research attempts to test three sets of hypotheses. These research hypotheses
include variables at both individual and national levels.

First, at the individual level, we hypothesize that anomie is inversely related to 
the social and economic position of the individual in a society. Respondents who 
are older, married, employed, better educated, from the middle or upper class, 
and have more family savings and more confidence in authority will experience 
a lower level of anomie than those who are younger, single, unemployed, less 
well-educated, from the lower class, have less family savings and less confidence 
in authority. Females have a lower level of anomie than males. 

Second, at the national level, individuals who live in nations under democratic 
transition will experience a higher level of anomie than those who live in nations 
with relatively stable social conditions. 

Third, at the national level, individuals who live in nations with greater eco- 
nomic inequality will have a higher level of anomie than those living in societies 
with smaller levels of economic inequality. 


Methods 


The Sample 


The data for the current analyses are from two sources. Data for all individual-level
variables are from the 1995 World Values Survey. Data for the Gini index for each
nation in 1995 or closest to 1995 are taken from the United Nations University’s
World Income Inequality Database or WIID (United Nations University 2005).
The individual-level data source, World Values Survey, has been utilized 
in numerous publications in more than 20 languages (World Values Survey 
Organization 2006). The survey is designed to facilitate cross-national and 
cross-cultural comparisons of a wide range of basic values and beliefs about
public, political and social life across the globe. Each national questionnaire is
a translation of a standard questionnaire, which is in English. The questionnaires
are administered face-to-face to about 1,000 to 3,500 adult interviewees.
The samples are selected to follow random sampling methods as closely as
possible. The current study used the 1995 dataset, which was designed to give
special attention to obtaining better coverage of non-Western societies (World 
Values Survey Organization 2006). 

After reviewing the dataset for missing values of the dependent and inde- 
pendent variables, 30 out of 53 nations/regions are selected for further analysis 
(see Table 2). Although the selection is not random, all continents except Africa 
are represented: four nations are from Asia, 16 from Europe, three from North 
America, six from South America, and one from Oceania (see Table 2). Missing
values for all the independent variables are below 4 percent, with the exception
of the variable “social class,” which has missing data of up to 13.5 percent
for Sweden.¹ Cases with missing data are excluded from the analysis. The final analyses
contain 38,845 cases from 30 nations, with each country ranging from 417 cases 
(Dominican Republic) to 2,092 cases (Belarus). 


The Dependent Variable 


The dependent variable, anomie, is measured as an index of the individual’s accep- 
tance of the legitimacy of five instrumental crime-related scenarios. Respondents 
in each nation are asked to answer the questions as follows: Please tell me for each 
of the following statements whether you think it can always be justified, never 
be justified, or something in between: a) claiming government benefits which 
you are not entitled to; b) avoiding a fare on public transport; c) cheating on tax 
if you have the chance; d) buying something you knew was stolen; e) someone 
accepting a bribe in the course of their duties. The responses range from 1 (never 
justifiable) to 10 (always justifiable). The index of anomie is calculated as the sum 
of the scores of these five items minus 4 so that it ranges from 1 to 46. A higher 
score indicates a higher level of anomie. The reliability alpha equals .75. The
missing data for these items range from 1.9 percent (buy stolen goods) to 4.0 
percent (claim benefits). They are excluded from the analyses. The index’s skewness 
and kurtosis are both less than 2.2. A Q-Q plot does not show a serious skewness 
of the anomie index either. 
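
The index construction described above can be sketched in Python. This is an illustrative sketch rather than the authors' code: the function names `anomie_index` and `cronbach_alpha` and the example responses are hypothetical, and the reliability calculation assumes that the reported alpha is the standard Cronbach's alpha.

```python
from statistics import variance


def anomie_index(items):
    """Sum five 1-10 justifiability scores and subtract 4, giving a 1-46 index.

    Returns None when any item is missing, mirroring the exclusion of
    respondents with missing data.
    """
    if len(items) != 5:
        raise ValueError("the index uses exactly five items")
    if any(score is None for score in items):
        return None
    if any(not 1 <= score <= 10 for score in items):
        raise ValueError("each item must lie between 1 and 10")
    return sum(items) - 4


def cronbach_alpha(rows):
    """Standard Cronbach's alpha for complete rows (one row per respondent)."""
    k = len(rows[0])
    item_variances = [variance([row[j] for row in rows]) for j in range(k)]
    total_variance = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical responses to items a) through e); not data from the survey.
responses = [
    [1, 1, 1, 1, 1],
    [3, 2, 4, 1, 1],
    [10, 10, 10, 10, 10],
    [5, 6, 4, 2, 3],
]
scores = [anomie_index(row) for row in responses]
alpha = cronbach_alpha(responses)
```

A respondent who answers "never justifiable" to every item scores 1, and one who answers "always justifiable" to every item scores 46, which matches the stated range.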

This measure was first labeled as anomie by Cao (2004), but variations of it are
called “materially self-interested attitudes” by Halpern (2001), using four items
(c, d, e, and “lying in your own interest”), and “attachment to the law” by Jensen
(2002), using a six-item index (“keeping money you have found,” “lying in your
own interest,” a, b, c and d). Cao’s index has one more item that is not in the current
index: failing to report damage you’ve done accidentally to a parked vehicle.
Because data on this item are lacking in many nations, we had to drop it.


The Independent Variables


The independent variables are grouped into two levels: individual and national 
levels. The eight variables at the individual level are gender, age, marital status, 
family savings, employment status, education, self-defined social class and confi- 
dence in authority. Gender is a dummy variable where female = 0 and male = 1. 
Age is coded into six groups, from the youngest group (1 = 18-24) to the oldest 
group (6 = older than 64). Marital status is coded with values of 1 for respondents
who are married (including those living together and the widowed) and 0 otherwise
(single, separated, and divorced). Family savings in the past year is an ordinal 
variable with four groups: one representing “spent savings and borrowed money” 
and four representing “saved money.” Employment status becomes two dummy 
variables: employment 1, where the employed is coded as 1, and employment 
2, where students and housewives are coded as 1, and others as 0 (the reference 
group). Education is coded into five categories with one representing “no formal 
education” and five representing “university-level education.” Self-defined social
class asks people to rank themselves from 1 = lower class, 2 = working class, 3 =
lower-middle class, 4 = upper-middle class, to 5 = upper class. Finally, confidence
in authority is an index consisting of three items: confidence in the legal system, in 
the police, and in the government. The responses to these items range from 1 (not 
at all) to 4 (a great deal). We add the responses of these three items and divide the 
final results by 3 to return to the original scale. The reliability alpha is .70. These 
variables, to a different extent, tap the current social and economic position of an 
individual in a society. It is expected that those with a higher position in a society 
will have a lower level of anomie. 
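
The coding rules above can be illustrated with a short Python sketch. The function names and the string labels for statuses (such as "employed" or "widowed") are hypothetical stand-ins for the survey's response codes, and returning None for an incomplete confidence response is an assumption based on the exclusion of missing data.

```python
def confidence_in_authority(legal_system, police, government):
    """Average three 1-4 confidence items so the index stays on the 1-4 scale."""
    items = (legal_system, police, government)
    if any(item is None for item in items):
        return None  # assumption: incomplete responses are dropped
    if any(not 1 <= item <= 4 for item in items):
        raise ValueError("each item must lie between 1 and 4")
    return sum(items) / 3


def employment_dummies(status):
    """Return (employment 1, employment 2); other statuses form the reference group."""
    employed = 1 if status == "employed" else 0
    student_or_housewife = 1 if status in {"student", "housewife"} else 0
    return employed, student_or_housewife


def married_dummy(status):
    """1 for married, living together or widowed; 0 for single, separated or divorced."""
    return 1 if status in {"married", "living together", "widowed"} else 0
```

Under this coding, a respondent who is neither employed nor a student or housewife receives (0, 0) on the two employment dummies and so belongs to the reference group.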

At the national level, there are two variables. They are nations of Eastern Europe 
in democratic transition, and the Gini index. Nations in transition is a binary 
variable with 1 representing nations in Eastern Europe and 0 representing all other 
nations in the sample. In the present study, 13 out of 30 nations are from Eastern 
Europe (see Table 2). 

The Gini index is a measure of economic inequality in a nation. It ranges from 
0 to 100 with 0 representing perfect equality and 100 representing extreme or 
perfect inequality. There are no missing values at the national level. The hierarchical
linear modeling technique was used for the current study.²
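
For readers unfamiliar with the Gini index, the following sketch computes it on the 0-100 scale from a list of incomes using the common sorted-rank formula. The function name and the toy income lists are assumptions for illustration; the study itself takes published Gini values from the World Income Inequality Database rather than computing them.

```python
def gini_index(incomes):
    """Gini index on a 0-100 scale, computed with the sorted-rank formula."""
    values = sorted(incomes)
    n = len(values)
    total = sum(values)
    if n == 0 or total == 0:
        raise ValueError("need at least one positive income")
    # Weight each income by its rank (1 = poorest).
    weighted = sum(rank * x for rank, x in enumerate(values, start=1))
    return 100 * (2 * weighted / (n * total) - (n + 1) / n)


equal = gini_index([5, 5, 5, 5])      # everyone has the same income
unequal = gini_index([0, 0, 0, 10])   # one person holds all income
```

With a finite number of units the maximum is 100 × (n − 1)/n, so the value of 100 for perfect inequality is approached only as the population grows.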


Procedures in Conducting the HLM Analysis 


This study utilizes HLM 6 software to estimate the parameters for anomie. We 
ran four models. First, Model 1 is the base model with only the dependent vari- 
able, anomie (also known as “one-way ANOVA with random effects” model, see 
Raudenbush and Bryk 2002). This model is created to obtain the intra-class cor- 
relation coefficient, which measures the proportion of variance in the dependent 
variable that is accounted for by level-2 units. The ICC, as obtained from Model
1, is .1038. Therefore, nation characteristics (the level-2 units) account for 10.38
percent of the variability in anomie among individuals, which suggests that a 
multilevel model incorporating nation and national characteristics is useful. The 
reliability estimate, which is close to 1, indicates that the sample mean is a reliable 
measure of the true nation mean for anomie. 
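
The intra-class correlation is the share of total variance that lies between level-2 units. A minimal sketch follows; the two variance components are hypothetical values chosen only so that the ratio matches the reported .1038, since the article does not list the components themselves.

```python
def intraclass_correlation(between_variance, within_variance):
    """ICC = between-nation variance / (between-nation + within-nation variance)."""
    return between_variance / (between_variance + within_variance)


# Hypothetical variance components (not reported in the article).
TAU_00 = 5.19      # assumed between-nation variance of the intercept
SIGMA_SQ = 44.81   # assumed within-nation (individual-level) variance

icc = intraclass_correlation(TAU_00, SIGMA_SQ)
```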

Next, Model 2 with all variables at the individual level is created. This level-1 
model shows that anomie is the function of individual level explanatory variables. 
In order to make the elaboration of the results easier, all the interval level independent
variables are centered³ on their grand means⁴ while the binary independent
variables are not centered.⁵
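
Grand-mean centering simply subtracts the overall sample mean from every observation, so that a value of zero corresponds to the average respondent. The sketch below assumes a toy list of age-group codes; the function name is hypothetical.

```python
from statistics import mean


def grand_mean_center(values):
    """Subtract the overall (grand) mean from every observation."""
    grand_mean = mean(values)
    return [value - grand_mean for value in values]


# Toy example: one respondent in each of the six age groups.
age_groups = [1, 2, 3, 4, 5, 6]
centered_age = grand_mean_center(age_groups)
```

After centering, the model intercept is the expected anomie score for a respondent at the grand mean of the centered predictors and in the reference category of the uncentered binary predictors.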

Then, in Model 3 individual and national level variables are included simulta- 
neously in a single multivariate model. Combining both the level-1 and level-2 
equations produces a composite model: 


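$$
\begin{aligned}
\text{Anomie}_{ij} = {} & \gamma_{00} + \gamma_{01}(\text{Nation})_{j} + \gamma_{02}(\text{Gini})_{j} + \gamma_{10}(\text{Gender})_{ij} + \gamma_{20}(\text{Age})_{ij} \\
& + \gamma_{30}(\text{Marital Status})_{ij} + \gamma_{40}(\text{Family Savings})_{ij} + \gamma_{50}(\text{Employment Status})_{ij} \\
& + \gamma_{60}(\text{Education})_{ij} + \gamma_{70}(\text{Social Class})_{ij} + \gamma_{80}(\text{Confidence in Authority})_{ij} \\
& + u_{0j} + r_{ij}
\end{aligned}
$$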


This composite model shows which part of the model is composed of fixed effects
(the γs) and which part is composed of random effects (u and r) (Luke 2004).
Overall, the model tests not only whether the individual level variables have an 
effect on the anomie levels in individuals, but also whether the national level 
variables influence individuals’ anomie. 

Finally, we ran additional analyses to explore the cross-level effects of the indi- 
vidual and national level variables on anomie. Model 4 is created for this purpose. 


Results 


The mean level of anomie for each nation in the sample is presented in Table 
2. The results show that a certain level of anomie existed in every one of the 30 
nations, which is in line with Merton’s statement regarding the prevalence of 
anomie in modern societies. The grand mean of the anomie scores is 7.59. Among 
the 30 nations, Croatia has the highest mean of anomie (12.29), followed by 
Philippines (11.80) and Brazil (11.21). Nations which have the lowest anomie 
levels are Uruguay (4.64), India (4.01), and the United States (3.77). Ten out of 
13 nations with an anomie score higher than the average (7.59) are the nations 
under democratic transition. 

The parameter estimates and the R squares for each model are reported in Table 
3. Instead of interpreting the R square as a simple percentage of variance accounted 
for, HLM interprets the R square as the proportional reduction of prediction 
error (Luke 2004). There are four models in Table 3. Compared with Model 1, 
Model 2 diminishes prediction error in anomie by 6.22 percent at the individual 
Table 2: The Mean Levels of Anomie among 30 Nations in 1995

Nation/Region              Continent        Anomie Level   Number of Cases
 1. Armenia*               Asia             11.19          2000
 2. Azerbaijan*            Asia              9.67          2002
 3. India                  Asia              4.01          2040
 4. Philippines            Asia             11.80          1200
 5. Belarus*               Europe           10.23          2092
 6. Bosnia Herzegovina*    Europe           [illegible]    1200
 7. Bulgaria*              Europe            6.08          1072
 8. Croatia*               Europe           12.29          1196
 9. Estonia*               Europe           [illegible]    1021
10. Finland                Europe            6.06           987
11. Latvia*                Europe           10.71          1200
12. Lithuania*             Europe            8.29          1009
13. Macedonia*             Europe            6.91           995
14. Moldova*               Europe           10.85           984
15. Norway                 Europe            4.98          1127
16. Russia*                Europe/Asia       8.46          2040
17. Slovenia*              Europe            8.79          1007
18. Spain                  Europe            4.76          1211
19. Sweden                 Europe            7.06          1009
20. Switzerland            Europe            6.47          1212
21. Mexico                 North America    10.13          1510
22. Puerto Rico            North America     4.99          1164
23. USA                    North America     3.77          1542
24. Australia              Oceania           4.78          2048
25. Brazil                 South America    11.21          1149
26. Chile                  South America    [illegible]    1000
27. Dominican Republic     South America    [illegible]     417
28. Peru                   South America     7.74          1211
29. Uruguay                South America     4.64          1000
30. Venezuela              South America     5.73          1200

                                            Mean = 7.59    Total = 38,845
*Nations of Eastern Europe in democratic transition





level and 10.63 percent at the national level. Model 3, which incorporates two 
national level variables, reduces prediction error in anomie by 6.22 percent at the 
individual level and 37.74 percent at the national level. Model 4, the best fitting 
model incorporating cross-level effect, decreases prediction error by 6.36 percent 
at the individual level and 39.6 percent at the national level. Although the increase 
in the proportional reduction of prediction error at both levels may look small in 
number compared to that in Model 3, the hypothesis testing of the deviances for 
the model fit confirms that Model 4 is significantly better in explaining anomie 
among individuals (χ² = 50.836, df = 1, p < .001).
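
Two calculations in this paragraph can be sketched briefly: the proportional reduction of prediction error and the deviance (likelihood-ratio) test with one degree of freedom. The variance figures below are hypothetical values chosen to reproduce a 6.22 percent reduction; only the test statistic 50.836 comes from the article. The p-value uses the identity that a chi-square variable with 1 df is a squared standard normal.

```python
import math


def proportional_reduction(null_variance, model_variance):
    """Share of the base-model variance component removed by the fuller model."""
    return (null_variance - model_variance) / null_variance


def chi_square_p_value_df1(statistic):
    """Upper-tail probability of a chi-square statistic with 1 degree of freedom."""
    return math.erfc(math.sqrt(statistic / 2))


# Hypothetical level-1 variances (not reported in the article).
reduction = proportional_reduction(100.0, 93.78)

# Deviance difference between Model 3 and Model 4, as reported in the text.
p_value = chi_square_p_value_df1(50.836)
```

The resulting p-value is far below .001, in line with the reported significance level.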

Specifically, Model 2 in Table 3 suggests that six out of eight individual-level 
variables are significant predictors of anomie. With all the other variables in the 
model being held constant at zero, an individual is expected to have an anomie 
score of 8.176. Males tend to be more anomic than females (γ = 1.027, t = 8.098,
p < .01); older people are less anomic than younger people (γ = -.773, t = -11.12, p
< .01); and the married have a lower anomie score than the unmarried (γ = -1.106,
t = -8.216, p < .01). In addition, the employed show a lower level of anomie than
the others (γ = -.484, t = -2.192, p < .05); the more savings a family has, the less
anomic an individual is (γ = -.32, t = -3.247, p < .01). Finally, the more confidence
an individual has in the authority (legal system, police and government),
the less anomic s/he is (γ = -.798, t = -3.615, p < .01). The other two variables,
education and subjective social class, are not significant predictors of anomie. The 
standardized coefficients in the parentheses show that, at the individual level, age 
has the strongest effect (-.158) on an individual’s level of anomie, followed by 
the confidence in authority (-.071), gender (.065), marital status (-.063), family 
savings (-.036), and employment status (-.028). Overall, the evidence for research 
hypothesis 1 regarding the effects of socioeconomic positions on anomie in individuals
is mixed: the results are largely consistent with theoretical expectations (gender, age,
marital status, family savings, employment 1, and confidence in authority), with
two exceptions (education and subjective social class).

At the national level, the results presented in Model 3 indicate that the grand 
mean of anomie becomes 6.920, which is lower in comparison with the mean of
anomie in Model 1 and Model 2. The effects of all the individual-level variables 
remain the same as they are in Model 2. The level-2 variable, nations under transi- 
tion, is statistically significant at the .01 level. The results confirm that individuals 
who live in Eastern European nations, which have undergone democratic transition,
experience a higher level of anomie than those who live elsewhere (γ = 2.900,
t = 4.422, p < .001). The Gini index, however, is not significantly related to
anomie. The standardized coefficients show that among all the predictor variables, 
nation under transition has the strongest effect (.567) on anomie. Overall, the 
results of Model 3 suggest that the incorporation of the national-level variables in 
predicting anomie in individuals is necessary and all the effects of the significant 
variables at both levels are in the predicted direction. 

Model 4 was created to explore the cross-level effects on anomie in individuals. 
Before creating Model 4, nations under transition was added to influence the slope 
of each of the significant predictor variables at the individual level respectively to 
explore any cross-level interaction effect on anomie. The analyses show that only 
nations under transition at the national level and confidence in authority at the 
individual level have a significant cross-level interaction effect on anomie.⁶

The results of Model 4 show that by including the cross-level interaction effect, 
the strength of the significant effect of nations under transition increases from .567 
in Model 3 to .980 in Model 4; the effect of confidence in authority at the individual 
level, however, becomes smaller (from -.071 to -.036). Although the strength of this 
effect declines by almost 50 percent (.036/.071), its direction remains the same: 
the more confidence an individual has in authority, the lower level of anomie s/he 
experiences (γ = -.404, t = -2.309, p < .05). This effect, furthermore, is stronger for
individuals who live in the nations under democratic transition (see Figure 1 for the
cross-level interaction effect: the slope is “steeper” for nations under transition). That
is to say, as the levels of individuals’ confidence in authority increase, the levels of
anomie drop faster for individuals who live in nations under democratic transition
than those who live elsewhere and do not experience this sociopolitical change.

[Figure 1: Predicted Anomie Level, by Confidence in Authority and Nation in Transition]
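
The cross-level interaction can be read as a pair of simple slopes: the effect of confidence in authority equals γ80 in nations not in transition and γ80 + γ81 in nations in transition. The sketch below uses the reported Model 4 coefficient of -.404 for γ80; the interaction coefficient γ81 is not given in the text, so the value used here is a hypothetical placeholder that only illustrates the direction of the effect.

```python
def confidence_slope(gamma_80, gamma_81, in_transition):
    """Slope of confidence in authority for a nation (in_transition is 0 or 1)."""
    return gamma_80 + gamma_81 * in_transition


GAMMA_80 = -0.404  # reported Model 4 coefficient for confidence in authority
GAMMA_81 = -0.5    # hypothetical placeholder; the article text does not report it

slope_elsewhere = confidence_slope(GAMMA_80, GAMMA_81, 0)
slope_transition = confidence_slope(GAMMA_80, GAMMA_81, 1)
```

A negative γ81 makes the slope more negative in transition nations, which corresponds to the steeper line described for Figure 1.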


Discussion 


The current study tests the linkage between rapid sociopolitical change and ano- 
mie. As predicted by Durkheim's theory of social transition, anomie as a byproduct 
of rapid social change increases also during the democratic transition in Eastern 
Europe. Our models explain a reasonably large amount of the variance of anomie 
in systematic ways. The results suggest that rapid social change, measured as nations
in transition from authoritarian regimes to democracies, elevates the level
of anomie among individuals in these nations.

The high level of anomie found in nations of Eastern Europe in transition should
not be a surprise. Sung (2003, 2004) examines the relationship between democracy 
and political corruption, but with similar implications. He finds that the relationship 
is curvilinear: political corruption increases in the intermediate democracies, but 
eventually declines in the advanced democracies. With consolidation of democracy 
in Eastern Europe, we expect that the high level of anomie observed in 1995 may 
decrease in the next decade or so when the new civil society builds up its roots and 
replaces the old repressive mechanisms of social integration and social regulation. 

Most contemporary scholars agree that liberal democracy is a desired goal. Liberal 
democracy is defined in this article as the political system that allows political liber- 
ties and democratic rule (Bollen 1993; Fukuyama 1992; Huntington 1991; Lipset 
1981). With the adoption of democratic constitutions and competitive elections, 
the process of early democratic transition from totalitarian rule had come to an end 
by the mid 1990s. While the formal structures of liberal democracy are in place, 
the durability of the newly established structure remains uncertain. According to 
these scholars, liberal democracies promote social justice by increasing rationality, 
mutual trust and self-control. Therefore, individuals in these societies are embedded 
in increasingly complex social configurations such that advancing their self-interest 
less often requires the use of unusual means, such as violence or crime. Theoretically, 
we would expect the decline of anomie in the future. However, we would not expect 
the disappearance of anomie because it is endemic in an organic society according 
to Durkheim and in a democratic society according to Merton. 

The analysis used in the current study expands our understanding of anomie 
at both the individual and national levels. The present study extends Durkheim's 
theory of transitional societies to the nations under democratic transition. Results 
show that individuals who live in Eastern European nations undergoing demo- 
cratic transition have a higher level of anomie. The cross-level examination con- 
firms that nations in transition not only have a direct effect on the intercept of 
the dependent variable, anomie, but also an interactive effect through the level-1 
variable, confidence in authority. That is, the linear lines of confidence in authority 
on anomie show different degrees of drops in the level of anomie: when confidence 
in authority increases, the level of anomie drops faster in nations under transition 
than in those which do not experience rapid social change. Thus, Durkheim’s 
argument about the effect of rapid social change on anomie is empirically sup- 
ported by the contemporary data from nations under democratic transition. In 
addition, the insignificant effect of Gini index on anomie seems to be consistent 
with Durkheim's arguments that inequality is a natural and inevitable human 
condition, and that it is not associated with social maladies such as crime (Vold,
Bernard and Snipes 2002), and by extension, anomie.

The descriptive analysis of the current study shows that anomie exists in every 
one of the 30 nations, and the mean individual level of anomie differs from nation 
to nation. Merton's proposition that different societies have different levels of socially
induced anomie has been empirically verified. Contrary to Merton, however,
the level of anomie is not especially high for the United States and inequality is not 
related to anomie in these societies. We cannot say that Merton’s argument was in- 
correct. Time is one of the ultimate constraints on all theories (Kuhn 1970). While 
the United States was one of the few democracies in the late 1930s when Merton 
published his essay, the third wave of democratization since the late 1970s has 
created new speculative spaces and a universal merchant mindset. Using the most 
efficient means to achieve the desired economic success is no longer limited to the 
United States. Merton (1938) suggests that anomie is a typical adaptive behavior in 
democratic societies in general. The American dream has, through the spread of the 
democratic system, become a universal dream (Cao 2004). In addition, Merton’s 
theory was advanced to describe anomie in societies with more stable cultural and 
social structures. Therefore, a more precise test of Merton’s theory should rely on 
data from nations with relatively stable economic and political systems. 

At the individual level, the results show that the effects of the socioeconomic 
variables are consistent with previous findings in the literature (Cao 2004; Jensen 
2002; Paternoster 1989; Sampson and Bartusch 1998): males, the young, the un- 
married and the unemployed are found to be more anomic than females, the older, 
the married and the employed. It should be noted that the measures of anomie in 
these various studies all differ slightly from one to another. The similar findings 
are reassuring that our measure of anomie is valid. In addition, the more family 
savings an individual has the less anomic he or she is, and the more confidence an 
individual has in authority (the police, the legal system and the government), the 
less anomic he or she is. These results largely support Merton’s argument that ano- 
mie is inversely related to an individual’s social and economic position in a society. 

The present study has several limitations. First, our operationalization of ano- 
mie is narrow, closer to Merton’s definition of the delegitimation of social norms 
with an emphasis on the goals-means discrepancy than to that of Durkheim's broad 
conceptualization. Second, although the World Values Survey is well designed for
cross-national comparisons, it was not designed specifically for the current study.
Like many cross-national data sets, the World Values Survey data have an intrinsic
limitation: missing values. Consequently, fewer than 60 percent of the nations
(30 out of 57) in the survey were chosen for the current analysis. Future studies 
need to rely on better comparative data that allow more nations in the analysis. 
Third, our measure of social change is captured by one categorical variable and it 
may be too simplistic. The concept of change can be captured as the continuous 
variable of the degree of more or less transition and the transition to more or less 
democracy. Also, the cross-sectional design limits our ability to measure change.

Despite these limitations, the present study is the first to examine the linkage 
of rapid social change and anomie cross-nationally using hierarchical linear
modeling. The findings highlight the importance of using multi-level data to
explore the sources of anomie among individuals. They extend Durkheim's theory
to explain anomie in the nations undergoing rapid sociopolitical change. They also
lend support to Merton’s contention that a certain level of anomie is normal in
democratic societies.


Notes 




1. The subjective social class has the most missing values in Sweden (13.4%). We replace
the missing values with the sample mean for Sweden. To make sure that this does not
affect the results, we recode the non-missing values for “subjective social class” in
Sweden as “1” and the missing values as “0,” and run a regression analysis with this
variable. The new variable is not significantly related to anomie, meaning that there is
no difference between those who do not answer an item and those who answer.


2. To test whether an HLM analysis is needed for the current study, we ran a regression
with anomie as the dependent variable and all the individual-level variables as
independent variables. A one-way ANOVA was conducted using the unstandardized
residual obtained from the regression analysis as the dependent variable and nation as
the factor. The ANOVA result shows that the model violates the OLS independence
assumption that the errors should be independent from each other (F = 127.45,
df(between) = 29, df(within) = 33095, p < .01). Therefore, a hierarchical analysis is more
appropriate than an OLS regression for solving the problem of correlated errors.


3. There are two main advantages of centering the predictors (Kreft and Leeuw 1998):
(1) obtaining estimates of β0j and other effects that are easier to interpret; (2) removing
high correlations between the random intercept and slopes, and high correlations
between first- and second-level variables and cross-level interactions.


4. The grand-mean centering is interpreted as a deviation away from the grand mean.
In the present study, the grand-mean centering instead of group-mean centering is 
used mainly because there are no strong theoretical reasons to do the group-mean 
centering (Luke 2004). 


5. The coefficient for binary or dummy variables is interpreted as the difference in the
dependent variable between the group coded as 1 and the group coded as 0. 


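6. The model is what Raudenbush and Bryk (2002) called an “intercepts- and slopes-
as-outcomes” model. The equations for Model 4 can be written as:

$$
\begin{aligned}
\text{Anomie}_{ij} = {} & \beta_{0j} + \beta_{1j}(\text{Gender})_{ij} + \beta_{2j}(\text{Age})_{ij} + \beta_{3j}(\text{Marital Status})_{ij} + \beta_{4j}(\text{Family Savings})_{ij} \\
& + \beta_{5j}(\text{Employment Status})_{ij} + \beta_{6j}(\text{Education})_{ij} + \beta_{7j}(\text{Social Class})_{ij} \\
& + \beta_{8j}(\text{Confidence in Authority})_{ij} + r_{ij} \\
\beta_{0j} = {} & \gamma_{00} + \gamma_{01}(\text{Nation})_{j} + \gamma_{02}(\text{Gini})_{j} + u_{0j} \\
\beta_{1j} = {} & \gamma_{10} \\
\beta_{2j} = {} & \gamma_{20} \\
& \vdots \\
\beta_{8j} = {} & \gamma_{80} + \gamma_{81}(\text{Nation})_{j}
\end{aligned}
$$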
Appendix: Gini Index for Nations of Eastern Europe in Democratic Transition

Nation/Region          1985           1995
Armenia                26.9 (1986)    38.1
Azerbaijan             26.6 (1986)    45.9
Belarus                25.4 (1986)    28.3
Bosnia Herzegovina     —              32.9 (1991)
Bulgaria               27.9           36.8
Croatia                21.1 (1987)    26.5 (1993)
Estonia                26.7 (1986)    30.3
Latvia                 25.2 (1986)    [illegible]
Lithuania              [illegible]    [illegible]
Macedonia              32.2 (1989)    35.9
Moldova                24.8 (1986)    39.0
Russia                 26.1 (1986)    43.2
Slovenia               23.5 (1987)    35.8


Data source: World Income Inequality Database, United Nations University, 2005 

Note: The Gini index is based on gross, net and taxable incomes; earnings and expenditure; 
main income recipient units; rural and urban data. The Gini index data in this appendix are 
collected from the WIID following the same income definitions (e.g., gross) and reference 
units (e.g., persons) for Gini index as much as possible. Because not all Gini indexes 
follow the same income definitions and reference units, the data still need to be interpreted
with caution.
Incubating Innovation or Cultivating Corruption? The
Developmental State and the Life Sciences in Asia 


Cheol-Sung Lee, University of Chicago 
Andrew Schrank, University of New Mexico 


A substantial body of literature purports to document the growth of scientific miscon- 
duct in Northeast Asia. This article traces the apparent growth of research fraud and 
falsification to two distinct features of the national innovation systems common to 
the region: liberal research regimes adopted by developmental states and marked by 
freedom from government oversight, and illiberal laboratory cultures imported from 
Germany and marked by all-powerful lab directors and their vulnerable underlings. 
Based on comparative, qualitative case studies of pioneering countries in bio-medical 
research, as well as cross-national quantitative analyses of the permissiveness of na- 
tional stem-cell research policies, we argue that Asia’s scientific pathologies are the 
products of two institutional factors: funding and freedom offered to scientists by 
developmental states, and the lack of informal control prevalent in the German model 
of higher education. We conclude that, while Northeast Asian officials offer their 
biomedical researchers funding and freedom to take advantage of opportunities that 
rarely exist in the West, their scientists stifle open debate and criticism, and thereby 
hinder the growth of informal as well as formal control mechanisms that are critical 
for deterring and detecting scientific fraud. 


Introduction 


Why is Northeast Asia particularly prone to scientific scandal (Financial Times 
2006; Beech 2006; Nordling 2006; Shetty 2007)? We trace the answer to two dis- 
tinct features of the “national innovation systems” (Nelson 1993) engendered and 
exploited by the region’s policymakers in the late 20th century: (1) liberal research
regimes adopted by “developmental” states (Johnson 1982) in an effort to overtake
their advanced industrial counterparts, and (2) illiberal laboratory cultures inherited
from Germany by way of imperial Japan (Ben-David 1977; Altbach 1989).
While Northeast Asian officials offer biomedical researchers funding and freedom, 
and thereby exploit opportunities all but foregone in the West (Fukuyama 2002; 
Einhorn 2005; Owen-Smith and McCormick 2006), their academic collabora- 
tors disdain debate and disclosure, thereby inhibiting the growth of informal 
as well as formal control mechanisms that might otherwise deter misconduct 
(Normile 2003; Ihlwan 2005; Beech 2006; DeJonquieres 2006; Financial Times
2006; Gottweis and Triendl 2006).

Our primary argument is worth underscoring at the outset: neither a liberal 
research regime nor an illiberal laboratory culture is a sufficient basis for wide- 
spread scientific misconduct. By providing abundant resources to heavy-handed 
lab directors who produce high-profile articles in Western journals, Northeast 
Asian policymakers offer their scientists a motive (pecuniary and non-pecuniary
rewards) and a means (deference and discipline on the part of their subordinates)
to perpetrate fraud and falsification. The resources are allocated by developmental
states; the labs are products of German models of higher education; and the
misconduct is an outgrowth of their interaction in contemporary Northeast Asia. 


Intellectual Context 


Woo-Suk Hwang completed his doctorate in veterinary science at Seoul National 
University and joined the faculty there in 1986. Almost two decades later, Hwang
and his collaborators announced three “striking achievements.” In March 2004,
they described the successful generation of a line of embryonic human stem
cells from the nucleus of an adult cell in Science. In June 2005, they described a 
significant improvement in the efficiency of their nuclear transfer procedure in 
the same venue. In August 2005, they announced their successful effort to clone 
a dog—an animal notoriously resistant to cloning—in Nature. 

Hwang's purported contributions to the growing field of regenerative medicine 
captured the imaginations of scientists and their supporters worldwide. They also 
transformed him into a veritable celebrity—complete with fan club and $65 mil- 
lion research fund—in his homeland (Kang and Segal 2006). By the end of 2005, 
however, Hwang's reputation would be irreparably damaged by allegations that he 
had falsified two of his three most important articles and that international stem cell 
research was not nearly as far along as previously believed. Following an internal 
investigation, SNU dismissed Hwang, and the Korean government filed charges 
against him for the fraudulent use of government research funds. Subsequent 
allegations have included the intimidation of subordinates, payments of hush 
money to colleagues, and influence peddling within the Korean academy. 

Hwang is neither the first, nor in all likelihood the last, of his compa- 
triots to fall from scientific grace. Unverified claims of human cloning first 
surfaced in, and cast doubt on, the Korean life sciences in the late 1990s 
(Baker 1999). Scientists familiar with the country's biomedical community
portrayed Hwang's misconduct as a sign of things to come a few years later 
(Wade 2005; Weiss 2005), and their fears were confirmed in 2008, when the 
Korea Advanced Institute of Science and Technology suspended molecular 
biologist Tae-kook Kim, who had previously been portrayed as a Nobel Prize
candidate, on the grounds that he, too, had fabricated data in two of his best
known articles (Korea Times 2008).

Nor is Korea unique. Japan’s most prestigious universities dismissed two different
biochemists on independent charges of falsification in late 2006 (Brender
2006; Nature 2007; Normile 2007), and rogue researchers are in all likelihood 
more prominent, as well as more common, in authoritarian polities than in their 
democratic counterparts. Take, for example, China: while government officials 
acknowledge at least 10 recent cases of misconduct, including six “high-profile” 
cases in the life sciences (Jia 2006:867), their critics worry that disclosure is the 
exception to the rule—especially among the country’s scientific elite. “The higher 
the position a cheater occupies,” argues one whistleblower, “the easier for him 
to avoid investigation and punishment.” (Marquand 2006:3) 

Finally, the perception of scientific misconduct is no less threatening to the sci- 
entific enterprise than the misconduct itself. Western scientists, editors and fund- 
ing agencies are beginning to express doubts about Asian science as a whole (Wade 
2005; Beardsley 2006; Jia 2006; Spice 2006; Nature Medicine 2006; Wadman 
2006). Asian scientists compound their already substantial credibility problems 
by trading anonymous allegations over the Internet (Choe 2005; Hasebe 2006), 
and Chinese scientists are particularly vulnerable to Western skepticism, for their 
political patrons afford neither whistleblowers nor their targets due process of law 
(Cyranoski 2006a; Einhorn and Arnst 2006; Nature 2006a; Xin 2006). 

Northeast Asia’s predicament speaks to at least three important sociological pro- 
cesses: the expansion of Western science (Drori et al. 2003; Schofer 2004), the transfor- 
mation of the “developmental state” (Johnson 1982; Evans 1995; Weiss 2000; Chibber 
2003), and the “evaluation and control” of scientific research (Owen-Smith 2001; 
see also Collins 1968; Hackett 1994; Zuckerman 1977). While Western observers 
trace the proliferation of scandal to the alleged absence of scientific standards in 
Asia (Stein 2005; Nordling 2006), and therefore decry the growth of potentially 
risky research in late-industrializing countries, their critics disdain explanations 
that focus on “the non-Western scene of the alleged crime” (Kevles 2006: B3) 
and bemoan the “profiling” of Asian scientists (Weiss 2006; see also Wong 2006). 

The critics overstate their case by half, however, for explanations that empha- 
size “the non-Western scene of the crime” (Kevles 2006:B3) are not necessarily 
essentialist or biological in nature, and it is therefore incumbent upon social 
scientists to transcend parochial accounts of misconduct by “replacing proper 
names of social systems with the names of variables.” (Przeworski and Teune
1970:30) We endeavor to do so by identifying the incentives and opportunities
engendered by two different political and organizational features of Northeast
Asian science: liberal research regimes produced by developmental states, and 
illiberal lab cultures that are byproducts of German models of higher education. 


Motivating Fraud through Liberal Research Regime 


Kang and Segal (2006) trace the proliferation of potentially problematic re- 
search not to the purported pathologies of Asian culture but to the nationalist 
ambitions of Asian elites. While in the West, government officials regulate controversial
research, and thereby attempt to reconcile competing interests, in the East
they reward controversial research, and thereby exploit the opportunity to overtake 
the West in the life sciences where ethical issues loom particularly large (Fukuyama 
2002; Banchoff 2005; Einhorn 2005; Gottweis and Triendl 2006). Not only Korea 
but China, Singapore and other Asian countries hope to take advantage of Western 
regulations to build their own scientific capacity (Fukuyama 2002). 

In fact, Korean stem cell scientists have redoubled their efforts in the after- 
math of the Hwang affair (Nature 2006b), and Korea is by no means the only 
latecomer to leave a mark. “Similar cutting-edge work is being done in countries 
like China and Israel, also with lax rules.” (Lee 2004:22) Singapore is positioning 
itself as “a haven for biomedical freedom” (Arnold 2006:1), and in the 1990s, 
the Japanese responded to economic crisis by increasing government support 
for their nascent biomedical effort (Kinoshita 1994). 

We cannot blame misconduct on the lack of standards per se, however,
since a lack of regulation is not necessarily identical to a lack of standards.
On the contrary, the Korean government's effort to promote basic research
has been accompanied by the imposition—rather than the abandonment—of 
performance standards: Baker (2000:207) notes that the Korean Ministry of 
Science and Technology demands “visible, clear, and quantitative evidence” 
that program administrators are achieving their goals. However, Brender 
(2004:34) labels the use of citation counts in Western journals “one of the 
more controversial components” of their effort, and Kirk (2006:7) cites one 
Korean professor who suggests that performance standards may be the cause 
of—rather than the solution to—the problem. “One problem is a strict policy 
on supporting research,” he says. “If the research does not produce results in a 
certain period, the funds are withdrawn.” In short, the Korean reward system 
prioritizes prominent articles in Western journals rather than transparency, 
accountability or progress. In the face of such stringent standards and funding 
contingencies, it is natural that researchers have incentives to cut corners (or 
worse) in their research. 

Furthermore, Hwang played the system perfectly—at least for a while. By 
pursuing fashionable topics, embracing prominent collaborators, and pub- 
lishing memorable articles in visible outlets, he distinguished himself from 
his colleagues, enrolled a veritable army of professional, popular, and private 
sector supporters (Callon and Law 1987), and thereby won government favor 
(Len 2004; Kirk 2005; Gottweis and Triendl 2006). 

Scientists face similar incentives throughout the region, however, and their 
public sector patrons continue to raise the stakes. Asian science officials have 
made funding contingent upon publication impact factors since the late 1990s 
(Swinbanks and Nathan 1997; Cao and Suttmeier 2001; Xin 2006). They 
have also recently introduced explicit cash rewards for blockbuster articles 
(Fuyuno and Cyranoski 2006)—much to the chagrin of critics who worry that 
explicit pecuniary incentives will tempt researchers “to plagiarize or fabricate 
data.” (Nature 2006c:786) 


Facilitating Fraud in Illiberal Laboratories 


Temptation is only half the story. Unscrupulous scientists need a means as well 
as a motive, and existing studies therefore treat illiberal labs as particularly likely 
sources of fraud and falsification. The authors of the Northeast Asian reward sys- 
tem implicitly or explicitly assume that articles that have survived peer review at 
Western journals are scientifically sound. Is their assumption defensible? Ironically, 
the Korean stem cell scandal has been invoked by both defenders and critics of 
the traditional model of quality control. While optimists underscore the eventual 
discovery of Hwang’s fraud, and thereby implicitly defend the Mertonian model 
of “open science” (Kevles 2005; Merton 1964), their critics emphasize the depth 
and duration of deceit at the highest levels of Korean science (Weiss 2005). 

The critics appear to have the upper hand, however, for the existing litera- 
ture acknowledges the inadequacy of peer review in a world of incomplete 
information, self-interest and guile (Chubin 1985; Owen-Smith 2001). The 
literature simultaneously suggests that replication—arguably the litmus test of 
scientific integrity—is particularly uncommon in the life sciences for at least 
two reasons: first, the “specificities of biological material” render replication 
difficult under the best of circumstances; and, second, the premium placed on 
originality—along with temporal and resource constraints—renders replication 
unappealing to individual scientists (Franzen et al. 2007:4). 

In fact, Owen-Smith suggests that academic falsehoods and fabrica- 
tions—and low quality work more generally—are deterred not by peer review 
but by skeptical encounters among team members. “By monitoring outcome 
quality,” he writes, “such skepticism underpins decisions about what claims 
‘make it out’ of the lab to the wider evaluative arena of peer review and public 
presentation” in the first place (2001:429). North American lab directors 
therefore go out of their way to organize skeptical encounters among team 
members and in so doing transform the Mertonian notion of “organized skep- 
ticism” from a “global scientific norm” into a “local mechanism for organizing 
and evaluating work.” (Owen-Smith 2001:436; see also Hackett et al. 2006; 
Cho et al. 2007) 

However, quality control through organized skepticism presupposes debate 
and discussion, and Northeast Asian labs privilege neither. On the contrary, 
they tend to discourage “dissent and outspoken criticism” in favor of “conformity 
and respect for authority.” (DeJonquieres 2006:19) Thus, Hwang's lab featured 
a strict division of labor: Hwang monopolized design; his colleagues followed 
his orders; and martial and industrial organizational principles dominated 
the routines and practices in his lab (Len 2004; Wade 2005; Ihlwan 2005; 
DeJonquieres 2006). Rick Weiss has drawn an explicit distinction between 
“the hive of communication typical of an American lab” and the “factory as- 
sembly line” presided over by Hwang (Weiss 2005: A2), and his observations 
apparently describe the rule rather than the exception in Korea. A Harvard-trained 
Korean biologist acknowledged more than a decade ago that frank evaluations 
of colleagues are simply “not a part of the scientific culture” in his homeland 
(Kinoshita 1993:356), and the problem is particularly vexing for junior scientists 
who are all but entirely dependent upon senior professors (Kirk 2005). 

Nor is Korea unique. Illiberal lab cultures are the norm throughout Northeast 
Asia and have been blamed for the growth of misconduct in Japan and China as 
well as in Korea (Saegusa 1999; Normile 2003; Cyranoski 2006b; DeJonquieres 
2006; Cho et al. 2007). In fact, the editors at Nature Medicine (2006:255) worry 
that “the hierarchical organization of science in Korea prevented whistleblowers 
from coming forward” before Hwang and his collaborators had passed the point 
of no return. They also suggest that “China is no stranger to hierarchies” and could 
therefore experience similar problems. 

In summary, Northeast Asia’s liberal research regimes interact with the region's il- 
liberal lab cultures to produce fraud. While the former reward the authors of promi- 
nent publications and invite misconduct, the latter stifle debate and facilitate fraud. 


Scientific Fraud in Northeast Asia: Motives and Mechanisms 


Most commentators on Northeast Asian scientific fraud focus on the proximate 
causes of fraud and falsification to the exclusion of their underlying roots by 
assuming—rather than proving—that research regimes and lab cultures are exogenous 
variables. In fact, the liberal research regimes and illiberal lab cultures that 
allegedly animate research misconduct in Asia are not exogenous variables to be 
invoked but endogenous variables to be explained. They are in all likelihood prod- 
ucts as well as producers of technological underdevelopment, and their origins are 
therefore no less important than their consequences. Where do they come from? 
We now trace the answer to the nature of Asian states and the character of Asian 
universities and propose two hypotheses: that liberal research regimes are products 
of developmental states, and that illiberal lab cultures are legacies of German 
models of university organization. 


Hypothesis 1: Liberal research regimes are products of 
developmental states. 


Chalmers Johnson introduced the by-now-familiar distinction between “devel- 
opmental” states and their “regulatory” counterparts in 1982. While the former 
are “goal-oriented” and therefore take on a number of “developmental functions” 
including indicative planning, industrial rationalization and export promotion, 
the latter are rule-oriented, and therefore assume a broadly neutral posture 
toward firms, sectors and industries (Johnson 1982:19). According to Johnson, 
while the United States is a typical example of the regulatory state, Japan em- 
bodies the prototypical developmental state (Johnson 1982). Newly industrializing 
countries represent variations on the theme (Johnson 1986); and the less developed 
countries hope to replicate their success (Johnson 2000; Amsden 2001). * 

By giving manufactured exports pride of place in their development strategies, 
however, the LDCs are imposing new demands on their middle-income predeces- 
sors including the substitution of mental for manual labor and proprietary for 
imported technology (Weiss 2000; Wong 2004). “The challenge for the middle- 
income world is to find ways to ‘tech up’ and enter the global knowledge econ- 
omy,” explains Garrett (2004:93-4), “so as to escape the trap of having to dumb 
down to compete in standardized manufacturing and, increasingly, standardized 
services.” (See also Summers 2006.) Furthermore, their traditional mid-technology 
industries faced a number of competitive and political threats including rising 
production costs at home, de facto protectionism in the United States, and the 
impending outlaw of their traditional export subsidies by the WTO (Mistry 1991; 
Crane 1994; Kim and Leslie 1998; Amsden 2001; Wong 2004). Policymakers in 
middle-income countries are therefore reinventing their developmental states with 
an eye toward cultivating frontier technologies. They are paying particular atten- 
tion to controversial activities abandoned or circumscribed by regulatory states in 
North America and Western Europe (Kang and Segal 2006), and their preferred 
policy instruments include liberal biomedical research regimes (Fukuyama 2002; 
Lee 2004; Blonder 2005; Kharif 2004). 


Hypothesis 2: Illiberal lab cultures are the legacy of the 
German model of higher education. 


Ben-David introduced the no less important distinction between the continental 
European “chair system” of faculty organization and the Anglo-American “de- 
partmental” alternative in the 1960s (Ben-David and Zloczower 1962). While 
the former gives power and responsibility over the primary academic operating 
unit to a single professor, and thereby fosters deference on the part of his or her 
subordinates, the latter distributes power and responsibility across a wider range 
of roles and ranks, and thereby fosters a more permeable and pluralistic university 
environment (Clark 1978). According to Ben-David, German universities gave 
birth to the chair system in the early 19th century. Japan imported the German 
model in the late 19th century (Ben-David 1977; Clark 1978; Ogawa 2002). 
Korea and Taiwan inherited the chair system from their Japanese colonists (Kehoe 
1949; Wood 1961; Altbach 1989; Lee 1989; Wu 1989), and postwar China offers 
observers of the two models a natural experiment. While Nationalist policymakers 
abandoned the chair system for the departmental alternative in the late 1940s (Wu 
1989), and thereby put to rest the Japanese legacy on Taiwan (Wu 1989), their 
Communist rivals imported the chair system from Europe by way of the Soviet 
Union in the 1950s and 1960s (Hayhoe 1987), and thereby fostered the growth 
of “clanlike” (Lo 1991:710-11) hierarchies on the mainland. 

Figure 1. Formal and Informal Control of Scientific Misconduct 
(axis label: Degree of Formal Control) 

What are the consequences of an academic system that “binds students to 
their mentors” from recruitment to retirement? Students find it all but impos- 
sible to challenge mentors who have the power to determine their academic and 
career prospects (Cao 2004). However, deference and dependence inhibit the 
communicative underpinnings of open science (Merton 1964) and thereby offer 
unscrupulous senior scientists the means to engage in fraud. 

The implications for both organizational behavior and national development 
are perhaps obvious. While Asian officials conditioned subsidies on exports 
in the late 20th century (Amsden 2001), and thereby gave their private sector 
associates an incentive to conquer world markets, they are now conditioning 
research grants and awards on publications and peer review, and are thereby giv- 
ing their scientists an incentive to publish in prestigious Western journals—often 
by any means necessary. What are the likely consequences? Our two hypotheses 
aggregate into the four outcomes illustrated in Figure 1. German model univer- 
sities in developmental states are likely to produce academic misconduct. They 
offer unscrupulous scientists both a motive and the means to perpetrate fraud 
and falsification. Anglo-American universities in regulatory states are relatively 
unlikely to produce academic misconduct. They deny potential perpetrators 
both the motive and the means to engage in rogue activity. Anglo-American uni- 
versities in developmental states and German model universities in regulatory 
states are intermediate cases. The former offer potential perpetrators a motive 
but not the means to deceive their colleagues and publics, while the latter offer 
potential perpetrators the means but not the motive to do the same. 
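
The four outcomes can be restated as a small lookup. The following is only an illustrative sketch of the typology described above: the names `State`, `University` and `misconduct_risk` are hypothetical labels introduced here, not terms from the literature, and the returned strings simply paraphrase the text.

```python
from enum import Enum


class State(Enum):
    DEVELOPMENTAL = "developmental"   # assumed to supply the motive (liberal research regime)
    REGULATORY = "regulatory"


class University(Enum):
    GERMAN = "German model"           # assumed to supply the means (illiberal lab culture)
    ANGLO_AMERICAN = "Anglo-American"


def misconduct_risk(state: State, university: University) -> str:
    """Combine the two hypotheses into the four outcomes of the typology."""
    motive = state is State.DEVELOPMENTAL
    means = university is University.GERMAN
    if motive and means:
        return "likely"
    if not motive and not means:
        return "relatively unlikely"
    # Exactly one of the two conditions holds: an intermediate case.
    return "intermediate (motive only)" if motive else "intermediate (means only)"


for s in State:
    for u in University:
        print(f"{u.value} universities, {s.value} state: {misconduct_risk(s, u)}")
```

The sketch treats motive and means as simple yes/no conditions; the argument in the text is probabilistic, so the labels should be read as relative likelihoods rather than predictions.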

Are liberal research regimes in fact more common in developmental states (or 
their imitators) than in their regulatory counterparts? We provide a preliminary 
answer by examining the socio-political bases of human embryonic stem cell 
research regimes in 47 middle- and upper-income countries: HESCR provides a 
critical test of our first hypothesis because U.S. restrictions on stem cell research 
funding are accelerating competition among foreign scientists (Cook and Allen 
2004; Owen-Smith and McCormick 2006) and controversy among their ob- 
servers. While bioethicists decry the potential “outsourcing of ethics” (Hurlbut 
2006), and pay particularly careful attention to the case of HESCR, their crit- 
ics confidently assert that “there has been no global race to the bottom in stem 
cell research.” (Banchoff 2005:205; see also Paarlberg 2005) Who is correct? The 
Korean HESCR scandal provides an obvious point of analytical departure. 


Cross-National Quantitative Analysis 


A number of observers have portrayed Hwang Woo Suk as a metaphor for 
Korean policies and ambitions (Fifield 2005). We have argued, by way of contrast, 
that Hwang is more than a metaphor; that is, he is in fact the product of 
Korea's German model educational institutions and developmental state. Korean 
officials wanted to cultivate high technology. They viewed embryonic stem cell 
research—where most OECD countries had tighter standards—as fertile soil 
(Pollack 2004; Choe 2006), and they expected scientists and their universities 
to constitute the professional and organizational backbones of their “technon- 
ationalist” aspirations (Kang and Segal 2006; as well as Normile 1997; Wong 
et al. 2004). They were not alone, however, for government efforts to promote 
HESCR through public funding and regulatory latitude are common to late 
industrializing countries governed by actual or aspiring developmental states 
(Kharif 2004; Lee 2004; Blonder 2005). 

Is the apparent relationship between late-industrialization and government 
promotion of stem cell research systematic or spurious? A substantial body of 
literature traces the degree of tolerance for HESCR to religious factors. While 
Christian societies ascribe moral status to the embryo, and therefore oppose or 
strictly regulate HESCR, their Jewish, Muslim, Buddhist and Confucian counter- 
parts are at least divided on the question, and therefore tolerate varying degrees of 
research (Paarlberg 2005). Consequently, Kharif places Korea—along with China 
and Israel—on a list of “countries that are trying to cash in on types of research 
that the U.S. doesn’t fund.” (Kharif 2004) 

The cultural explanation is incomplete, however, for Korea has a politically 
influential Christian population (CIA 2005) and the United States is an outlier. 
A number of Christian countries are tolerant of a wide range of HESCR activities 
and Fox (2004) therefore denies the veracity of a simple link between religion and 
regulation. In fact, the cultural account ignores the pluralism of most religious 
traditions; the potentially problematic translation of religious beliefs into policy; 
the countervailing weight of material interests; and the activist influence of the 
scientists themselves (Walters 2004). 

Is Korea’s aggressive approach to stem cell research exceptional or characteristic 
of “international challengers” (Banchoff 2005:205) more generally? We explore 
the cross-national correlates of Human Embryonic Stem Cell Research by means 
of ordered logistic regression analysis. Hoffman has divided national stem cell 
policies into three ordinal categories: permissive, flexible and restrictive (Hoffman 
2006; Russo 2005). Permissive countries allow stem cells to be derived through 
a variety of techniques, including somatic cell nuclear transfer or “therapeutic 
cloning.” Flexible countries allow stem cells to be derived from embryos donated 
by fertility clinics and ban or restrict SCNT. Restrictive countries either ban 
human embryo research outright, permit research on a limited number of stem 
cell lines, or permit research on imported stem cell lines. We treat the degree of 
stem-cell policy permissiveness in 47 upper- and middle-income countries for 
which embryonic stem cell research appears feasible and data are readily available 
as a function of three different variables: per capita income, Christianity, and the 
depth and power of the scientific establishment: 

* Permissiveness of national stem-cell research policies. The dependent variable is an 
index derived from Hoffman’s codings: permissive = 3; flexible = 2; restrictive = 1.° 

* Per capita income. We expect the permissiveness of stem-cell policies to be 
an inverse function of income, as middle-income late industrializers are more 
willing to deregulate stem cell research in an effort to move ahead in the life 
sciences. We measure income as the log of GDP per capita—in order to minimize 
the risk of heteroscedasticity—and use average values from 2003 to 2005 in 
thousands of constant 2000 U.S. dollars in order to smooth the consequences 
of cyclical fluctuations (World Bank 2007). 

* Christianity. We treat regulation of HESCR as a function of the depth of 
Christianity, and therefore expect the permissiveness of stem-cell policies to be an 
inverse function of Christianity. We code Christianity as an indicator variable that 
assumes a value of 1 wherever the population is more than 50 percent Christian 
(CIA 2005). 

* Scientific establishment. We treat scientists and their allies as key opponents of 
regulation and therefore expect stem-cell policy permissiveness to be a positive 
function of the depth and power of the scientific establishment. We use two 
different measures of the power and influence of the scientific community: 
R&D expenditures as a percentage of GDP circa 2004, and the number of 
researchers per million inhabitants circa 2004 (UNESCO 2008). First, we view 
scientists as self-interested defenders of liberal policies. Second, we expect 
governments that invest more of their material and human resources in 
R&D to pursue more permissive policies in an attempt to realize a political 
and economic return on their investment. 
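
The coding rules above can be sketched for a single country as follows. This is a minimal illustration, not the procedure actually used to build the data set: the function name `code_country` and the sample figures are hypothetical, and the natural logarithm is an assumption, since the text specifies only "the log of GDP per capita."

```python
import math

# Ordinal dependent variable derived from Hoffman's codings.
POLICY_CODES = {"restrictive": 1, "flexible": 2, "permissive": 3}


def code_country(policy, gdp_per_capita_by_year, percent_christian):
    """Return (policy index, logged income, Christianity indicator) for one country.

    gdp_per_capita_by_year: annual GDP per capita in constant 2000 U.S. dollars
    (2003 to 2005 in the text); the values are averaged, converted to thousands
    and logged (natural log assumed here).
    """
    mean_gdp_thousands = sum(gdp_per_capita_by_year) / len(gdp_per_capita_by_year) / 1000.0
    log_income = math.log(mean_gdp_thousands)
    christian = 1 if percent_christian > 50 else 0   # majority-Christian indicator
    return POLICY_CODES[policy], log_income, christian


# Hypothetical country; the figures are invented purely for illustration.
row = code_country("permissive", [12000.0, 12500.0, 13000.0], 30.0)
print(row)
```

Averaging before logging follows the order in which the text describes the steps; logging each year first and then averaging would give slightly different values.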


Results and Interpretation 


Table 1 presents the results of two different ordered logit models incorporating 
the two different measures of the power and influence of the scientific community. 
Regression coefficients are placed above their parenthesized t-statistics. 

What drives cross-national variation in stem-cell policy permissiveness? All 
variables have the expected signs and are—with the exception of Christianity in 
Model 1—statistically significant. 

Ordered logit models are interpreted in more or less the same manner as 
conventional logistic regression models. The exponentiated coefficients—or odds 
ratios—represent the expected change in the odds of the dependent variable (i.e., 
movement from one degree of permissiveness to the next) produced by a one-unit 
change in the independent variable. The odds ratio for GDP per capita is .45 in 
Model 1 and .35 in Model 2, suggesting that every one-unit increase in 
logged GDP per capita reduces the odds of a more permissive stem-cell policy 
by roughly 55 percent in Model 1 and 65 percent in Model 2. 
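
The odds ratios quoted above can be recovered by exponentiating the logged GDP per capita coefficients reported in Table 1 (-.803 and -1.046). The short sketch below does only that arithmetic; the function names are illustrative, and the percentages describe changes in odds, not in probabilities.

```python
import math

# Logged GDP per capita coefficients as reported in Table 1.
GDP_COEFFICIENTS = {"Model 1": -0.803, "Model 2": -1.046}


def odds_ratio(beta):
    """Exponentiate a logit coefficient to obtain its odds ratio."""
    return math.exp(beta)


def percent_change_in_odds(beta):
    """Percent change in the odds produced by a one-unit increase in the predictor."""
    return (math.exp(beta) - 1.0) * 100.0


for model, beta in GDP_COEFFICIENTS.items():
    print(model, round(odds_ratio(beta), 2), round(percent_change_in_odds(beta)))
# Model 1 0.45 -55
# Model 2 0.35 -65
```

Because the predictor is logged, a "one-unit" change corresponds to multiplying GDP per capita by e (about 2.7) if natural logs were used, which is itself an assumption.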

We interpret the inverse relationship between per capita GDP and the toleration 
of stem cell research as evidence that middle income countries are liberalizing 
their HESCR regimes as part of a “catch-up” strategy in the sciences. Our 
interpretation is consistent with the Asian example. It gains strength from the 
pronounced negative correlation (-.53; p < .01, n = 44) between per capita income 
(logged) and the percentage of R&D spending undertaken by the government, for 
the qualitative evidence suggests that public officials in upwardly mobile countries 
pursue catch-up by offering ambitious scientists “financing and freedom” (Arnold 
2006:1) and both patterns are borne out by the quantitative data.7 


Table 1: Ordered-Logistic Regression Models of Permissiveness of National Stem-Cell 
Research Policies 

                                     Model 1        Model 2 
Christianity                         [illegible]    -.728 
                                     [illegible]    [illegible] 
GDP per capita (log)                 -.803          -1.046 
                                     [illegible]    [illegible] 
Number of researchers                               1.493 
per 100,000 inhabitants (log)                       (2.97)*** 
Threshold Parameter Estimates 
  θ1                                 -7.63          -1.69 
  θ2                                 -5.39          .597 
Log-Likelihood                       -43.31         -41.95 
Likelihood ratio χ²                  15.04          17.76 
χ² test (Probability > χ²)           .002           .001 
Pseudo R-square                      .148           [illegible] 
Observations                         47             47 

Note: N = 47 
Absolute value of t statistics in parentheses; *p<10% **p≤5% ***p≤1% 
The baseline model for LR-test includes the Christianity indicator variable and GDP per capita. 
Proportional Odds Assumption Tests confirm the use of ordered logit for both models (e.g., LR 
ratio test for model 2: 2(-40.4 − (-42.0)) = 3.2, df = 3). The assumption holds at the α = .05 level. 
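
The proportional odds check reported in the note to Table 1 can be reproduced from the two log-likelihoods it quotes (-40.4 and -42.0, with 3 degrees of freedom). The sketch below is a worked version of that arithmetic only; the helper `chi2_sf_df3` is a hypothetical name and uses the closed-form upper-tail probability of a chi-square distribution that is valid specifically for 3 degrees of freedom.

```python
import math


def chi2_sf_df3(x):
    """Upper-tail probability of a chi-square variate with 3 degrees of freedom."""
    return math.erfc(math.sqrt(x / 2.0)) + math.sqrt(2.0 * x / math.pi) * math.exp(-x / 2.0)


# Log-likelihoods quoted in the note to Table 1 for Model 2.
ll_ordered_logit = -42.0    # model that imposes proportional odds
ll_less_restrictive = -40.4  # comparison model that relaxes the assumption

lr_stat = 2.0 * (ll_less_restrictive - ll_ordered_logit)   # 2(-40.4 - (-42.0)) = 3.2
p_value = chi2_sf_df3(lr_stat)

print(round(lr_stat, 1), round(p_value, 2))
# A p-value above .05 means the proportional odds assumption is not rejected.
```

The statistic of 3.2 falls well short of the conventional .05 critical value for 3 degrees of freedom (about 7.81), which is consistent with the note's conclusion that the assumption holds.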

We do not want to exaggerate the reliability of the admittedly limited quan- 
titative evidence available to us. We view the regressions and correlations not as 
definitive causal tests, but rather as provocative exercises in data reduction. They 
suggest that scientific latecomers are simultaneously more dependent on public 
funding and more tolerant of HESCR than their predecessors, and thereby imply 
that the aforementioned Asian pattern is anything but anomalous. 


Comparative Historical Evidence 


Late-developing countries are exploiting opportunities foregone by their advanced 
industrial counterparts by liberalizing and subsidizing biomedical research including, 
but not limited to, HESCR. We are by no means the first analysts to 
underscore the potentially perverse consequences of liberal research regimes in 
late developing economies. A number of experts have portrayed the developing 
world as a particularly liberal research environment (Hall 2006; Beardsley 2006; 
Einhorn and Arnst 2006) and have decried the lack of regulation in Northeast 
Asia in particular (Lee 2004). 

While Northeast Asian scientists have been embroiled in controversy, and have 
therefore encountered skepticism at home and abroad, their rivals in Singapore 
and Israel have for the most part steered clear of scientific—if not necessarily ethi- 
cal—controversy, and have thereby earned high marks among their peers and observ- 
ers. Ethical controversy is all but inherent in biomedical research and is particularly 
salient in upwardly mobile countries with lax regulatory standards (Gross 2000). 
However, ethical controversy (i.e., disputes about the proper scope of scientific in- 
quiry) and research misconduct (i.e., fraud, falsification and plagiarism) are distinct 
outcomes. The former occurs throughout the latecomer population; the latter is by 
all accounts decidedly more common in Northeast Asia than in Singapore and Israel. 

In fact, Liu, executive director of the Singapore Genome Institute and former 
director of clinical sciences at the National Cancer Institute, responded to the 
Hwang affair by offering a negative answer to his own rhetorical question: “Can it 
happen in Singapore?” Liu’s optimism is not based on the extent of formal regula- 
tion. Singapore—like Japan, China, Israel and Korea—scores a maximum value 
of 3 on our stem-cell policy permissiveness index. Instead, Liu (2005) puts 
his faith in the “culture of scientific integrity” being developed in the “next 
generation” of Singaporean scientists and the influx of foreigners that provides 
“perhaps one of the most under-appreciated characteristics” of the city state’s 
innovation system. PhD candidates in Singapore undergo ethical training 
under the auspices of a former Oxford don. They are self-consciously encouraged 
to serve as “internal watch dogs for scientific integrity,” and more than 
half of the 2,000 senior scientists who populate Singapore’s most dynamic labs 
are foreigners—most notably Britons and North Americans—who have been 
enticed to the city-state with promises of funding and freedom (Huifen 2006). 

Some treat Singapore’s life sciences strategy as a direct descendant of the 
“smokestack chasing” approach to industrial development that served the city-state 
so well in the 1960s and 1970s (Arnold 2006) and/or criticize the “outsourcing 
of ethics” (Hurlbut 2006) to latecomer countries. Although they make 
an important point, they fail to address a no less important question: why 
don’t foreign-trained scientists—like their putative Korean counterparts (Valigra 
1993)—adopt “Asian” values and lab cultures when they arrive in Singapore? We 
trace the answer to the city-state’s British colonial heritage. While the Koreans 
inherited an arguably intensified variant of the German academic model from 
Japan, and therefore embraced hierarchical lab cultures, Singapore inherited 
more liberal British models (Altbach 1989)—and responded to crisis by adopt- 
ing an even more liberal American ethos in the 1990s (Cohen 1999).8 

In so doing, Singapore is arguably following in Israel’s footsteps. While the 
founding fathers of Israeli higher education came from Central Europe, and 
therefore built German model universities during the British mandate, their 
descendants came from—and interacted with—North America, and therefore 
abandoned the “elitist” German model for the more liberal American ethos 
in the mid-20th century (Iram 1980:81). In fact, the traditional German chair 
system “lasted for a relatively short period of time” (Toren and Nevo-Ingber 
1989:464) in Israel and had been all but abandoned by the time of partition. 

Finally, Taiwan constitutes the exception to the Northeast Asian rule. Unlike 
some of their Japanese, Korean and mainland Chinese counterparts, Taiwanese 
academics have maintained their bona fides in international circles. They abandoned 
the German model and absorbed the American alternative in the mid-20th century. 
Ruibei and Jinfu (2003:83) hold that the scientific ethos imparted by American- 
trained scholars in Taiwan “stresses the retention of a skeptical attitude, constant 
improvement, an emphasis on originality, and the condemnation of imitation. After 
decades of hard work, this attitude of striving for the new and for change has deeply 
entered Taiwanese consciousness and has had a very broad impact.” 

The continental chair system is alive—if not necessarily well—in Germany, how- 
ever, and the German scientific community therefore constitutes the most obvious 
place to look for a link between illiberal labs and scientific fraud. Do illiberal labs 
foster fraud in their countries of origin? A substantial body of literature counsels 
an affirmative answer, for the Germans lost their scientific innocence in the late 
1990s, when a spate of research misconduct shocked not only Germany but the 
entire scientific world. In the most notorious case, two German oncologists were 
convicted of fabricating or falsifying data in more than 50 articles (Abbott 1999). 


German authorities have responded to the crisis by deemphasizing the importance 
of citation counts and empowering ombudspersons to protect whistleblowers 
(Nature Neuroscience 1998; Nature 2005). While the reforms are new, and 
therefore unproven, their critics worry that whistleblowers remain vulnerable 
in Germany. They also worry that their vulnerability—and scientific malprac- 
tice more generally—is a by-product of the country’s illiberal labs (Abbott 2003; 
Koenig 1997). In fact, the ombudsperson at the University of Göttingen goes so 
far as to trace the prevalence of misconduct and scandal to the “extreme hierarchy 
in old-fashioned clinical departments” and therefore calls for more “serious pre- 
ventive measures.” (Abbott 2003:218) 

What would more serious preventive measures look like? European scientists 
and bioethicists believe that effective anti-fraud campaigns would include “cul- 
tural changes to diminish the steep hierarchy in certain research institutions and to 
encourage open scientific discussions.” (Hagmann 1999:2258) “It’s important to 
take away the fear of talking openly from students,” according to Marinus Lamers 
of the Max Planck Institute for Immune Biology in Freiburg, and “to develop a 
culture of criticism and open discussions.” (Hagmann 1999:2259) 

North American labs offer one model: U.S. scientists tend to facilitate 
“skeptical encounters” among lab participants in an effort to control quality 
and integrity (Owen-Smith 2001:33-4). While North American labs are by no 
means devoid of power and status hierarchies, and are therefore anything but 
egalitarian, they are decidedly more open than their German or Northeast Asian 
counterparts, and are therefore less vulnerable—although by no means invulner- 
able—to fraud and manipulation. 

Nor are informal controls the only obstacles to misconduct in U.S. labs. On the 
contrary, they are supplemented by formal controls that reduce or eliminate the mo- 
tivation for fraud and falsification. Whatever their costs, for example, and they may 
well be enormous (Owen-Smith and McCormick 2006), U.S. restrictions on HESCR 
mean that a Hwang-like scandal would have been highly unlikely at an American 
university. Of course, German officials impose strict limits on HESCR as well (i.e., 
Germany scores a 1 on the Hoffman index), and we therefore suspect that formal con- 
trols constitute the principal obstacle to stem cell research scandal in Germany as well. 

By way of conclusion, the comparative historical evidence is consistent with 
the framework originally distilled into the aforementioned Figure 1. German 
model universities in developmental states like Japan,9 Korea and China constitute 
the most likely breeding grounds of fraud and falsification. Anglo-American 
universities in regulatory nations such as the United States constitute the least 
likely sources of misconduct. And German model universities in regulatory states 
(like Germany itself), and Anglo-American universities in developmental states 
including Israel, Singapore and Taiwan, constitute intermediate cases. 


Conclusion 


The Northeast Asian life sciences have frequently been portrayed as particularly 
hospitable to research fraud and falsification. We have traced the perception and 
at times reality of misconduct to liberal research regimes and illiberal lab cul- 
tures that are themselves products of developmental states and German models of 
higher education, and have thereby made at least three important contributions 
to the contemporary sociological literature. 

First, we have identified the political and economic underpinnings of the 
expansion of the Western life sciences (Schott 1993; Drori et al. 2003; Schofer 
2004; Owen-Smith and McCormick 2006). Late industrializing countries are 
taking advantage of ethical and political controversies in their advanced indus- 
trial counterparts by offering funding and freedom to scientists who publish 
high profile articles in prestigious journals. Second, we have illuminated the 
shifting coalitional basis of the developmental state (Evans 1995; Chibber 2003). 
While public officials in late-developing countries have traditionally pursued 
mid-technology exports, and have therefore allied with industrial enterprises 
and engineers, they are increasingly drawn to basic research, and therefore em- 
brace scientists and universities. Finally, we have identified the institutional 
underpinnings of scientific scandal (Zuckerman 1977; Hackett 1994). While 
developmental state elites offer the authors of blockbuster articles pecuniary and 
non-pecuniary rewards, and thereby give their scientists an incentive to evade 
“the mores of science” (Merton 1964:660), German models of higher education 
discourage candor and criticism, and thereby offer unscrupulous scientists the 
opportunity to take advantage of the incentives in question. Asia's purported 
scientific pathologies are therefore products not of Eastern values—which are 
no less hegemonic in Taiwan and Singapore than in China and Korea—but of 
Western institutions adapted to Eastern circumstances. 

Our goals are ultimately no less programmatic than substantive, however, 
and include the fertilization of boundary-crossing by the next generation of 
sociologists. Policymakers throughout the developing world are responding to 
low-wage competition by facilitating high technology research, and their ef- 
forts offer sociologists of science, as well as sociologists of development, fertile 
theoretical terrain. We therefore hope that by exploring the origins, diffusion 
and consequences of liberal research regimes and illiberal lab cultures, we have
succeeded in developing a compelling account of scientific misconduct and in 
opening the door to a productive dialogue between sociologists of science and 
sociologists of national development in the years to come. 
Notes




A developmental state is designed to exploit (and thereby overcome) the so-called
advantages of backwardness (Gerschenkron 1962; Bendix 1967; Dore 1989) 
by importing and adapting foreign technology to local needs (e.g., export and 
employment growth) and conditions (e.g., capital scarcity, labor surplus). Weberian 
bureaucracies (Evans and Rauch 1999) or pilot planning agencies (Chibber 2002) 
play the role of directing, while vocational schools and training programs (Dore 1977, 
1989) and state-owned banks (Woo 1991) provide necessary levels of skill and capital. 
We use the terms “liberal” and “illiberal” in an empirical rather than a normative 
manner. A liberal regime gives agents freedom to carry out the research they want on
their own terms; an illiberal regime places limits on research goals or procedures. See, 
e.g., Gomez Mera (2007). 


The scandal began in June of 2005 when a television newsmagazine received an 
anonymous tip from a former lab insider who alleged that Hwang had obtained some 
of the original eggs not from anonymous donors, as he had stipulated, but from junior 
members of his own team, in contravention of research norms, and had falsified his 
results as well. The producers initiated a sting operation that turned up more damning 
evidence of misconduct and aired their allegations of improper donations—to widespread 
opprobrium from Hwang’s many supporters—on November 22. Hwang conceded the 
charges but defended the scientific integrity of his work and vowed to carry on. But
the controversy would not go away. Well-documented charges of falsification began 
to circulate on the web. Seoul National University announced an investigation. The 
newsmagazine aired a second segment including hidden camera interviews with a lab 
member admitting to doctoring photographs at Hwang's direction. And by early 2006 
Science had retracted both papers. 


By conditioning public financial support on private export performance, planners and 
bureaucrats of developmental states encouraged or even compelled their interlocutors 
in the private sector to conquer foreign as well as domestic markets for a wide array 
of “mid-technology” products—including steel, petrochemicals, transportation 
equipment and consumer electronics—by exploiting their comparative advantage 
in high quality, low cost labor (Amsden 1989; see also Dore 1977; Koo and Kim 
1992; Kim 1993; Evans 1995; Gereffi 1989; Lie 1998; Chibber 2002; Kohli 2004; 
McMichael 2004). The Northeast Asian NICs eventually gained particular proficiency 
in the manufacture and export of labor- and scale-intensive goods in the 1970s and 
1980s (Mytelka 2004). They imported or licensed foreign technology, exported a 
wide array of capital, intermediate and consumer goods (including machine tools, 
textiles and apparel, consumer electronics, steel, petrochemicals and transportation 
equipment), and moved up the so-called product cycle.


Nor are public attitudes toward HESCR exogenous. To take but the most 
prominent example, Hwang did not sit by passively when his compatriots 
debated the ethics of HESCR. He actively contributed to the debate (Choe 
2006) and enrolled powerful allies in his campaign. 


Proponents of therapeutic cloning hope to eventually use cloned stem cells to produce 
patient-specific replacement tissues or organs. 


Hoffman's “restrictive” category includes countries with “no established policy” as 
well. We exclude low-income countries unlikely to have HESCR capacity from our 
analysis in order to avoid conflating restrictive policies and non-existent policies. 
7. Correlation analysis performed by the authors and available upon request; data on
public contribution to R&D spending from Duga and Studt (2005). 


8. For example, Johns Hopkins has established a branch campus in Singapore 
designed to “import American-style medical care into an Asian culture, 
perform groundbreaking research on diseases prevalent in the region, and create 
academic programs to raise the technical expertise of Singaporean doctors and
nurses.” (McMurtrie 2000)

9. Japan's liberal HESCR policies merit additional commentary. On the one hand,
Japan is an upper-income country and therefore should regulate rather than
encourage HESCR. On the other hand, Japan features a developmental state 
and is therefore consistent with our underlying hypothesis. In fact, the existing 
literature asks “why Japan, immensely rich with more than a century of universal 
education and the world’s second largest economy, performs more like a modestly 
middle ranking country” in the sciences (French 2001:6). We not only trace “the 
underperformance of the world’s second richest country in scientific matters” to a 
development model rooted in “imitation rather than innovation” (Drucker 1987: 
933) but treat Japan's liberal HESCR regime as an effort to abandon the former 
for the latter in the 21st century. In short, Japan not only performs but behaves
“like a modestly endowed middle ranking country” in terms of scientific innovation,
and the behavior is an effort to transcend the performance—both of which are rooted 
in political and academic institutions more common to middle-income countries. 
Why Do People Engage in Corruption? The Case of Estonia


Margit Tavits, Washington University, St. Louis 


This study uses survey data for 2004 on the general public (N = 788) and public 
officials (N = 791) in the young post-communist democracy of Estonia to examine 
individual-level determinants of corruption. The results indicate that both public 
officials and citizens are more likely to engage in corruption when they do not define 
corruption as wrong, and when they perceive that corrupt behavior is widespread 
among their peers. This social learning effect becomes statistically insignificant for 
those citizens who are extorted. The results provide no support for the most common 
argument on corruption and compliance—that people are more likely to engage in 
corruption when they are distrustful of their fellow citizens or of government. 


One of the greatest challenges that newly democratized countries face is how 
to create well-functioning and clean governments. Post-communist Europe is 
not an exception. To the contrary, economic and political change in the region 
created ample opportunities for corrupt exchange. The general public in post- 
communist countries believes, and scholars agree, that corruption is widespread 
(Karklins 2005; Miller, Grodeland, and Koshechkina 2001; Rose and Mishler 
2007; Schmidt 2007). At the same time, the causes of corruption, especially on the 
individual level, are not well understood. What makes some public officials more 
likely to engage in corruption than other officials? Why do some citizens decide to 
offer bribes and others not? Corruption is ultimately the direct result of decisions, 
choices and behavior at the level of the individual, but the growing research has 
largely focused on system-level and institutional explanations of corruption. One 
can restructure institutions or political systems, but if individual-level motivations 
for corrupt behavior are not understood, the restructuring may not be effective. 
This study uses survey data on the general public and public officials from 
the post-communist democracy of Estonia to explore individual-level causes of 
corruption. The study makes two distinct contributions to the existing literature. 
First, it offers a novel perspective on understanding corruption. I show that one 
of the most common explanations in social science research on compliance and 
corruption (political and social trust) is not significantly related to an individual's
decision to engage in corrupt exchange. Further, I follow the social learning theory 
(Akers 1998), developed to explain various sorts of deviant behavior, to argue that 
for both public officials and citizens, the decision to engage in corrupt behavior 
corresponds with positive or neutral definitions of corruption and modeling/
imitating similar behavior by others. Given that the study of corruption tends to 
be dominated by economics and political science approaches while sociological 
theorizing and research on corruption are rare (You and Khagram 2005), the 
results of this study should encourage future research on corruption that crosses 
disciplinary boundaries. 

Second, the study not only establishes that the social learning theory explains 
corrupt behavior better than trust-based arguments, but also provides a refinement 
of the social learning theory by introducing a cost-benefit calculation into an in- 
dividual’s behavior. Namely, the higher the potential benefits of corrupt behavior 
(e.g., when a person has been extorted to pay a bribe) and the lower the risk of 
getting caught, the more likely people will engage in corruption regardless of
their socially learned predispositions. Overall, the study contributes to our
understanding of the processes underlying corruption, to the development of the social
learning theory, as well as to the literature on trust, social capital and compliance. 


Causes of Corruption 


Corruption is usually understood as a “misuse of public office for private
gain.” (Kunicova and Rose-Ackerman 2005; Rose-Ackerman 1999; Sandholtz
and Koetzle 2000; Treisman 2000) Although corruption refers most often to the
behavior of public officials, citizens’ decision to pay bribes or engage in corrupt
behavior in some other way feeds the corrupt system.

Literature on both the causes and consequences of corruption is voluminous. 
There is substantial evidence that corruption inflicts serious costs on the economy
in terms of slowing growth and increasing inequality, on democracy in terms of
decreasing government effectiveness, and on political stability and legitimacy
(Doig and Theobald 2000; La Porta et al. 1998; Mauro 1995; Rose, Mishler and
Haerpfer 1998; Seligson 2002). Given these negative consequences, researchers
have tried to explain what causes variation in the (mostly perceived) level of
corruption. The potential causes of corruption include the wealth of a country,
level of democracy, the size of unofficial economy, political institutions, political
stability, trade openness, public sector recruitment and wages, income inequality,
and voter turnout (Chang 2005; Gerring and Thacker 2004; Kunicova and
Rose-Ackerman 2005; La Porta et al. 1998; Mauro 1995; Montinola and Jackman
2002; Paldam 1999; Treisman 2000; You and Khagram 2005). Because these
explanations focus almost exclusively on system-level variables, they tell us little
about the individual-level motivation to engage in a corrupt exchange.

To be sure, a few individual-level studies of corruption do exist. Studies have 
considered why some people are more likely than others to think that corruption 
is a great problem (Davis, Camp and Coleman 2004). Other studies have considered
the consequences of corruption, i.e., how perceived corruption or actual encounters
with it influence people's perception of the legitimacy and performance
of government (Mishler and Rose 2001; Seligson 2002; Uslaner and Badescu
2003, 2004). Additionally, Miller, Grodeland and Koshechkina (2001) use survey 
data from Eastern Europe and provide descriptive information on corruption that 
citizens experience in their daily encounters with public officials. The goal of our 
research is to complement the existing research by developing and testing a theory 
of the causes of corruption on the individual level. 


Individual-level Motivation 


The literature on compliance with rules provides one potential explanation for 
why people engage in corruption. The earlier research focused on the fear of 
punishment as a factor that deters non-compliance. The more recent literature 
has extended the basic deterrence model by treating compliance as a collective 
action problem (Levi and Stoker 2000). According to this approach, all members 
of society gain if everybody complies with socially beneficial laws, but at the same 
time each individual faces incentives to cheat. Because compliance is equivalent 
to a cooperative solution to the collective action problem, the factors that help to 
sustain cooperation also promote compliance (Scholz 1998). 
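
The incentive structure described above can be illustrated with a small numerical sketch. The payoff function and all numbers below (group size, endowment, multiplier) are hypothetical assumptions chosen only to reproduce the logic of a collective action problem; they are not taken from the compliance literature cited here.

```python
def payoff(complies: bool, others_complying: int, n: int = 10,
           endowment: float = 1.0, multiplier: float = 3.0) -> float:
    """Payoff to one member of an n-person group (hypothetical parameters).

    A member who complies gives up the endowment; every contribution is
    multiplied and shared equally among all n members, compliant or not.
    """
    contributors = others_complying + (1 if complies else 0)
    shared_benefit = multiplier * endowment * contributors / n
    kept = 0.0 if complies else endowment
    return kept + shared_benefit


all_comply = payoff(True, others_complying=9)    # everyone complies
all_cheat = payoff(False, others_complying=0)    # nobody complies
lone_cheat = payoff(False, others_complying=9)   # one cheater among compliers
lone_comply = payoff(True, others_complying=0)   # one complier among cheaters

# Universal compliance beats universal cheating, yet cheating pays more
# for the individual whatever the others do.
print(all_comply, all_cheat, lone_cheat, lone_comply)
```

Under these assumed numbers, each member does better by cheating regardless of what the others do, even though everyone is better off under universal compliance than under universal cheating. This is the tension that trust is argued to resolve.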

One such factor is trust. Both sides to a social interaction cooperate as long as 
the other is perceived to be trustworthy (Scholz 1998). To use Scholz’ example, 
individuals joining a sports club pay membership fees and expect to receive the 
benefits of membership. However, these benefits will only materialize if other 
members also contribute and the club leadership does not steal the funds. Given 
this, it is rational to pay one’s fees only if one trusts the other members and the 
leadership to keep their end of the bargain. 

In large collectives (such as entire societies) it is difficult to determine whether 
the other party in a given exchange is trustworthy. In these cases, the theory argues, 
people use trust heuristics, i.e., generalized trust in others obtained from prior 
experience with beneficial collective action (Scholz 1998). Empirical research 
on compliance has shown that the more individuals trust government and their 
fellow citizens, the more likely they will comply with government demands and 
regulations including paying taxes, accepting court authority and complying with 
the military service requirement (Levi and Stoker 2000; Tyler 1990). Generalized 
trust, including trust in political institutions, increases tax compliance more sig- 
nificantly than fear of punishment (Scholz 1998). 

Non-corruption is a form of compliance. It can be conceptualized as a collec- 
tive action problem in which everyone would benefit if all cooperated. At the same 
time, each individual faces strong incentives to cheat, for example, by offering 
and accepting bribes. When framing corruption as a collective action problem, 
one would expect that if people (both public officials and citizens) trust govern- 
ment to be fair (non-corrupt) and trust other people to behave fairly, they would 
reciprocate and also behave fairly, i.e., to not engage in corruption for personal 
gain. Trust, thus, becomes the basis on which non-corrupt exchange is sustained. 
A few studies on corruption have argued that trust matters. However, these
studies use aggregate (country-level) analysis and have not focused on the causal 
mechanisms at play. Using country-level measures, Uslaner has shown that the level 
of trust decreases corruption. To account for this finding, he argues that trust reduces 
corruption via social bonds: “trust leads to empathy with others—and thus a respect 
for the law.” (Uslaner 2004:10; see also Putnam 1993) This mechanism, however, is 
not explained in much detail. Seligson (2002), using individual-level data, finds a 
correlation between corruption and feelings of trust in a handful of Latin American 
countries. But he argues that corruption leads to mistrust. Mishler and Rose (2001) 
reach a similar conclusion by considering perceptions of corruption in Eastern 
Europe (see also Uslaner and Badescu 2003). Davis, Camp and Coleman (2004) 
find that social mistrust is associated with increased perception of corruption in 
Chile, Costa Rica and Mexico, but do not focus on the causal mechanism. In sum, 
the individual-level studies on corruption inform us about an empirical correlation 
for which the literature on compliance provides a plausible theoretical explanation. 


Social Learning


The social learning theory (Akers 1998), developed to explain deviant behavior,
offers an alternative explanation of individual motivation to engage in corrupt 
behavior. The basic mechanism of the social learning theory works as follows: 
behavior is acquired and sustained (1) through adopting definitions (evaluations
of the behavior as good or bad) via differential association with one’s peers (friends,
family, colleagues and civic organizations), (2) through imitating such behavior
by peers, and (3) through the positive reinforcement provided by past rewards for
such behavior (Akers 1998). Given the overlapping and mutually reinforcing rela- 
tionships between these factors, the causal order between them is not determined 
(Akers 1998; Lanza-Kaduce et al. 1982). 

Because the measures of different concepts of social learning theory are likely 
to overlap empirically (Akers et al. 1979; Krohn et al. 1985), I focus here on key 
variables that can be clearly distinguished: definitions and models of behavior (or 
imitation). Definitions constitute normative attitudes toward certain behavior. 
The more individuals define a behavior as good (positive definition) or as justified 
(neutralizing definition) rather than bad (negative definition) the more likely 
they are to engage in it (Akers 1998). Empirical studies have found a strong and 
consistent relationship between individuals’ definitions and attitudes toward the
behavior in question and misbehavior in a wide variety of contexts such as sub- 
stance abuse, sexual aggression, white collar crimes and police misconduct includ- 
ing police corruption (Akers 1998; Krohn et al. 1985). Furthermore, in their study 
on corruption in Eastern Europe, Miller et al. (2001) argue that condemnation of
bribery is associated with increased resistance to both accepting and paying bribes. 

Social behavior is also acquired and sustained through imitation or modeling 
of others’ behavior (Akers et al. 1979). If people perceive that a behavior is widespread
and that it is approved, then they are more likely to engage in it. Definitions
and imitation are mutually reinforcing in their effect on an individual’s behavior. 
By using others’ behavior as a model, a person learns evaluative definitions of 
behavior as good or bad, right or wrong. To an extent, these definitions are them- 
selves verbal and cognitive behavior, which can be directly reinforced and can act 
as discriminative stimuli for other behavior (Krohn et al. 1985). Again, empirical 
research confirms that models and approval are the major predictors of problem 
behavior (Jessor and Jessor 1977). 

Applying these arguments to situations of corruption, a person is expected to 
engage in corrupt behavior if he or she does not define corruption as morally or 
situationally wrong, but rather as a justified and acceptable mode for exchange; 
and if he or she has been exposed to corrupt behavior or at least perceives that 
such a behavior is widespread and, thus, approved. 

The following section presents empirical tests of the arguments based on trust 
heuristic and social learning. The empirical tests also account, as much as data 
limitations permit, for the most common microeconomic explanation of corrup- 
tion—incentive theory (see Andvig and Fjeldstad 2001 for a detailed description 
of this research tradition). This approach focuses on the cost-benefit analysis of 
a particular individual. More specifically, previous research on incentive theory 
points at the importance of factors such as wage incentives, job security and 
promotion in curbing the motivation of public officials to engage in corruption. 
Similarly, penalties and sanctions, excessive public sector regulations, and other 
structural incentives may influence the level of corruption. 


Empirical Analysis: Data and Measures 


The analysis is based on two nationally representative datasets—one covering citi- 
zens, the other public officials in the post-communist democracy of Estonia. To my 
knowledge, these surveys contain the most comprehensive individual-level information
on corruption that is currently available. Estonia has scored 5.5-6.7 (out
of 10, which is the minimum level of corruption) on Transparency International’s
corruption perception index for the past seven years. Most West European countries
have lower levels of corruption (scoring higher than 8 on the corruption perception
index), but some, such as Italy and Greece, have higher levels of corruption
(i.e., lower scores). Most less-developed
countries in Latin America, Africa, Asia and the Middle East are more corrupt 
than Estonia. Many countries in the post-communist region (e.g., Czech Republic, 
Hungary, Latvia, Lithuania, Slovakia and Slovenia) occupy the middle range of 
values on the corruption perception index, and are therefore comparable to Estonia. 

Finding a good measure for an individual’s engagement in corruption is a 
non-trivial task. In order to reduce potential self-reporting bias necessarily pres- 
ent in any survey, all respondents were guaranteed anonymity. Furthermore, the 
interviewees were told that the survey was conducted to get information on public 
service provision rather than on corruption, and the questions on the latter were 
embedded within questions about other aspects of public service. Yet given the
sensitivity of the topic, asking directly whether somebody has ever engaged in 
corruption is still most probably going to provide an unreliable measure. This is 
especially so in the case of public officials. In the case of ordinary citizens such an 
engagement may occur as a result of extortion, but public officials have to take 
full responsibility for involvement in corruption. Given this, it is preferable to use 
more subtle questions in order to get at the extent to which public officials are 
susceptible to corruption. 

The measure for Corruptibility was obtained by presenting the respondents 
with the following scenario: “Imagine that you are an official who decides upon 
whether or not to give financial support for certain projects. One of the applicants 
for financial support offers you a trip to a summer resort in case you decide in 
favor of their project. To what extent do you agree with the following statements? 
(1) I would decide in favor of the project because it would be impolite to refuse
the offer; (2) I would decide in favor of the project because there is little chance
of getting caught; (3) I would decide in favor of the project because it would be
a good economic gain; (4) I would decide in favor of the project because such
a counter-favor would be harmless: nobody would suffer; (5) I would decide
in favor of the project because it is a rather widespread practice.” For every one
of these five options respondents could (3) strongly agree, (2) rather agree than 
disagree, (1) rather disagree than agree, (0) strongly disagree. Note that if the 
respondent strongly agrees (or rather agrees) with any of these statements, they 
agree to participate in a corrupt exchange. The excuse for agreement becomes 
irrelevant for current purposes. Thus, one way to measure the dependent variable 
for public officials is to create a dummy variable so that respondents completely 
agreeing, or agreeing rather than disagreeing, with any of these five statements 
are assigned “1,” and the rest are assigned “0.” Such a measurement distinguishes 
between those officials who are susceptible to corruption and those who are not, 
but it loses potentially relevant information about corruptibility. As an alternative, 
I simply averaged each respondent's answers across the five statements to arrive 
at a continuous measure of corruptibility. These measures have face validity. In 
terms of reliability, responses to a hypothetical scenario are expected to be less 
contaminated by socially desirable responses than asking for someone's direct 
involvement in corruption. 
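
The two codings of Corruptibility described above can be sketched in a few lines. This is a minimal illustration, not the author's actual procedure: the function names are hypothetical, and the treatment of a respondent's five answers as a complete list of integers from 0 to 3 (with no missing values) is an assumption.

```python
from statistics import mean


def _check(responses):
    """Assumes exactly five answers coded 0 (strongly disagree) to 3 (strongly agree)."""
    if len(responses) != 5 or any(r not in (0, 1, 2, 3) for r in responses):
        raise ValueError("expected five answers coded 0-3")


def corruptibility_dummy(responses):
    """1 if the respondent rather or strongly agrees (code 2 or 3) with any statement."""
    _check(responses)
    return int(any(r >= 2 for r in responses))


def corruptibility_score(responses):
    """Continuous alternative: the respondent's average across the five statements."""
    _check(responses)
    return mean(responses)


# Hypothetical respondents, invented for illustration only.
respondent_a = [0, 0, 1, 0, 0]   # never reaches "rather agree"
respondent_b = [0, 2, 1, 3, 0]   # agrees with statements 2 and 4

print(corruptibility_dummy(respondent_a), corruptibility_score(respondent_a))
print(corruptibility_dummy(respondent_b), corruptibility_score(respondent_b))
```

The example shows what the dummy discards: it records only whether the respondent agrees with any statement, whereas the average also reflects how many statements are endorsed and how strongly.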

In the case of citizens I had to rely on more direct questions about their 
involvement in corruption. Yet, the self-report questionnaire technique has been 
argued to produce relatively accurate measures of corrupt behavior (Seligson 
2006). Indeed, prior research has consistently shown that such a technique is reliable
and valid in measuring sensitive topics such as delinquent, drug and drinking
behavior (see Akers et al. 1983; Thornberry and Krohn 2000). Furthermore,
an individual's involvement in corrupt deals can easily be blamed on public
officials, especially if the individual has been extorted. Given the possibility of
blame shifting, respondents should be less reluctant to report their personal
involvement, enhancing the reliability of the measure.

Specifically, in order to measure citizen involvement in corruption, the respon- 
dents were asked “Have you ever paid a bribe or offered a gift to a public official 
in order to influence the provision of a public service?” “Yes” responses for this 
question were coded 1, and “no” responses 0 to get the variable Ever paid a bribe. 
The variable intends to capture an individual’s behavior, yet there is no definitive
way to validate the response. Because hard data are not available, one has to take
the survey results as plausible approximations of reality. Several steps taken (using
subtle rather than explicit ways to inquire about public officials’ involvement in
corruption and performing robustness tests with slightly different measures)
alleviate some of the validity concerns.

Trust in other people (Generalized trust) was measured for both citizens and public
officials by asking a standard question on a 4-point scale (where “4” = “very
trustworthy”): “How trustworthy are most people?” Trust in government was measured
by taking an average of the responses to these questions: (1) “How often do the
Estonian courts guarantee a fair trial?” (where “4” = “always”); (2) “How much do
you trust Estonian police?”; (3) “How much do you trust that politicians will do
what is best for the country?” (where “4” = “trust completely”).

For the sake of increasing reliability, a series of questions were used to measure 
the concepts of “definitions” and “imitation.” The questions were based on several 
hypothetical third-person scenarios (or vignettes) to present respondents with re- 
alistic examples of a variety of corrupt behaviors (see Chappell and Piquero 2004). 
The variable measuring definitions is labeled as Perceived acceptability of corruption 
and the one measuring imitation as Perceived pervasiveness of corruption. The former 
was measured by an average of the respondent’s agreement on a 4-point scale (where
“4” = “strongly disagree”) to the following: (1) “If a public official accepts a gift for a
personal favor, then it is corruption;” (2) “If a public official accepts a bribe or a gift
in exchange for avoiding a fine or for fast-tracking bureaucratic processing, then it
is corruption;” (3) “If a person is employed thanks to personal relationships with
a senior public official, then it is corruption;” (4) “If a public official sells classified
information, then it is corruption.” This measure has considerable face validity
because the questions ask directly about how people define corruption.

In order to measure Perceived pervasiveness of corruption, respondents were presented
with a variety of behaviors and asked (on a 4-point scale where “4” = “very
common”): “To what extent do you think this is common behavior ...?”
For public officials the question ended with “among public officials in Estonia,”
while for the general public with “in Estonia.” Thus, especially for public officials,
the questions tried to prompt respondents to think about what is common among
their peers. The situations included the following: (1) A driver offers a police
officer a good or a service by his or her firm at a discount price in order to avoid
a speeding ticket; (2) An entrepreneur offers the headmaster of an elite public
school a trip to a summer resort for admitting his or her son to the school; (3)
A public official uses a government provided car for private purposes; (4) A civil
servant offers, for a fee, lectures in the area of his or her work-related expertise;
(5) An entrepreneur calls up a public official who he or she knows from previous
personal contacts, and asks to fast-track the processing of his or her file; (6) An
entrepreneur offers a public official personal favors in return for a public contract;
(7) A public official buys goods on behalf of his or her institution from a
company owned by his or her relative; (8) A patient is moved up on a waiting list
for surgery because his or her brother is friends with the surgeon; (9) (only in the
case of public officials) An entrepreneur offers a public official a good or a service
by his or her firm. The final measure is the average of all responses. This measure
has face validity because the questions ask for the respondent's perception of how
widespread corrupt behavior is among his or her peers: if it is perceived as more
common and widespread, a person is more likely to imitate it.

I also test for some expectations of the incentive theory. Specifically, it has been
argued that salary level and satisfaction with the workplace might influence how
prone to corruption public officials are (Andvig and Fjeldstad 2001; Evans and
Rauch 2000). Therefore, I added two variables to the analysis of public officials
measuring the extent to which the respondent agrees (on a 4-point scale, where
“4” = “strongly agree”) that (1) “My job is secure and stable” (variable name Job
security) and (2) “I am satisfied with my salary and benefits” (variable name
Satisfied with salary).

In the case of citizens, I control for the possibility that they have been Asked
to pay a bribe,5 and I also include a variable indicating whether the person thinks
that a bribe influences the quality of the public service (variable name Bribe helps).6
Other factors derived from the incentive theory, such as long queues, costly
regulations or severity of punishment, cannot be included in the model because of
a lack of data. While objective measures of these variables are likely to be constant in
a single-country study, it is still possible that there are some differences across
specific sectors or organizations in the extent to which the same regulations
are followed. Unfortunately, the data collected for this project are not nuanced
enough to test for these possibilities directly.

Both analyses also include demographics (sex, age, ethnicity), socio-economic
variables (education, family income, job type, length of career), and region (capital
city vs. the rest of the country), as well as respondent’s Satisfaction with
personal welfare.7


Findings 


Definitions and descriptive statistics of all variables are provided in Table 1. The 
frequency distribution of the dependent variables shows that about 14 percent 
of the public officials would agree to participate in a corrupt activity and about 
16 percent of the respondents among the general public admitted that they have 
bribed an official. The highest simple correlation coefficient in the public officials’ 
dataset was .36 between Satisfied with salary and Family income. In the case of
the general public, the highest correlation coefficient was .39 between Bribe helps 
and Perceived pervasiveness of corruption. All other correlation coefficients were 
considerably smaller. The correlation coefficient between Perceived pervasiveness 
of corruption and Perceived acceptability of corruption was -.07 for public officials 
and -.26 for the general public. These results offer reassurance that the independent
variables measure separate concepts, and low variance inflation factors indicate that
multicollinearity is not a great concern.


Results for Public Officials 


Table 2 presents the results of the analysis of public officials. The two alternative 
models perform very similarly. The explanations based on generalized trust and 
trust in government fail to perform in the expected manner: neither of these
variables is significant in either model. These results did not change when the variables
were entered one by one rather than together or when the different components of
trust in government rather than the composite variable were used. This indicates
that the insignificant results are not due to potentially low reliability of the index
measure of trust in government. Both variables remained insignificant even
in a bivariate analysis. Thus, there is no evidence that trust, either in strangers or 
in government, is directly associated with corruptibility. 

At the same time, the results provide considerable support for the variables 
derived from the social learning theory. Perceived pervasiveness of corruption 
is a significant predictor of the corruptibility of a public official in both models. 
Consider Model A: for a unit increase in the Perceived pervasiveness of corruption 
score (e.g., from an average of “not at all common” to “not very common”), the
odds of agreeing with a corrupt deal increase by a factor of 3.5. Thus, moving from 
the minimum to the maximum, somebody who thinks that corrupt activities are 
very common is about 10 times more likely to be corruptible than somebody who 
thinks that corrupt activities are not at all common. Further, corruptibility is also 
significantly related to perceived acceptability of corruption: for a unit increase in 
this variable, the odds of agreeing to participate in a corrupt deal increase about 
2.4 times. Again, a public official who perceives the different corrupt activities 
as completely acceptable is about eight times more likely to be corruptible than 
an official who perceives the same activities as not acceptable at all. These results 
are corroborated by Model B that uses the continuous measure of corruptibility. 
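
The odds interpretation used in this paragraph follows from the form of the logit model: a coefficient b corresponds to an odds ratio of exp(b) for a one-unit increase in the predictor. The following is a minimal Python sketch of that conversion, not the original estimation code. It uses the Perceived acceptability coefficient as printed in Table 2 (.867) and the odds ratio of about 3.5 quoted in the text; the function names are illustrative assumptions.

```python
import math


def odds_ratio(b: float, delta: float = 1.0) -> float:
    """Multiplicative change in the odds for a `delta`-unit increase in a predictor."""
    return math.exp(b * delta)


def coefficient_from_odds_ratio(or_value: float) -> float:
    """Inverse conversion: the logit coefficient implied by a one-unit odds ratio."""
    return math.log(or_value)


# Coefficient for Perceived acceptability of corruption as printed in Table 2.
b_acceptability = 0.867
print(f"Odds ratio per unit increase: {odds_ratio(b_acceptability):.2f}")

# The text quotes an odds ratio of about 3.5 for Perceived pervasiveness;
# the implied coefficient is recovered here for illustration only.
print(f"Implied coefficient: {coefficient_from_odds_ratio(3.5):.2f}")
```

Exponentiating the rounded coefficient gives roughly 2.38, in line with the reported odds ratio of 2.37; the small gap presumably reflects rounding of the published coefficient.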

Model A can be used to further illustrate the effects of the social learning 
variables on the probability of being corruptible. Figure 1 presents predicted prob- 
abilities of corruptibility at different levels of perceived pervasiveness and accept- 
ability of corruption, and corresponding 95 percent confidence intervals. Clearly, 
as the values of Perceived pervasiveness of corruption and Perceived acceptability of 
corruption increase, the predicted probability of being corruptible also increases. 
Given that according to the theory, the effect of social learning variables is mutu- 
Table 2: Logit and OLS Models of Corruptibility (Public Officials)

                                         Model A: Logit            Model B: OLS
Variables                                b (z-ratio)      OR       b (t-ratio)
Perceived pervasiveness of corruption    …                …        …
Perceived acceptability of corruption    .867 (…)         2.37     …
Generalized trust                        …                …        …
Trust in government                      .16 (.60)        1.17     -.02 (-.39)
Job security                             …                …        …
Satisfied with salary                    -.23 (-1.49)     .79      -.04* (-1.93)
Satisfaction with personal welfare       …                …        … (1.44)
Sex (female)                             .28 (…)          1.32     -.02 (-.66)
Education (higher)                       …                …        …
Ethnicity (non-Estonian)                 …                2.99     .20 (2.15)
Family income                            …                …        …
Job type (a)
  Manager                                …                …        .12 (1.14)
  Specialist                             -.53 (…)         .59      …
  [label illegible]                      …                …        …
Length of career                         -.17 (-1.61)     .85      -.02 (-1.05)
Capital city                             …                …        -.04 (…)
Constant                                 -8.29 (-4.76)             -.46* (-1.69)
Wald χ²                                  …
Pseudo-R²                                …
R²                                                                 …

[… = entry not legible in the source; significance markers are shown only where legible.]

Notes: (a) The reference category is “other,” which includes, but is not limited to,
political appointees, consultants and advisors. Dependent variable for Model A is the
dichotomized and for Model B the continuous measure of Corruptibility; z-ratios or
t-ratios in parentheses. *p < .10 **p < .05 ***p < .01 (two-tailed tests).
N = 791
Figure 1. Probability of Corruptibility (Public Officials)

[Two panels, each plotting Pr(Corruptibility) on the vertical axis against a 1 to 4
scale: Perceived Acceptability of Corruption (left) and Perceived Pervasiveness of
Corruption (right).]

Notes: The solid line indicates the predicted probabilities of being corruptible at
different levels of perceived acceptability of corruption (left panel) and perceived
pervasiveness of corruption (right panel). The dashed lines indicate the corresponding
95% confidence intervals.


ally reinforcing (Akers 1998), it is also informative to calculate the combined 
effect of perceived pervasiveness and acceptability of corruption on corruptibility. 
Indeed, when both variables are increased from their minimum values to their 
maximum values, holding other variables at their means or medians, the predicted 
probability of being corruptible increases considerably: from .01 to .81. 
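
The change from .01 to .81 is easier to compare with logit coefficients when it is expressed on the log-odds scale, which is the scale on which the model is linear. The sketch below converts the two predicted probabilities reported above into log-odds and back. It is an illustrative calculation under the standard logit link, not a reproduction of the original model; the helper names are assumptions.

```python
import math


def logit(p: float) -> float:
    """Log-odds corresponding to a probability p (0 < p < 1)."""
    return math.log(p / (1.0 - p))


def inv_logit(x: float) -> float:
    """Probability corresponding to a value x of the linear predictor."""
    return 1.0 / (1.0 + math.exp(-x))


# Predicted probabilities reported in the text for the combined change.
p_low, p_high = 0.01, 0.81

# Shift in the linear predictor implied by moving from p_low to p_high.
shift = logit(p_high) - logit(p_low)
print(f"Implied shift in log-odds: {shift:.2f}")

# Applying that shift to the low baseline recovers the high probability.
print(f"Recovered probability: {inv_logit(logit(p_low) + shift):.2f}")
```

Because the logit link is nonlinear, the same shift in log-odds would produce a different change in probability from a different baseline, which is why the predicted probabilities are reported with the other variables held at fixed values.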

As for other variables, the explanations derived from the incentive theory
appear significant. Specifically, officials who are satisfied with their pay are less
likely to engage in corruption, but those who think their jobs are safe and stable
appear more corruptible. Perception of high job security may capture the lack
of performance reviews and other types of monitoring of public officials. The
existence of such monitoring may decrease officials’ confidence in job security,
but also decrease the likelihood of engaging in corruption, as previous research
has shown (Rauch and Evans 2000).8

The respondent's ethnicity produces an interesting effect: non-Estonian officials
appear to be more corruptible than their Estonian counterparts. About 30 percent
of the Estonian population is non-Estonian. A majority of the latter are Russians or
Russian-speaking people from other former Soviet republics. The significant
coefficient on this variable underlines the importance of cross-cultural differences in the
level of corruption and corruptibility. Similar differences between nationalities have
been pointed out in previous research (Miller, Grodeland and Koshechkina 2001).9


Results for the General Public 


Table 3, Model C presents the results for the general public. Because the depen- 
dent variable is binary, I used logit regression to estimate the model. As was the 
case with public officials, the trust-based explanations of engaging in corruption 
are not supported by the results of the analysis: both generalized trust and trust 
in government remain insignificant. Again, these results are not altered when the
trust variables are entered in the models one by one or when the trust in government
variable is disaggregated into its constituent components. Both variables remain
insignificant even if the variables with the strongest effects (Asked to pay a bribe
and the social learning variables) are excluded from the analyses.
The variables of the social learning theory behave similarly to those in the
analysis of public officials: both perceived pervasiveness and acceptability of corruption
have the expected and statistically significant effect on the likelihood of having
paid a bribe. The effects remain significant even when controlling for extortion,
i.e., whether the respondents were asked to pay a bribe, which is the strongest 
predictor of having paid a bribe. At the same time, having been asked to pay is 
not a necessary or sufficient condition for having paid: only about 40 percent of 
the respondents who had ever been asked to pay a bribe stated that they actually 
had paid one. Similarly, 9 percent of those who said they had never been asked to 
pay still stated that they had paid. When interpreting the size of the effect, recall 
how the variable is measured: its empirical range is from 0 to .92 and represents 
an average of 12 public services for which a respondent may have been asked to 
pay a bribe. Substantively, when all other variables are at their average value, for 
a respondent who has never been asked to pay a bribe the predicted probability 
of having paid is .10. This probability increases to .16 for a respondent who has
been asked once, to .23 for one who has been asked twice, and to .91 when
the variable is at its maximum value. When this variable is excluded from the
analysis, the performance of the social learning variables improves considerably. 
Furthermore, the relationships stay as reported in Model C when the analysis is 
performed with only those who said that they had never been asked to pay (see 
also Model D in Table 3). This constitutes an even more direct test of the social 
learning variables as the choice of these people to avoid corruption or engage in 
it was entirely voluntary. The fact that the relationships survive these alternative 
tests offers additional confidence in the theoretical and empirical importance of 
the social learning variables.10
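
The construction of the extortion measure described above (an average over 12 public services, with an empirical maximum of .92) can be sketched in a few lines of Python. This is an illustrative reconstruction from the description in the text and notes, not the original data-processing code; the function names and the example response vectors are assumptions.

```python
from typing import Sequence

N_SERVICES = 12  # number of public services covered by the survey items


def asked_to_pay_index(responses: Sequence[bool]) -> float:
    """Share of the 12 services for which the respondent was asked to pay a bribe."""
    if len(responses) != N_SERVICES:
        raise ValueError(f"expected {N_SERVICES} responses, got {len(responses)}")
    return sum(responses) / N_SERVICES


def ever_asked_to_pay(responses: Sequence[bool]) -> int:
    """Dichotomized version: 1 if asked for at least one service, 0 otherwise."""
    return int(any(responses))


# Hypothetical respondents (illustration only, not survey data).
never_asked = [False] * 12
asked_once = [True] + [False] * 11
asked_eleven = [True] * 11 + [False]

for label, resp in [("never", never_asked), ("once", asked_once), ("eleven", asked_eleven)]:
    print(label, round(asked_to_pay_index(resp), 2), ever_asked_to_pay(resp))
```

Under this construction, being asked once corresponds to an index value of 1/12 and being asked twice to 2/12, and a respondent asked in 11 of the 12 services scores about .92, which matches the reported empirical maximum.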
Substantively, the effects of the social learning variables are also considerable.
Figure 2 presents the predicted probabilities of offering a bribe at different levels 
of perceived pervasiveness and acceptability of corruption, and the corresponding 
Table 3: Logit Model of Ever Paid a Bribe (General Public)

                 Model C                        Model D: With Interactions
Variables        b (z-ratio)    Odds ratio      b (z-ratio)    Odds ratio

[Table body not preserved in the source.]

Notes: (a) The reference category is “primary education.” (b) The reference category is
“unemployed.” Dependent variable is whether or not a respondent has ever paid a bribe;
z-ratios in parentheses. *p < .10 **p < .05 ***p < .01 (two-tailed tests).

N = 788
Figure 2. Probability of Ever Paid a Bribe (General Public)

[Two panels, each plotting Pr(Ever Paid a Bribe) on the vertical axis against a 1 to 4
scale: Perceived Acceptability of Corruption (left) and Perceived Pervasiveness of
Corruption (right).]

Notes: The solid line indicates the predicted probabilities of ever having paid a bribe
at different levels of perceived acceptability of corruption (left panel) and perceived
pervasiveness of corruption (right panel). The dashed lines indicate the corresponding
95% confidence intervals.


95 percent confidence intervals. When all other variables are held at their means 
(or medians), increasing the value of Perceived pervasiveness of corruption from its 
minimum to its maximum increases the predicted probability of offering a bribe 
from .04 to .23. A similar manipulation in the values of Perceived acceptability of 
corruption increases the predicted probability from .10 to .31. The combined effect 
of perceived acceptability and pervasiveness of corruption increases the predicted 
probability of having paid a bribe from .03 to .47. 

In sum, the empirical models demonstrate that public officials as well
as citizens are motivated by their perception of what is acceptable and
commonplace. At the same time, general views of government and other people as
trustworthy and fair remain inconsequential. These results are robust across a
variety of alternative estimations, including (1) using imputed data to replace the
non-trivial number of missing cases, (2) using alternative measures of dependent
and independent variables, (3) performing Wald tests of exogeneity and a Hausman
test of model specification in order to determine whether the results of the original
models presented here are accurate, and (4) using data from a repeat study conducted
in 2006. The results are available at http://tavits.wustl.edu.


Revising the Social Learning Theory: The Effects of Situational Constraints 


Our analyses show that the effect of the social learning variables on corruption 
is not trivial. Despite these strong effects, the relationships presented are not 
self-evident or obvious. That is, there may be situations where the social learning 
variables are not related to one’s decision to engage in corruption. This becomes 
clear when the individual-level motivations are considered in combination with 
incentive structures. Uncovering under what conditions social learning theory 
does and does not work offers an important refinement of the theory. 

Corruption as an illegal act involves both potential benefits and risks. Benefits 
can be defined as getting something as a result of a corrupt deal that one would 
otherwise not get, while risk can be defined as the probability of getting caught 
and being punished. Uncertainty of benefits and risks may increase the impor- 
tance of social learning variables. Benefits are uncertain when an individual cannot 
be sure whether engaging in corruption will have any desired effect. For example, 
an individual attempting to bribe an official with a goal of getting special treat- 
ment cannot be certain that the official accepts the bribe and delivers the favor. 
Similarly, risks are uncertain when it is not known how likely it is that one will 
get caught for engaging in corruption. In the absence of constraints that would 
make benefits and risks more or less certain, it is likely that people will rely on 
their socially learned behavior. Indeed, the social learning variables themselves may 
act as heuristics that help individuals estimate the likelihood of benefits and risks. 

However, when benefits are unlikely and risks are high, people should be less 
likely to engage in corruption regardless of their socially learned predispositions. 
For example, explicit statements by officials that attempted bribery will not be 
tolerated or evidence of vigorous prosecution of such attempts in the past lower 
the benefits and simultaneously increase risks. The flip side of this argument is 
that, when benefits are certain and risks are low (i.e., there is a structural incentive 
to behave in a corrupt manner), then perceived acceptability and pervasiveness of 
corruption may be less important in making the decision to engage in corruption. 
For example, we might expect that benefits are certain and risks low in situations 
where it is known that in order to receive a good or a service a bribe is needed or 
in situations where an official asks an individual to pay a bribe. In such situations 
one can be almost certain that the benefit will follow when the bribe is paid (or 
at least be certain that no benefit will follow if the bribe is not paid). Also, in the 
case of extortion, the risk of punishment falls mostly on the official. 

This argument can be tested directly with data for the general public using the
variable Asked to pay a bribe to group respondents by whether or not they have
been extorted into a dichotomous Ever asked to pay a bribe.11 We would expect
that those who have been extorted are less influenced by perceived acceptability
and pervasiveness of corruption in their decision whether or not to engage in
corruption than those who have not been extorted. Model D in Table 3 includes
interaction terms between the dichotomous measure of extortion and both social
learning variables. The results of the interactive model confirm the theoretical
expectations: perceived acceptability and pervasiveness of corruption significantly
influence bribe giving for the group of respondents who have not been extorted.
However, these variables have no effect on the bribe giving of those respondents
who have been extorted.

Specifically, the coefficients of Perceived acceptability of corruption and Perceived 
pervasiveness of corruption reported indicate the effect of these variables on having 
paid a bribe for people who have not been extorted (i.e., when the variable Ever 
asked to pay a bribe is “0”). Both of these effects are positive and highly significant 
statistically. However, calculating the conditional coefficients (marginal effects) 
of the social learning variables for those who have been extorted (i.e., when the 
variable Ever asked to pay a bribe is “1”) shows that neither variable significantly
influences the likelihood of ever having paid a bribe: the conditional marginal
effect for Perceived pervasiveness of corruption is .797 - .231 = .566 (z = 1.52) and
for Perceived acceptability of corruption .792 - .988 = -.197 (z = -.57). Extortion
significantly conditions the extent to which social learning variables influence 
people’s corrupt behavior. These variables guide the decision about whether or 
not to engage in corruption only when structural incentives to engage are absent. 
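
The conditional coefficients quoted above are the sum of a variable's main coefficient and its interaction coefficient when Ever asked to pay a bribe equals 1. The following Python sketch reproduces that arithmetic from the rounded values given in the text. The function names are illustrative assumptions, and the implied standard error is backed out from the reported z-ratio rather than taken from the original estimation output.

```python
def conditional_coefficient(b_main: float, b_interaction: float, moderator: int) -> float:
    """Coefficient of a predictor at a given value (0 or 1) of the dichotomous moderator."""
    return b_main + b_interaction * moderator


def implied_standard_error(coefficient: float, z_ratio: float) -> float:
    """Standard error implied by a coefficient and its reported z-ratio."""
    return abs(coefficient / z_ratio)


# Rounded values quoted in the text for Model D.
perv_main, perv_inter = 0.797, -0.231   # Perceived pervasiveness of corruption
acc_main, acc_inter = 0.792, -0.988     # Perceived acceptability of corruption

perv_not_extorted = conditional_coefficient(perv_main, perv_inter, 0)
perv_extorted = conditional_coefficient(perv_main, perv_inter, 1)
acc_extorted = conditional_coefficient(acc_main, acc_inter, 1)

print(f"Pervasiveness, not extorted: {perv_not_extorted:.3f}")
print(f"Pervasiveness, extorted:     {perv_extorted:.3f}")
print(f"Acceptability, extorted:     {acc_extorted:.3f}")
print(f"Implied SE (pervasiveness, extorted): {implied_standard_error(perv_extorted, 1.52):.3f}")
```

With the rounded inputs the acceptability sum comes to -.196 rather than the -.197 given in the text; the difference presumably reflects rounding of the published coefficients.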

In sum, while the effects of the social learning variables on the probability of 
engaging in corruption are very strong, the argument made is not a truism, but 
an important addition to current scholarship on corruption that tends to be 
dominated by structural explanations, as well as to social learning theory, which 
has not accounted for those structural constraints. This revision of the social 
learning theory helps us understand when and how socially acquired behavior 
could be altered and thereby offers clear practical implications. If the argument
about benefits and risks is correct, one would expect that various policies and
rules increasing the risks of getting caught and decreasing the likelihood that 
public officials accept a bribe could be implemented to overcome social learning 
effects and reduce incentives to engage in corruption. Data limitations in the 
current study do not allow probing the effect of additional structural features on 
constraining the explanatory power of the social learning theory. However, the 
results presented here open an important and fruitful avenue for future research 
and call for further investigation into the interaction effects between institutional 
constraints and individual psychology on people’s behavior. 


Conclusions 


Research on system-level reasons for corruption is a growth industry. However, it
is the people who are corrupt, not the countries. This study has therefore taken
an individual-level approach and tackled the question of why public officials and
citizens engage in corrupt exchange in a context where the problem tends to be
most acute: a young post-communist democracy. I have argued that willingness to
engage in corrupt behavior is more likely when one does not define corruption as
morally or situationally wrong. Furthermore, a willingness to engage in corruption
is enhanced by modeling others: the more widespread the corrupt behavior among
someone's peers, the higher the likelihood that he or she will imitate such behavior.
This social learning theory is especially powerful when individuals do not face
structural constraints or incentives to engage in corrupt behavior, while the presence of
such constraints and incentives weakens the effect of the socially learned
predispositions. At the same time, the analyses provide no support for another explanation
for engaging in corrupt behavior: that people who are less trusting of their fellow
citizens and of government institutions are more likely to engage in corruption.

These findings have several important implications, related to both policy and 
theory. First, the results suggest that there are significant interpersonal differences 
in willingness to engage in corrupt exchange. Thus, the level of corruption is not 
exclusively a system-level phenomenon: within similar systems some people are 
more prone to corruption than others. This, in turn, means that countries with 
endemic corruption may face a vicious circle and consequently an even harder task 
of controlling corruption than previously assumed. The results also suggest that 
intervention strategies designed to reduce corruption by street-level
bureaucrats could target the social processes through which public officials and citizens
learn that corrupt behavior is acceptable. The focus of such an effort might be on 
raising awareness of acceptable and unacceptable (or illegal) behavior. The results, 
however, also showed that structural incentives constrain the extent to which 
social learning influences corrupt behavior. The significant policy implication 
from the revised social learning theory is that implementing rules that decrease the 
likelihood of obtaining benefits from corrupt exchange and increase the risks of 
getting caught may help to overcome social learning effects and reduce corruption. 

Last but not least, this study also contributes to the discussion about the rela- 
tionship between corruption and trust. The current study has shown that when 
engagement in corrupt exchange is voluntary, or even unsuspecting, then the extent 
to which a person trusts other people or government has no effect on his or her 
willingness to engage in corrupt activity. This does not invalidate the findings that 
when extorted, people may lose trust in their government (Seligson 2002), or when 
being skeptical about one’s government and the trustworthiness of other people, one 
also tends to perceive others (including government) to be more corrupt. 


Notes 


1. For a more complete survey of literature on corruption, see Treisman (2007), Gerring
and Thacker (2004), Rose-Ackerman (1999) and Schmidt (2007).


2. The surveys were conducted in December 2004 by the Estonian polling firm Emor
(Estonian Ministry of Justice 2005). The survey of the general public used face-to-face
interviewing and a national probability sample (1,002 respondents). The survey of
public officials used computer-assisted web interviewing and a representative sample of
respondents from the following public sector institutions: ministries, county governments,
constitutional institutions, agencies and boards, institutions of law and order, political
officials, local governments (901 respondents). Such self-administered surveys have an
advantage when sensitive questions are asked (Leeuw, Hox and Kef 2003).


3. All of these situations, and the ones presented for measuring perceived pervasiveness
of corruption, are classified as “criminal” according to the Estonian Penal Code and 
Anti-Corruption Act. 


4. Although several steps were taken to maximize the validity of the measures used,
because both behavior (the dependent variable) and attitudes (the independent
variable) are measured with survey data, endogeneity cannot fully be avoided. This
limitation certainly applies to most survey-based research, including the existing
studies of social learning and those of consequences of corruption, where behavior
and attitudes are both measured by survey items simply because corrupt behavior
is practically impossible to measure directly (see Akers et al. 1979; Boeringer et al.
1991; Canache and Allison 2005; Seligson 2002, 2006; Torgler 2005; Wenzel 2004).
For example, both Torgler (2005) and Wenzel (2004) use individuals’ perceptions
of social acceptability of tax evasion to predict self-reported tax evasion. Seligson
(2002) uses questions about encountering corruption by the police and in court to
predict trust in police and courts. The more carefully crafted measurement of the
dependent variables in the current study compared to the existing ones represents an
improvement in the attempt to reduce endogeneity. Furthermore, previous studies
have established that “social learning theory can adequately be tested with survey data”
(Akers et al. 1979:651), which provides further reassurance about the adequacy of the
research design used here.


5. The index is based on the average of positive responses to the question of whether or
not a respondent has been asked to pay a bribe for the following public services: (1)
applying for a driving license/registering a car, (2) in relation to taxes, (3) applying for
a passport or an ID card, (4) in relation to the property reform, (5) in courts, (6) in
hospitals, (7) by police, (8) in relation to schools, universities, or kindergartens, (9) in
relation to social services, (10) when crossing the border, (11) by the local government,
(12) in the case of any other public services. The theoretical range of the measure is
from 0 (never asked to pay) to 1 (asked to pay in relation to all of these services), while
the empirical range is from 0 to .92.


6. “To what extent does bribing or gift giving influence the quality of the service?” 1 =
not at all, 2 = not much, 3 = quite a bit, 4 = a lot.


7. “How would you evaluate your personal economic wellbeing today compared to five
years ago?” 1 = much worse, 2 = somewhat worse, 3 = the same, 4 = somewhat better,
5 = much better.


8. It is impossible to say exactly how the respondents were interpreting the question
about job security. For example, one can also argue that a sense of job security may
decrease incentives to engage in corruption because getting caught engaging in corruption
could lead to losing an otherwise secure job. If this interpretation is correct, then the
finding here runs counter to the incentive theory.


9. Note that ethnicity is not a significant predictor of corruption among the general
public (see Table 3).
10. An additional interesting finding is that public sector employees are more likely to
offer a bribe or gift than other respondents. Further analysis is necessary to explain
this curious effect. One possible explanation is that public sector employees interact
with other public sector employees more frequently and therefore
develop expectations of reciprocity in offering favors.


11. The dichotomized variable assigns “1” to all respondents who state that they have been
asked for a bribe in relation to any of the 12 services listed in note 5, and “0” otherwise.
Social Insurance as a Collective Resource:
Unemployment Benefits, Job Insecurity and Subjective 
Well-being in a Comparative Perspective 


Ola Sjoberg, Stockholm University 


This article argues that unemployment benefits are providing a crucial but often 
overlooked function by reducing the insecurity associated with modern labor markets. 
Because job insecurity is associated with concerns about future financial security, eco- 
nomic support during unemployment may lessen the negative effects of job insecurity 
on employed individuals’ well-being. Using data from the European Social Survey, 
this article shows that the generosity of unemployment benefits makes a difference to 
the subjective well-being of employed individuals, especially those with limited eco- 
nomic resources and an insecure position in the labor market. These results indicate 
that unemployment benefits may be viewed as a collective resource with important 
external benefits, i.e., benefits to society over and above those to the unemployed who 
directly utilize such benefits. 


Mankind has always been exposed to risks and insecurities of various kinds, and 
many of the societal institutions of modern society can in fact be seen as ways of 
dealing with these risks.1 Whereas natural disasters and individual or epidemic
illness have been a continuous concern for societies, industrialization and ur- 
banization not only meant the introduction of new risks related to the emerging 
labor markets, but also the breakdown of many traditional forms of risk-sharing 
(Holzman and Jorgensen 2001). The welfare state can be seen as a state-centered 
solution to the problem of handling the risks and uncertainties people encounter 
during the course of their lives. Since the introduction of the first modern social 
insurance schemes in Bismarckian Germany in the 1880s, the welfare state in the 
industrialized world has expanded to cover more risks—from the early laws that 
focused on work-related accidents and old age to the post-World War II expansion 
of family policies—as well as higher proportions of the population. 

The welfare state and social insurance schemes have many purposes and a large 
range of both intended and unintended consequences for individuals as well as 
society as a whole. Focusing on social insurance, redistribution is perhaps the 
first function that springs to mind: redistribution between rich and poor, as well 
as over the life cycle. In this article, it is argued that the welfare state—or more 
specifically unemployment insurance schemes—is providing a crucial but often, at 
least empirically, overlooked function: it reduces the overall insecurity associated 
with modern labor markets. A main characteristic of the “golden age of welfare 
capitalism” in the 1960s and 1970s was not only low unemployment rates, but
also an institutional framework for largely secure employment (Wheelock 1999). 
The insecurity of employment was largely confined to unskilled manual workers. 
However, since the late 1970s, unemployment rates have been persistently high 
in the OECD area and insecurity of employment has increasingly become the 
lived experience of the middle classes too. A large amount of research indicates 
that job insecurity (i.e., the fear, real or perceived, people in the workforce have 
that they might lose their jobs and become unemployed) is an important stressor 
and as such associated with a range of adverse health outcomes. Because it seems 
highly likely that job insecurity is associated with concerns about future financial 
security, this article argues that economic support during times of unemploy- 
ment may lessen the negative effects of job insecurity on employed individuals’ 
subjective well-being. One implication of this is that social policy schemes, such 
as unemployment benefits, might be viewed as a collective resource that need 
not necessarily be utilized in order to have a welfare-enhancing effect; the mere 
knowledge that they exist may reduce the negative consequences of insecurity.
Against this background, this article will analyze the cross-national relationship
between employed individuals’ subjective well-being and unemployment
benefit generosity, a topic that notwithstanding the increased attention paid to 
the effect of macroeconomic and institutional factors on subjective well-being has 
attracted very scant interest previously (Di Tella et al. 2003; Ochsen and Welsch 
2006). In relation to these studies, this article contributes to the existing body of 
evidence in several ways. Firstly, a number of “placebo tests” are designed that allow
us to further test the hypothesized relationship between unemployment benefit 
generosity and subjective well-being. The formulation of such “placebo tests” (or 
counterfactuals) and of as many observable implications from the general theo- 
retical argument as possible is a productive way of increasing the leverage over 
the research problem at hand (King et al. 1994). More specifically, it is argued 
that for the relationship between unemployment benefit generosity and subjec- 
tive well-being to be plausible, unemployment benefit generosity should have 
no effect on the subjective well-being of individuals who do not experience job 
insecurity and of individuals who are not eligible for benefits if they become
unemployed. A further test of the hypothesized association between benefit generosity
and subjective well-being is provided by differentiating between employed indi- 
viduals with different risk of unemployment and different levels of vulnerability 
if they were to become unemployed. Secondly, by using two different measures of 
subjective well-being, one single-item measure of life satisfaction and one multi- 
item scale that captures different components of positive well-being (the so-called 
WHO-5 scale) that to my knowledge never has been used in this line of inquiry 
before, the present study provides further insight into how different aspects of 
subjective well-being are affected by unemployment benefit generosity and other 
macro-social factors. Moreover, the data allow us to control for prior and present 
health status of individuals as well as a number of indicators of social support
that have been shown to moderate the negative effects of job insecurity on well-being.
Thirdly, by distinguishing between different dimensions of unemployment
benefit generosity this article sheds further light on the institutional mechanisms 
whereby unemployment benefit schemes might affect the subjective well-being 
of employed individuals. 

It should be remembered that even if unemployment benefits are found to 
have a positive effect on a worker's well-being, this does not come without costs. 
Such benefits must be financed through tax revenues, and they might also af- 
fect unemployment levels (Nickell et al. 2005). Both taxes and unemployment 
may then affect individual well-being, directly as well as indirectly through, for 
example, lower levels of economic growth. Moreover, uncertainty and risk can 
also have positive effects, for example by opening up investment opportunities for
entrepreneurs. In the long run, uncertainty may therefore contribute to economic 
efficiency and growth and as a result also benefit those less inclined to engage in 
entrepreneurial activities. However, the direct, short-term costs of uncertainty may 
be more unequally distributed in society. As a social construct, the labor market 
incorporates asymmetries of power between its participants, and whereas uncertainty
may be a window of opportunity for some, it may be a constraint for others.


Job Insecurity, Insurance and Well-Being 


Insurance, both private and the various forms of income maintenance schemes 
usually referred to as social insurance, can be defined as a device which offers 
individuals protection against risk (Barr 1993). Focusing for a moment on pri- 
vate insurance, what induces individuals to buy protection against risks that they 
know have a very low probability of materializing? At the end of our lives, most 
of us have spent far more money on premiums to insure our houses, cars and 
other belongings or on medical insurance premiums than have been paid out to 
us by the insurance companies. The answer is of course that most of us are risk- 
averse, meaning that uncertainty per se causes disutility and potentially reduces 
our well-being (Barr 1993). Therefore, certainty in itself is seen by most people 
as something worth paying for. Having bought a new and expensive car, most of 
us will probably sleep better at night if we know that we will be (at least partly) 
compensated in financial terms if it is stolen. 
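
The risk-aversion argument above can be illustrated with a small worked example. This is only a sketch under stated assumptions: logarithmic utility is one common way to model a risk-averse person, and the wealth, loss and probability figures are hypothetical values chosen for illustration, not taken from the article. It shows that such a person would accept a premium higher than the expected loss, which is why certainty is "worth paying for."

```python
import math


def max_acceptable_premium(wealth, loss, prob_loss):
    """Largest premium a log-utility (risk-averse) agent would pay for full cover.

    Assumption: utility of wealth is ln(wealth). The agent is indifferent
    between paying this premium with certainty and facing the uninsured gamble.
    """
    expected_utility = (prob_loss * math.log(wealth - loss)
                        + (1 - prob_loss) * math.log(wealth))
    # Certain wealth level that yields the same utility as the gamble.
    certainty_equivalent = math.exp(expected_utility)
    return wealth - certainty_equivalent


# Hypothetical figures: a car worth 30,000 with a 1 percent chance of theft.
wealth, loss, prob_loss = 100_000.0, 30_000.0, 0.01
expected_loss = prob_loss * loss
premium = max_acceptable_premium(wealth, loss, prob_loss)

print(f"expected loss: {expected_loss:.2f}")
print(f"maximum acceptable premium: {premium:.2f}")
```

The gap between the two printed figures is the amount the agent is willing to pay purely to remove uncertainty; a risk-neutral agent (linear utility) would show no such gap.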

Admittedly, there are at least two important differences between private insur- 
ance and what we usually refer to as social insurance. First, social insurance is 
often compulsory, so individuals have no option as to whether to buy a particular 
insurance (let’s say against the risk of unemployment) or not. Second, social insur- 
ance can be provided against risks that private insurers may not cover. The prime 
example here is unemployment insurance. Private unemployment insurance is 
rare around the world for three primary reasons: (1) moral hazard (i.e., that the
insured can influence the probability that an event, in this case unemployment,
will happen), (2) the fact that unemployment is related to the business cycle and
often occurs in waves, and (3) the fact that some groups are simply seen as uninsurable.
Despite these differences between private and social insurance, the core argument
of this article is that publicly provided unemployment insurance enhances
people’s well-being by reducing uncertainty in the same way as property insurance,
for example. It may very well be that some individuals would prefer to opt out of 
compulsory unemployment insurance in favor of private insurance or savings and 
see this as an increase in well-being. If they are risk-tolerant, they might prefer no 
coverage. However, if people consider unemployment a real risk and they are risk- 
averse, the risk of becoming unemployed may have a negative effect on well-being. 
A large amount of research provides evidence of adverse effects of self-reported 
job insecurity (i.e., the fear of becoming unemployed) on various forms of health 
outcomes, ranging from psychological distress, anxiety and depression to an in- 
creased level of mental, emotional and physical exhaustion (for useful overviews 
of the literature, see De Witte 1999; Sverke et al. 2002; De Cuyper and De Witte 
2006). Burchell (1999; see also De Witte 1999) even states that the empirical
literature provides evidence of a direct causal link between job insecurity and
psychological well-being. Longitudinal studies have also suggested that such a 
causal link is plausible (see Sverke et al. 2002; Ferrie et al. 2003, 2005). However, 
that the perceived cause (i.e., job insecurity) precedes the effect (i.e., negative 
psychological well-being) is a necessary but not a sufficient condition for estab- 
lishing a convincing causal argument. In addition, a causal mechanism whereby 
job insecurity affects psychological well-being needs to be specified or at least 
suggested (Elster 1989). Here the literature points at related but slightly different 
ways of understanding this relationship. In Jahoda’s well-known theory (1982) 
about the latent functions of work, employment fulfils both a manifest function, 
in the form of generating an income, and a number of latent functions, such as 
provision of a time structure, social contacts, participation in collective purposes, 
status and identity, and regular activity. The threat of job loss therefore entails the 
threat of losing vital tangible as well as intangible resources, and the fear of such 
losses may then trigger various types of stress reactions such as anxiety and depres- 
sion (Greenhalgh and Rosenblatt 1984; De Witte 1999). Burchell (1994, 1997), 
drawing on agency theory, argues that the causal mechanism should be sought in 
the individual’s ability to plan and control his or her life. While individuals can
use different strategies to deal with actual job loss, such coping strategies are very 
difficult to use when the individual fears the loss of his or her job, but is uncertain 
about if and when the loss will occur (Lazarus and Folkman 1984). Research has 
indicated that experiencing job insecurity can be as distressing as experiencing 
actual unemployment (De Witte 1999). It has also been pointed out that although 
job insecurity is related to actual change activity (i.e., the process of actually losing 
one’s job), the perception of job insecurity may be triggered by factors beyond
specific change activity. In line with Heaney et al. (1994), it can be argued that 


Social Insurance as a Collective Resource © 1285 


in contemporary working life perceived by many as being characterized by rapid 
change and recurrent organizational restructuring, a certain enhanced level of job 
insecurity may represent an enduring perceived threat for employees. Thus, if job 
insecurity may be seen as chronic, job insecurity can be considered to be a chronic 
stressor (Van Vuuren et al. 1991). 

Although the importance of having a feeling of economic security for the 
psychological well-being of both unemployed and employed has long been 
recognized (Eisenberg and Lazarsfeld 1935), the role of economic security as 
a moderator of the effect of job insecurity on psychological well-being has 
nevertheless been conspicuously absent from the literature (Ferrie et al. 2005). 
Although unemployment benefits may do little to directly affect the risk that 
employed individuals will lose their jobs, a central argument of this article is 
that such benefits may affect the sense of economic vulnerability associated 
with possible job loss. It seems plausible to assume that an important reason 
why uncertainty about future unemployment affects people’s subjective well- 
being is because it causes concern about financial security (Ferrie et al. 2003). 
Consequently, it can be argued that employed individuals’ sense of well-being 
will increase if there is an unemployment protection scheme which, at least to 
some degree, will protect them from income losses if they become unemployed. 

Although job insecurity and the uncertainty associated with future prospects in 
the labor market have increasingly come to be the lived experience of the middle 
classes too, there are strong reasons to believe that the burden of job insecurity 
still falls disproportionately on those who are the least well equipped to deal with 
it (i.e., those with limited resources, economic or otherwise). A useful distinction 
can be made here between risk and vulnerability. People are at risk if something 
negative might happen, whereas people are vulnerable when, if something nega- 
tive happens, it will damage them. A person with sufficient economic resources 
(money in the bank or a high income) but in a low-security occupation is at risk, 
but not vulnerable, whereas a person who is in secure employment but lacking 
in economic resources (low income and no money in the bank) is vulnerable, 
but not at risk. An additional argument of this article is that if unemployment 
benefits function as a moderator of the effect of job insecurity on well-being, this
moderating effect will be most important for those with a high risk of becoming 
unemployed and/or lacking the private resources to cope with these risks. 

Although there is a lack of research on the links between job insecurity, future 
financial security and well-being, the effect of income maintenance schemes in 
reducing uncertainty has been noted in a more general sense. Both Atkinson 
(1991) and Barr (1993, 2001) say the fact that the welfare state reduces uncer- 
tainty and provides citizens with a sense of security is an important, but often 
neglected, objective of the welfare state. Haveman (1985:449) furthers this argu- 
ment by saying: “in my view, the primary economic gain from the welfare state 
is the universal reduction in the uncertainty faced by individuals.” Likewise, Sinn 
(1995:259) argues that “apart from the benefit of stabilising the political system
and avoiding social unrest, the welfare state’s main achievement is the social insurance
it provides in an uncertain world.” There is also a nascent body of literature,
above all in economics, which has studied the effect of macroeconomic conditions
on subjective well-being (e.g., Di Tella et al. 2003; Wolfers 2003; Alesina et al.
2004). An important conclusion from this line of research is that macroeconomics
matter, and that people’s subjective well-being is correlated with both levels and
trends in macroeconomic factors such as GDP, unemployment and inflation. Also
factors such as the quality of governance (Helliwell 2003) and democracy (Frey
and Stutzer 2000) have been shown to be important in this context. A limited
number of cross-national studies have also analyzed the effect of the welfare state
and income protection schemes on subjective well-being. Di Tella et al. (2003),
which is, to my knowledge, the only study that directly analyzes the effects of
unemployment benefits on the well-being of the employed, finds that unemployment
benefits and self-reported life satisfaction are positively correlated for both
unemployed and employed. Ochsen and Welsch (2006) and Veenhoven (2000),
however, find no evidence of such a link when analyzing combined samples of
unemployed and employed, and neither does Ouweneel (2002) when analyzing
only unemployed individuals. Ochsen and Welsch (2006) argue that this
does not imply that the unemployment protection system has no positive effects
on well-being, but rather that these effects are matched by associated costs (e.g.,
taxes). The studies by Veenhoven (2000) and Ouweneel (2002) use no individual-level
control variables and expenditure-based measures of social protection that,
I would argue, make their results questionable.² In relation to these studies, this
research controls for a much larger set of relevant micro-level variables (especially
regarding present and prior health status of the respondents). It also uses and
contrasts different measures of subjective well-being (at the individual level) and
unemployment benefits (at the macro level).


Hypotheses 


The main idea guiding this research is that job insecurity is a more or less chronic
stressor in modern societies which has adverse effects on employed individuals’ 
subjective well-being. One important reason for this is that job insecurity is likely 
to cause concern about financial security. Because unemployment benefits may 
reduce the sense of economic vulnerability associated with job loss, such benefits 
will have a beneficial effect on employed individuals’ subjective well-being. Thus: 


H1. Unemployment benefit generosity will have a positive 
effect on employed individuals’ subjective well-being. 


For the hypothesized effect of unemployment benefit generosity on employed 
people's well-being to be plausible, benefit generosity should also have a negligible 
effect on the well-being of individuals who are not experiencing job insecurity and
for whom the status of “unemployed with a right to unemployment benefits” is 
irrelevant, as well as for individuals who will not be eligible for unemployment 
benefits if they become unemployed. Possible candidates for the first group of in- 
dividuals are housewives. Because they are not employed they will not experience 
job insecurity directly, thus there will be no moderating role of unemployment 
benefits. Appropriate candidates for the second group of individuals are students; 
although in some countries students may be eligible for unemployment benefits 
under certain conditions, as a rule they will not be eligible when they have finished 
their studies. Therefore, although both students and housewives will probably 
also have concerns about financial security, which in turn may affect their well- 
being, it could be argued that these concerns will not be mitigated by generous 
unemployment insurance benefits. However, using students and housewives for 
testing these kinds of “placebo effects” is not without problems. If housewives 
are affected by the well-being of their husbands, and if economic resources are 
pooled within the household, the well-being of housewives will be influenced by 
the job insecurity experienced by their husbands, and as a consequence also by 
the insecurity-reducing effect of unemployment benefits. However, it could be 
argued that this “indirect” effect of unemployment benefits on housewives’ well- 
being will be less in magnitude than the “direct” effect on employed individuals’ 
well-being. The issue of selection (i.e., that unhealthy women will withdraw from 
the labor market into homemaking) seems irrelevant because previous research has 
shown that health does not affect the odds of becoming or remaining a housewife 
(e.g., Ross and Mirowsky 1995). As for students, there are probably other reasons 
why their well-being is not affected by the generosity of unemployment benefits 
besides the fact that they are not eligible for such benefits. Some of these factors 
are (at least partially) controlled for in the empirical analyses, such as no families to 
support (which makes the economic consequences of unemployment less severe) 
and little labor market experience (which may make them less risk averse), while 
we are unable to control for other factors (such as the fact that many students are 
financially dependent upon their parents). Keeping these cautions in mind, it is 
possible to formulate the following hypothesis: 


H2. Unemployment benefit generosity will have a smaller 
effect on the subjective well-being of individuals who do not 
directly experience job insecurity (housewives) as well as for 
individuals who are not eligible for benefits if they become 
unemployed (students). 


We would also expect unemployment benefit generosity to have a greater effect 
on employed individuals’ sense of well-being as their position in the labor market 
becomes more uncertain. Because no direct measure of subjective job insecurity 
is available in the data, prior unemployment spells will serve as a proxy for job
insecurity (past unemployment is a strong predictor of the likelihood of becoming
unemployed in the future).³ This is also in line with the results of Oliver and
Pomicter (1981) that job insecurity becomes far more salient an issue for those 
who have lost jobs in the past themselves. Although there are theoretical reasons 
for defining job insecurity this way, it also entails methodological problems. In 
short, any difference in well-being between employed individuals with and with- 
out prior unemployment experience might simply be due to the fact that those 
with bad health also are more likely to become unemployed. The use of a large 
number of control variables is an attempt to control for observable factors that 
we believe influence both the probability of having experienced unemployment 
spells and reporting relatively low levels of subjective well-being. However, there 
is a possibility that there are unobservable differences between employed individu- 
als with and without prior unemployment spells that will affect the results. One 
way of tackling this problem is to be restrictive in the definition of the group 
having experienced prior unemployment spells. In the subsequent analyses prior
unemployment experience is defined as having experienced a spell of unemployment
more than five years ago. It is worth stressing that our confidence in the
results when testing this hypothesis is dependent on whether we believe that such 
a definition of prior unemployment experience captures important aspects of 
labor market risks but minimizes the problem of unobserved heterogeneity. Thus: 


H3. The more uncertain an employed individual’s position
in the labor market, measured as having experienced spells of 
unemployment, the greater the effect of unemployment benefit 
generosity on subjective well-being. 


Because it is hypothesized that the generosity of unemployment insurance benefits 
will affect the sense of economic vulnerability associated with possible job loss, we 
would also expect benefit generosity to have a greater effect the more economically 
vulnerable the employed individual. To test this hypothesis, we will use a measure 
of economic vulnerability based on whether the respondent thinks it would be 
difficult to borrow money if necessary to make ends meet. Thus: 


H4. The more vulnerable the employed individual, measured
as the respondent’s assessment of the possibility of borrowing
money if necessary, the greater the effect of unemployment
benefit generosity on subjective well-being.


Data and Methods 


This article will use data for 21 countries (Austria, Belgium, the Czech Republic, 
Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Ireland, the 
Netherlands, Norway, Poland, Portugal, Slovakia, Slovenia, Spain, Sweden,
Switzerland and the United Kingdom) from the second wave of the European 
Social Survey carried out in 2004/05. The ESS data are collected in the form of 
face-to-face interviews, and the average response rate for the countries included 
in the analysis in this article is 65.1 percent.⁴ A weighting factor has been used in
all analyses to correct for sample selection bias. 

Studies on the cross-national relationship between macroeconomic and insti- 
tutional factors and subjective well-being have typically used single-item mea- 
sures on either life satisfaction or happiness. Such measures have been shown to 
correlate highly with both alternative and non-self-report measures of well-being 
(Sandvik et al. 1993; Kahneman and Krueger 2006). However, evidence also 
suggests that subjective well-being is composed of several components and that 
information on these components is lost when single-item scales are used (Diener 
1984), and moreover that increasing reliability through the use of multi-item 
measures may reduce situational effects and simple errors (Sandvik et al. 1993). 
This article will use two different but related measures of subjective well-being: 
the main focus will be upon the so-called WHO-5 scale which consists of five 
items assessing positive mood, vitality and general interest over the past two 
weeks (for a description of the WHO-5 scale and its validity, see Bech et al. 2003; 
Bech 2004; Henkel et al. 2003). More specifically, the WHO-5 scale consists of 
the following five statements: “I have felt cheerful and in good spirits for the last 
2 weeks;” “I have felt calm and relaxed for the last 2 weeks;” “I have felt active and 
vigorous for the last 2 weeks;” “I have woken up feeling fresh and rested for the 
last 2 weeks;” and “My daily life has been filled with things that interest me for 
the last 2 weeks.” Possible answers range from “At no time” (which is given a score 
of 0) to “All of the time” (which is given a score of 4).⁵ To obtain comparability
with other similar scales, the scale is conventionally transformed to range between 
0 and 100, where 0 indicates worst conceivable well-being. As a point of reference 
in relation to other studies in this area, we will also use a single-item question on 
overall life satisfaction (“All things considered, how satisfied are you with your life 
as a whole nowadays?” The response scale varies between 0, extremely dissatisfied, 
and 10, extremely satisfied). 
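
The rescaling of the WHO-5 items to a 0-100 range can be sketched as follows. This is an illustration only, not the author's code: the function name and the `max_item_score` parameter are hypothetical, and the maximum item score is passed in explicitly rather than hard-coded so that the sketch follows whatever response coding is actually used.

```python
def who5_score(item_scores, max_item_score):
    """Sum five item scores and rescale the total to run from 0 to 100.

    Assumption: every item is coded from 0 (lowest well-being) up to
    max_item_score (highest well-being).
    """
    if len(item_scores) != 5:
        raise ValueError("the WHO-5 scale has exactly five items")
    if any(not 0 <= score <= max_item_score for score in item_scores):
        raise ValueError("item score outside the allowed range")
    raw_total = sum(item_scores)
    # Divide by the highest possible total, then stretch to 0-100.
    return 100.0 * raw_total / (5 * max_item_score)


# Hypothetical respondent, using the 0-4 coding described in the text above.
print(who5_score([3, 2, 4, 1, 3], max_item_score=4))
```

A score of 0 then corresponds to the worst conceivable well-being and 100 to the best, regardless of how many response categories each item has.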

Because the WHO-5 scale not only is a measure of overall quality of life but 
may also be used as a screening device for depression (Bech 2004), it could be 
argued that the WHO-5 scale is more appropriate for measuring psychological
reactions to job insecurity, such as anxiety and depression, than the single-item 
question on overall life satisfaction. There are also indications that the WHO- 
5 scale and single-item measures of life satisfaction measure different concepts 
(Tennant et al. 2007). However, it should be noted that these two measures of 
well-being are highly correlated—the individual-level correlation in the sample of 
employed individuals is .366 (p = .000), ranging from .192 in Portugal to .485 in 
Sweden, whereas the country-level correlation is .707 (p = .000). 

The key explanatory variable, unemployment benefit generosity, will be measured
along three dimensions: replacement rate, benefit duration and average
expenditure. Replacement rate is net of taxes and calculated as the average for two 
typical households: a single worker earning an average production worker's wage 
and a worker with a wife and two children, and pertains to a situation where the 
worker is unemployed for 26 weeks and employed for 26 weeks during the year 
2000.⁶ Duration refers to how long an average production worker will receive
regular (i.e., non-means tested) unemployment benefits. Due to low take-up rates 
and the existence of the long-term unemployed whose rights to benefits have 
expired, the stylized situation pertaining to an average worker may misrepresent 
the economic conditions facing the unemployed. In an attempt to correct for this, 
we use the average benefit expenditure, defined as the average sum (in PPP-adjusted
dollars) the countries are spending on each unemployed person for the period
1990-1999.⁷ In the analyses, we will use these variables both separately and in the
form of an additive index. When constructing this index, all variables were standardized
to vary between 0 and 1. Because duration has an extremely
skewed distribution (the mean duration is 82 weeks whereas the median duration
is 52 weeks), maximum duration was set at two years.⁸
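
One way the additive index described above could be built is sketched below. The details are assumptions rather than the author's procedure: "standardized to vary between 0 and 1" is read here as min-max rescaling across countries, "additive" as a simple sum of the three rescaled components, and "two years" as 104 weeks. The function names and the three-country data are hypothetical.

```python
def min_max(values):
    """Rescale a list of numbers so the smallest is 0 and the largest is 1."""
    lowest, highest = min(values), max(values)
    if highest == lowest:
        return [0.0 for _ in values]
    return [(value - lowest) / (highest - lowest) for value in values]


def generosity_index(replacement_rate, duration_weeks, expenditure,
                     duration_cap=104):
    """Sum of three 0-1 components, one index value per country.

    Assumption: duration is capped (here at 104 weeks, i.e. two years)
    before rescaling, to limit the influence of the skewed distribution.
    """
    capped_duration = [min(weeks, duration_cap) for weeks in duration_weeks]
    components = [min_max(replacement_rate),
                  min_max(capped_duration),
                  min_max(expenditure)]
    return [sum(parts) for parts in zip(*components)]


# Hypothetical data for three countries (not taken from the article).
index = generosity_index(replacement_rate=[40, 60, 80],
                         duration_weeks=[26, 52, 260],
                         expenditure=[5_000, 10_000, 15_000])
print(index)
```

Under these assumptions the index runs from 0 (least generous on every dimension) to 3 (most generous on every dimension); capping duration keeps a single country with unlimited benefits from compressing all other countries toward zero.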

Research on cross-national differences in subjective well-being suggests a range 
of potentially confounding factors at the macro level which might affect the 
relationship between unemployment benefit generosity and subjective well-being. 
The models will therefore include measures of both levels (average for the period 
2000-2004) and trends (average yearly changes 1990-2004) in GDP per capita,
unemployment and inflation. Quality of governance is also used as a control vari- 
able because the quality and reliability of government services have been shown 
to have an impact on cross-national variation in individual well-being (Helliwell 
2003). Employment protection legislation is also entered as a control variable in 
the analyses. Whereas unemployment benefit generosity affects economic condi- 
tions during unemployment, employment protection legislation might be seen 
as influencing the likelihood of becoming unemployed in the first place. Finally, 
life expectancy, which can be seen as a crude measure of overall living and health 
conditions in a country, is also used as a control variable. More information on 
these variables and how they were constructed can be found in the Appendix. 

Although we are primarily interested in the macro-level effects, we control for 
a number of characteristics at the individual level that are important in account- 
ing for individual variation in subjective well-being. Social class will be measured 
using the EGP class schema (small proprietors and agricultural workers are ex- 
cluded from the analysis). The analyses also control for whether the respondent 
has experienced any spell of unemployment for 3 to 12 months (but not within 
the past five years), the number of years the respondent has spent working full- or 
part-time and the degree to which he or she thinks it would be difficult to borrow 
money in case of financial difficulties. A range of measures of social support is 
also employed: whether the respondent is currently living with a partner; whether
they have children living at home; how often the respondent meets socially with
friends, relatives or work colleagues; and how often the respondent takes part in
social activities. The last two of these measures could not be included in the
analyses by Di Tella et al. (2003) and Ochsen and Welsch (2006), which is
unfortunate because it has been shown that social support provided by both family 
members and friends may moderate the negative effects of job insecurity on life 
dissatisfaction (Lim 1996). 

A range of variables aimed at controlling for prior and present health status is
also employed. A control for more serious health problems is used in the form of
the question “Are you hampered in your daily activities in any way by any longstanding
illness, or disability, infirmity or mental health problem?” Responses “yes,
a lot” and “yes, to some extent” are collapsed into one category. Contacts with the
health care system are controlled for by including a variable based on the question
“How many times in the past twelve months have you consulted a doctor for yourself,
including specialists and GPs?” Possible answers are “never,” “once or
twice,” “3-5 times,” “6-10 times,” and “more than 10 times.” Current, as well as
previous, use of medicine is controlled for by including three variables based on
the following questions: “Are you regularly taking any pills or using any medication
prescribed for you?” (“yes” = 0, “no” = 1); “In the last five years, how often
have you taken or used a prescribed medicine that was prescribed for someone
else?” (“never in last five years” = 1; “once,” “2-3” and “4 times or more” = 0); and
“What did you do with the prescription the last time the doctor prescribed you a
medicine you had not had before?” (“never had a prescription from a doctor” = 1;
“didn’t collect the medicine,” “collected it but did not use it,” “used it but not as
prescribed,” and “used it exactly as prescribed” = 0).

Because the hypotheses in this article refer to the effect of macro-level variables,
results for these individual-level variables have been placed in the Appendix (Table
A1). The hypotheses outlined are tested using multilevel regression analysis, which
is appropriate when observations are nested, in this case individuals within countries
(Snijders and Bosker 1999).
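
For readers unfamiliar with multilevel regression, a generic two-level random-intercept model for individuals nested within countries can be written as below. The notation is illustrative and not taken from the article: W_ij stands for the well-being of individual i in country j, x_ij for the individual-level controls, G_j for unemployment benefit generosity, and z_j for the other country-level controls. The exact specification estimated in the article may differ.

```latex
% Illustrative notation only; the symbols are assumptions, not the article's.
\begin{align}
  W_{ij}     &= \beta_{0j} + \boldsymbol{\beta}^{\top}\mathbf{x}_{ij} + e_{ij},
             & e_{ij} &\sim N(0, \sigma_e^{2}) \\
  \beta_{0j} &= \gamma_{00} + \gamma_{01} G_{j}
                + \boldsymbol{\gamma}^{\top}\mathbf{z}_{j} + u_{j},
             & u_{j} &\sim N(0, \sigma_u^{2})
\end{align}
```

In this notation H1 corresponds to a positive coefficient on generosity, \gamma_{01} > 0. One way to represent H3 and H4 would be to add a cross-level interaction between G_j and the individual-level indicator of prior unemployment or of difficulty borrowing money.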


Results 


Let's start with the general argument of this article, that unemployment ben- 
efit generosity increases the subjective well-being of employed workers. Figure 
1 depicts the country-level relationship between average unemployment benefit 
generosity and average levels of subjective well-being for two groups: unemployed 
and employed individuals with no experience of unemployment. 

There is a significant and strong correlation between unemployment benefit 
generosity and absolute levels of subjective well-being for both employed and un- 
employed individuals, an association that is almost identical when life satisfaction 
is used as a measure of subjective well-being (for a similar result, see Di Tella et al. 
2003). An interesting consequence of the fact that the effect of unemployment 
benefit generosity on subjective well-being is roughly equal for both employed 
and unemployed individuals is that cross-nationally, the effect of unemployment 
benefit generosity on relative levels of well-being among the unemployed (i.e. 
relative to the employed) is close to zero. 

Figure 1. Cross-National Relationship between Subjective Well-Being and 
Unemployment Benefit Generosity among Employed and Unemployed 

Table 1 presents the results from a series of multilevel regression models. The 
first column displays the results where the macro-level variables were introduced 
one at a time while controlling for individual-level variables (Table A1), with 
standard errors in parentheses and the change in deviance value between the full 
micro model and the full micro model plus this particular variable in brackets (see note 9). 










The measure of unemployment benefit generosity has a positive and significant im- 
pact on the subjective well-being of employed individuals. A closer examination of 
the individual components included in the benefit generosity index (i.e., duration, 
replacement ratio and average expenditure) indicates that all have a positive and 
significant effect (at least at the 5 percent level), with replacement rate, followed 
by average expenditure, as the component with the greatest impact on subjective 
well-being. This might indicate that it is income replacement and the possibility 
of maintaining accustomed levels of living which are most important in reducing 
the effect of perceived job insecurity on subjective well-being. 

Both levels and trends in GDP per capita have a significant positive effect 
on subjective well-being, whereas only change in unemployment, and not the 
unemployment level, has a significant effect. However, we should be careful not 
to pay too much attention to conventional significance levels; the coefficient 
associated with unemployment level has the expected sign and is significant at 
the 10 percent level. Both governance and life expectancy have a positive and 
significant effect on subjective well-being, whereas inflation and employment 
protection have no significant effects. 

Column 2 displays the results from the model that, from a statistical point of 
view, performed best (the most parsimonious model that still provides a good fit 
to the data). In this model, change in GDP per capita and unemployment benefit 
generosity have positive and highly significant (at the 0.1 percent level) effects, 
and after these two variables are included in the model, no other variables have a 
significant effect or improve the model’s fit to the data. Also in this model all of 
the individual components included in the benefit generosity index have a positive 
and significant effect (at least at the 5 percent level) on subjective well-being, and 
again it is replacement rate which has the greatest impact on subjective well-being. 
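
The model comparison behind the bracketed deviance changes in Table 1 can be sketched as follows. This is an illustration under stated assumptions, not the software used for the article: the function names are hypothetical, the log-likelihood values in the usage example are invented, and the sketch covers only the case of adding one variable at a time (one degree of freedom).

```python
import math


def deviance(log_likelihood):
    """Deviance as defined in the notes: -2 * ln(likelihood)."""
    return -2.0 * log_likelihood


def lr_test_one_df(loglik_restricted, loglik_full):
    """Likelihood ratio test for adding ONE explanatory variable.

    Returns (change in deviance, p-value). With 1 degree of freedom the
    chi-squared survival function has the closed form erfc(sqrt(x / 2)).
    """
    change = deviance(loglik_restricted) - deviance(loglik_full)
    if change < 0:
        raise ValueError("the full model should not fit worse than the restricted one")
    p_value = math.erfc(math.sqrt(change / 2.0))
    return change, p_value


# Usage with invented log-likelihoods (not values from the article):
change, p = lr_test_one_df(loglik_restricted=-1001.9205, loglik_full=-1000.0)
print(f"change in deviance = {change:.3f}, p = {p:.3f}")
```

For comparisons involving more than one added variable, the chi-squared distribution with the corresponding degrees of freedom would be needed, which this closed form does not cover.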

The results from multilevel regressions with only 21 countries can be very 
sensitive to potential outliers. The results were therefore subjected to sensitivity 
analysis, whereby countries assessed as potentially having a large impact on the 
results were excluded. In a first step, post-communist countries were excluded 
from the analysis; the results indicate that the effect of unemployment 
benefit generosity remains highly significant in the sample of only non-post-
communist countries. In a second step, influential cases were deleted from the 
analysis one at a time. Although a range of more or less sophisticated statistical 
procedures can be used to detect outliers in multilevel analysis (see, e.g. 
Langford and Lewis 1998), in a cross-sectional dataset with only 21 countries 
this can be done in a rather straightforward way: the country with the highest 
influence value (which is a combination of residuals and leverage value) was 
deleted from the analysis, the model was run again, and the procedure repeated 
until six countries had been deleted from the analysis. Although the coefficient 
of unemployment benefit generosity decreases in absolute size throughout 
these analyses (to around 1.6 when Switzerland, Estonia and the United 
Kingdom are excluded, and stable thereafter when Hungary, Spain and Finland 
are excluded), the effect always remains significant at the 1 percent level. 
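
The stepwise deletion procedure described above can be sketched in a simplified form. The sketch rests on several assumptions: it uses a simple country-level least-squares regression rather than the multilevel model of the article, it uses Cook's distance as one possible "influence value" combining residuals and leverage (the article does not specify the exact statistic), and all names and data are hypothetical.

```python
def ols_fit(xs, ys):
    """Least-squares line y = a + b*x; returns (intercept, slope)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sxx
    return y_bar - slope * x_bar, slope


def cooks_distances(xs, ys):
    """Cook's distance per observation for a simple regression (2 parameters)."""
    n = len(xs)
    a, b = ols_fit(xs, ys)
    x_bar = sum(xs) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    residuals = [y - (a + b * x) for x, y in zip(xs, ys)]
    s2 = sum(e * e for e in residuals) / (n - 2)  # residual variance
    if s2 == 0:
        return [0.0] * n  # perfect fit: no observation is influential
    distances = []
    for x, e in zip(xs, residuals):
        h = 1.0 / n + (x - x_bar) ** 2 / sxx  # leverage
        distances.append((e * e / (2 * s2)) * h / (1 - h) ** 2)
    return distances


def delete_most_influential(data, steps):
    """Repeatedly drop the most influential country and refit.

    data maps country -> (benefit generosity, mean well-being).
    Returns a list of (dropped country, slope after dropping it).
    """
    remaining = dict(data)
    trail = []
    for _ in range(steps):
        if len(remaining) < 4:
            raise ValueError("too few countries left to refit the model")
        names = list(remaining)
        xs = [remaining[c][0] for c in names]
        ys = [remaining[c][1] for c in names]
        d = cooks_distances(xs, ys)
        worst = names[d.index(max(d))]
        del remaining[worst]
        xs = [v[0] for v in remaining.values()]
        ys = [v[1] for v in remaining.values()]
        trail.append((worst, ols_fit(xs, ys)[1]))
    return trail
```

Inspecting how the slope in the returned trail changes from step to step mirrors the check reported above, where the coefficient shrinks at first and then stabilizes.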

In the second section of Table 1 (columns 5 and 6), the same analysis is performed 
with the single-item life satisfaction scale as dependent variable. Two main 
conclusions stand out from these analyses. First, there is a consistently significant 
effect of the quality of governance variable, an effect that does not appear with the 
WHO-5 scale as a dependent variable in a multivariate framework. Second, 
unemployment benefit generosity has a significant positive effect also on this 
measure of employed individuals’ subjective well-being. However, this effect seems 
to be somewhat weaker in terms of significance levels when compared to the WHO-5 
scale. Although any meaningful comparison between the results for the two measures 
of subjective well-being is extremely difficult to make, this result might indicate 
that the effect of unemployment benefit generosity is as, if not more, important for 
rather concrete stress symptoms, such as anxiety and depression, as it is for 
overall life satisfaction. 

Returning to the first section of Table 1, Hypothesis 2 stated that benefit 
generosity should have a negligible effect on the subjective well-being of 
individuals who are not experiencing job insecurity and for whom the status of 
“unemployed with a right to unemployment benefits” is irrelevant, as well as for 
individuals who will not be eligible for unemployment benefits if they become 
unemployed. Housewives and students were identified as possible candidates for 
testing this hypothesis, and in columns 3 and 4 we can see that unemployment 
benefit generosity has no significant effect on the subjective well-being 
of individuals belonging to these groups. However, the effect of unemployment 
benefit generosity on housewives’ well-being is non-trivial and significant at the 
10 percent level, possibly indicating a kind of “spill-over effect” of these benefits 
on other household members. All in all, we must therefore conclude that the first 
three of our hypotheses are supported by these analyses. 

[Table 1: body not recoverable from the source; surviving entries: 15,666; 15,650] 
Source: European Social Survey 2004/05 
Notes: Model 4 (students) controls for Greece and Switzerland as outliers 
Significance levels: *p < .05 **p < .01 ***p < .001 

Our penultimate hypothesis argued that unemployment benefit generosity 
would have a greater effect on employed individuals’ sense of well-being the more 
uncertain their position in the labor market. An alternative way of formulating 
this hypothesis is to say that when we compare two individuals who experience 
the same degree of labor market uncertainty but live in countries with different 
levels of unemployment benefit generosity, the adverse effects of employment 
uncertainty on subjective well-being will be lower for the person living in the 
country with the higher level of benefit generosity. So what this hypothesis implies 
is the existence of cross-level interaction effects between benefit generosity and 
labor market uncertainty. Table 2 displays the coefficients for such cross-level 
interaction effects between benefit generosity (dichotomized into high and low 
benefit generosity) and the measure of labor market uncertainty identified for this 
purpose, whether or not the respondents have experienced any spell of unemploy- 
ment (3-12 months, but not within the past five years). 

These cross-level interaction effects support the hypothesis that unemployment 
benefit generosity may mitigate the adverse effects of labor market uncertainty on 
subjective well-being. First of all, we can see that within the group of countries 
defined as having low or high levels of benefit generosity, those with no experience 
of unemployment report higher levels of subjective well-being than respondents 
who have experienced a spell of unemployment. Then if we compare the subjec- 
tive well-being of individuals who have experienced spells of unemployment in 
countries with high and low benefit generosity respectively, we see that those liv- 
ing in countries with comparatively high benefit generosity report a significantly 
higher level of subjective well-being. This result is potentially sensitive to where 
we draw the line between low and high levels of benefit generosity. A sensitivity 
analysis excluding the two countries with values closest to the line between low 
and high benefit generosity was therefore performed without affecting the basic 
pattern displayed in Table 2. 
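
The comparison underlying these cross-level interactions can be illustrated descriptively. The sketch below only computes mean well-being within each cell of benefit generosity (dichotomized at an assumed cutoff) by unemployment experience; it is not the multilevel model with controls used for Table 2, and the function name, data layout and cutoff are hypothetical.

```python
from collections import defaultdict


def cell_means(respondents, generosity_by_country, cutoff):
    """Mean well-being per (generosity level, unemployment experience) cell.

    respondents: iterable of (country, experienced_unemployment, wellbeing)
    generosity_by_country: dict mapping country -> benefit generosity score
    cutoff: countries at or above this score count as "high" generosity
    """
    sums = defaultdict(lambda: [0.0, 0])  # cell -> [running total, count]
    for country, experienced, wellbeing in respondents:
        level = "high" if generosity_by_country[country] >= cutoff else "low"
        cell = sums[(level, experienced)]
        cell[0] += wellbeing
        cell[1] += 1
    return {key: total / count for key, (total, count) in sums.items()}
```

Rerunning such a comparison with a different cutoff, or without the countries closest to it, corresponds to the sensitivity check on the line between low and high benefit generosity mentioned above.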

The last of our hypotheses stated that because the generosity of unemployment 
insurance benefits will presumably affect the sense of economic vulnerability as- 
sociated with possible job loss, benefit generosity could be expected to have a 
greater effect the more economically vulnerable the employed individual. This 
hypothesis is also tested through the formulation of cross-level interaction effects 
between levels of unemployment benefit generosity and economic vulnerability, 
operationalized as whether the respondent thinks it would be difficult to borrow 
money if necessary to make ends meet. This measure is an attempt to capture 
the respondent's access to an economic buffer. However, this hypothesis is not 
supported by the data. There is a main effect of unemployment benefit generosity, 
in that economically vulnerable individuals living in countries with relatively 
generous unemployment benefits report significantly higher levels of subjective 
well-being compared with vulnerable individuals living in countries with less 
generous unemployment benefits. However, the effect of unemployment benefit 
generosity does not vary according to whether individuals consider themselves to 
be economically vulnerable or not. This could reflect the fact that how difficult it is 
for people to borrow money in times of financial difficulties is a poor indicator of 
their economic vulnerability. Even if people have difficulties in borrowing money, 
they might have other means to gather money in times of economic hardship. In 
addition, and perhaps more importantly, it could be argued that unemployment 
represents such a strain on the household economy that most wage earners would 
consider themselves to be vulnerable if they were to become unemployed. 

Table 2: Cross-Level Interaction Effects 

Risk 
Benefit generosity   Unemployment experience   Coefficient (SE) 
Low                  Yes                       Reference 
Low                  No                        1.910* (.958) 
High                 Yes                       [illegible in source] 
High                 No                        [illegible in source] 

Vulnerability 
Benefit generosity   Difficulty borrowing      Coefficient (SE) 
[entries illegible in source] 

Source: European Social Survey 2004/05 

Note: All models contain the full range of micro-level variables and all 
relevant (significant) macro-level variables 

Significance levels: *p < .05 **p < .01 ***p < .001 


Discussion 


The main argument of this article has been that, by reducing concerns about 
future financial security, unemployment protection systems may lessen the nega- 
tive effects of job insecurity on employed individuals’ subjective well-being. The 
empirical assessments presented are supportive of this general argument. In fact, 
from a cross-national perspective, the generosity of unemployment benefits seems 
to be as important for the subjective well-being of the employed as it is for the 
well-being of unemployed individuals. These results indicate that unemployment 
insurance schemes may have important external benefits, i.e. benefits to society 
over and above those to the unemployed who directly utilize such benefits. Thus, 
individuals need not have used unemployment insurance benefits at any 
point in their working life in order to consider access to such benefits as a source 
of welfare; the mere knowledge that they will receive help if needed may be 
considered as contributing to their welfare. 

Our belief in the results of this research depends on whether the approach em- 
ployed dealt satisfactorily with the methodological problems inherent in this kind 
of analysis, such as selection and unobserved heterogeneity. The formulation of 
“placebo tests” (or counterfactuals) and the testing of many implications of the 
general theoretical argument boost our confidence. Labor markets in advanced capitalist 
democracies have undergone profound structural changes since the mid-1970s. 
The decreased significance of long-term attachments between employer and employees 
and the concomitant growth of atypical/non-standard work mean that a 
declining proportion of the workforce is holding permanent full-time jobs. With 
job and career patterns being less long term and stable, more and more people 
suffer insecurity on the labor market. And whereas insecurity may be an impor- 
tant incentive for people to work harder and make investments in their human 
capital, the results presented here suggest that it may also have harmful effects on 
their subjective well-being and possibly on their productivity as well. Thus, it is 
important to discuss and evaluate unemployment insurance schemes not only in 
terms of their poverty-alleviating effects and their effects on incentives to work, 
but also in terms of their productivity-enhancing quality. Whether a traditional 
unemployment insurance scheme is the best way to offer insurance on a labor 
market where risks are becoming less predictable and standardized career patterns 
can no longer be assumed is open for debate. 


Notes 


1. Throughout this article, “risk” and “insecurity” will be used interchangeably. Although 
it has been argued that whereas risks can be transformed into probabilities, insecurity 
refers to unpredictable events that are not amenable to this sort of calculation (Barr 
2001), I would argue that in everyday vocabulary both risk and insecurity essentially 
mean danger or threats to people's livelihoods (Holzman and Jorgensen 2001). 


2. The problems associated with using expenditure-based measures of the welfare state 
and social protection schemes are well-known (Classen and Siegel 2007). More 
particularly, expenditures on unemployment benefits are strongly affected by the 
state of the economy, and countries may experience rising expenditures due to higher 
unemployment at the same time as unemployment benefit levels or the duration of 
benefits are reduced. Expenditure-based measures therefore need to be adjusted, for 
example by controlling for the number of recipients, which is the approach used when 
constructing one of the three measures of unemployment benefit generosity used in 
this article. 


3. It is important to emphasize that the main hypothesis in this article is that 
unemployment benefits will reduce the effect of job insecurity on subjective well-
being, not job insecurity itself. Thus, even if we had a measure of (subjective) job 
insecurity, it would be of limited relevance to the hypotheses tested here. 


4. The response rates in individual countries are: Estonia = 79.1; Greece = 78.8; Poland 
= 73.3; Czech Republic = 70.8; Finland = 70.7; Norway = 66.2; Sweden = 65.4; 
Denmark = 64.2; Austria = 62.4; Belgium = 61.2; Great Britain = 54.6; Germany = 
51.0; Slovakia = 63.3; Slovenia = 68.9; Switzerland = 48.6; Ireland = 62.5; Netherlands 
= 64.3; Hungary = 66.5. 


5. The items included in the WHO-5 scale display high internal consistency in the 
sample used here, with a Cronbach's alpha of .82. 


6. This is the latest available data point for all variables included in the benefit generosity 
index. 


7. The reason for measuring benefit expenditure per unemployed person as an average over 
a longer time period is that business cycles may have significant effects on this measure. 


8. Data on benefit generosity and duration for non-post-Communist countries come 
from the SCIP database at the Swedish Institute for Social Research. The SCIP data 
has been used in a large number of books and articles (see 
https://dspace.it.su.se/dspace/handle/10102/7). Data on expenditure for these countries as well as all data 
for the post-Communist countries have been collected within the project “Social 
Policy Strategies and Welfare Outcomes in East and Central Europe in a Comparative 
Perspective” at the Swedish Institute for Social Research. 


9. As a statistical measure of the performance of a model estimated with maximum 
likelihood, the deviance or likelihood ratio test can be used (Snijders and Bosker 
1999). The difference in deviance (defined as -2*ln(likelihood)) between two models 
has a chi-squared distribution, with the difference in the number of explanatory 
variables as degrees of freedom. For testing the significance of macro-level variables, 
the t-value with 21-1-q degrees of freedom was used, where q is the number of 
macro-level variables (Snijders and Bosker 1999:86). 


Appendix 
Macro-level Variables 


Unemployment benefits are described in the text. All variables that measure 
levels are averages for the period 2000-04. All trend variables are calculated as 
the average yearly change over the period 1990 (or closest possible year) to 2004. 


Governance 

Data from the World Bank on the quality of governance, divided into six separate 
aspects: voice and accountability, stability and lack of violence, government 
effectiveness, the regulatory framework, the rule of law, and the control of corruption. 
These indicators are summed to a single index of the quality of governance for the 
year 2002. Source: http://info.worldbank.org/governance/wgi2007/ 


Unemployment 
As a percentage of civilian labor force. Source: OECD Factbook 2008. Available 
at: http://lysander.sourceoecd.org/vl=937284/cl=31/nw=1/rpsv/factbook/. 


Inflation 
GDP Deflator (Annual growth in percentage). Source: OECD Factbook 2008 


GDP 


Real GDP per capita, $ in 2000 constant prices. Source: Penn World Tables. 
Available at: http://pwt.econ.upenn.edu/. 


Life Expectancy 
Life expectancy at birth, men and women. Source: OECD Factbook 2008. 


Employment Protection Legislation 

A summary assessment of legal provisions aimed at protecting employees from no- 
fault dismissal in terms of notice and severance pay requirements and the circum- 
stances surrounding collective lay-offs. The index also includes legal regulation of 
temporary and fixed-term contracts. The index can vary between 0 and 4, with 
a higher score indicating stricter employment protection. Source: OECD 1999. 


Ethnic Environment During Childhood and the Educational 
Attainment of Immigrant Children in Sweden 


Magnus Bygren, Stockholm University 
Ryszard Szulkin, Stockholm University 


We ask whether ethnic residential segregation influences the future educational careers 
of children of immigrants in Sweden. We use a dataset comprising a cohort of children 
who finished compulsory school in 1995 (n = 6,560). We follow these children 
retrospectively to 1990 to measure neighborhood characteristics during late childhood, 
and prospectively through 2003 to measure the number of years of education attained 
thus far. The largest immigrant groups came from Finland, Turkey, former Yugoslavia, 
Iran and Chile. Our empirical analysis reveals that immigrant children who grow up 
in neighborhoods with many young coethnics who have limited educational resources, 
obtain relatively low average grades from compulsory school, and on average, do not 
attain the same levels of education as do immigrant children who grow up elsewhere. 
For a minority of immigrant children who lived in neighborhoods with educationally 
successful young coethnics, we find a positive effect of growing up in an ethnic 
enclave. Also in this case, the effect of the ethnic environment on future educational 
attainment is mediated by school results in compulsory school. 





Introduction 


Sweden has recently witnessed a significant increase in immigration as well as 
increased ethnic residential segregation (Biterman and Franzén 2007). Together, 
these two processes have changed the ethnic composition of the local social spaces 
in which children of immigrants spend most of their time. The purpose of the 
present research is to analyze whether and how this kind of spatial segregation dur- 
ing childhood influences the educational careers of individuals with an immigrant 
background. Our intention is to go beyond the simple question of whether ethnic 
enclaves are good or bad. It is hard to tell a priori whether sharing one’s immediate 
social environment with persons of the same ethnic origin is positive or negative. 
The various characteristics of the social environment must first be discerned before 
hypotheses about the effects of the environment can be formulated (see Cutler, 
Glaeser and Vigdor 2007). When the ethnic environment encompasses a high 
proportion of individuals with advantageous social backgrounds, positive effects of 
social interaction can be expected. Children growing up in affluent environments 
can be assumed to perform better than do children with a similar social back- 
ground but without access to an affluent environment. Inversely, when the ethnic 
environment consists of a large proportion of individuals from less favorable social 
circumstances, negative effects of the social interaction may arise. The emergence 
of such interactional effects may imply that individual social background and the 
ethnic surroundings interplay in a process where the correlation between origin, 
school results and educational career is passed along to new generations. 


The Pros and Cons of the Ethnic Enclave 


The social science literature on the fortunes of immigrants in their host countries 
has long been dominated by the “assimilation perspective.” Proponents of this per- 
spective have traditionally emphasized the importance and desirability of a process 
resulting in the gradual, but preferably speedy, fusion of immigrants into their new 
environment and in their becoming “like everyone else” (Gordon 1964; Chiswick 
1978; Zhou 1997). According to the assimilation perspective, there are considerable 
disadvantages associated with residential segregation. The spatial segregation of im- 
migrants and their children from the majority is naturally accompanied by a lack of 
contact with members of the host society. Segregation also often means that many 
immigrants live in areas characterized by substantial social problems. Such areas can 
be assumed to be places where social marginalization is transferred between genera- 
tions and becomes permanent. In other words, residential segregation can produce 
obstacles to accumulation of the knowledge and skills necessary for success in adult 
life. The combination of increased immigration and ethnic segregation has altered 
the ethnic composition of the local social arenas in which immigrant youth spend 
their most formative years. Ethnic segregation is a reality in developed countries that 
have had a large influx of immigrants in the past decades. 

More recent research indicates that the assimilation perspective alone may not 
give the complete picture of the effects of segregation. Many immigrant groups 
are determined to retain their cultural uniqueness along with strong bonds to 
their countries of origin (Portes 1995). Research indicates that belonging to an 
ethnic minority that forms an ethnic enclave may have both advantages and 
disadvantages. Concepts such as “ethnic capital” and “quality of the ethnic 
environment” (Borjas 1995) are used to illustrate the fact that youth originating from 
countries with strong educational traditions may benefit from close contact with 
other people from their countries of origin after emigrating. For well-educated 
and numerically large minorities, residential segregation may facilitate frequent 
contacts with their own groups and thereby contribute to the preservation of 
educational traditions (Zhou 1997). Two assumptions underlie the belief that 
the ethnic surroundings may have an influence on students’ school results and 
educational careers. First, the social environment has an impact on educational 
success. Second, coethnicity facilitates in-group contacts. Both these assumptions 
are supported by sociological research. 

A student’s close social environment is an important source of norms, role 
models and information (see Coleman 1988). Neighborhoods and schools are 
central places for the formation of peer groups in which individuals influence each 
other’s study habits and aspiration levels, which in turn affect school results and 
educational careers. When a large proportion of members in the peer group regard 
high educational ambition levels as self-evident, the probability is relatively high 
that these aspirations will be diffused throughout the group. Inversely, in a peer 
group where the norms prohibit being a “nerd,” a negative attitude towards achieve- 
ment in school may become dominant (e.g., Farkas, Lleras and MacZuga 2002; 
Fordham and Ogbu 1986). The basic idea here is that aspirations and behavior are 
formed in interplay between individuals and their social environment. If we think 
of school performance and the educational career as reflecting individual capacity 
and individual efforts, social norms and role models may exert a strong influence 
on effort. Thus, the peer group or the social network to which young people belong 
may influence their educational performance. The social environment, as a source 
of information, may also be of importance for the educational career by influencing 
the choice of school, study program, etc. Through path dependency, these choices 
may have long-term consequences for the educational careers of children. 

Research on social networks has shown that demographic and social factors 
such as ethnicity, gender and age are important for the formation of social bonds 
or in-group contacts. These factors are also important for the location of the 
individuals (centrally or peripherally) within a network (McPherson, Smith-Lovin 
and Cook 2001). In accordance with the so-called “similarity attraction” paradigm 
(Byrne 1971), close relations are primarily formed between individuals with simi- 
lar social characteristics. Social similarity functions as a mechanism that facilitates 
communication, reduces uncertainty, and creates trust and reciprocity in social 
life (Reskin, McBrier and Kmec 1999). There are reasons to believe that similar 
ethnic backgrounds may be significant in the formation of groups in neighbor- 
hoods. Empirical studies have shown that strong ethnic networks are prevalent 
among school youngsters (Portes 1995; Zhou and Bankston 1998). Young people 
often say that their closest friends belong to their ethnic groups. Moreover, close 
contacts within one’s ethnic group have been found to have a positive effect on 
school performance (Portes 1995). 

One important characteristic of enclaves formed by immigrants may be what 
Coleman (1988) refers to as “intergenerational closure.” In social groups where 
individuals from different generations are acquainted with one another and feel 
a strong sense of community, the social relations are affected by the high density 
of the network. These so-called “closed” networks may be especially effective in 
upholding respect for mutual norms. According to Coleman, in a situation where 
many parents know the parents of other children living in the same neighborhood, 
or when these children attend the same schools as their own children, norms dictating 
diligence and perseverance, and thereby the students’ school performance 
and educational careers, may be encouraged. The expectation that ethnic groups 
have a relatively high degree of “intergenerational closure” is based on the fact that 
there is a high probability that people with the same country of origin, who live 
close to each other and who often emigrate for similar reasons, know each other 
and each other’s children and maintain strong bonds. There is, in other words, 
reason to expect that ethnic-based groups in neighborhoods will be characterized 
by a high degree of solidarity, and that this group affiliation will influence, inter 
alia, the educational performance of young people. 

The point of departure for the present empirical analysis is thus that the social 
interaction between individuals of the same ethnic background, who live in the 
same neighborhood, is relatively frequent and therefore consequential for the 
school performance, educational choices, and future educational career of children 
growing up there. Similar network mechanisms have been found to influence the 
accumulation of human capital, the probability of being unemployed, and the 
probability of becoming a recipient of social welfare (Åslund and Fredriksson 
2005; Bertrand, Luttmer and Mullainathan 2000; Borjas 1995; Hedström and 
Åberg 2005). Our hypothesis is that social interaction effects on educational careers 
are double-edged, in that they are conditional on the qualities of the ethnic 
group exerting the influence. Presumably, an ethnic group constituted by many 
doctors and lawyers will affect children’s educational ambitions differently than 
will an ethnic group dominated by, say, dishwashers in restaurants and taxi drivers. 


Previous Studies of Segregation’s Effects on Educational Attainment 


Many recent studies in the field of segregation have focused on the question of 
how social segregation in schools influences students’ school performance. The 
main conclusion reached in Swedish studies is that the social composition of 
schools influences students’ school results. Students who attend schools where 
high proportions of children have parents with university degrees and low propor- 
tions of children from families under economic hardship perform better than do 
children attending schools with the inverse characteristics. This is the case even 
when the individual’s own social background is accounted for (Erikson 1994). 
These results correspond well with results from international (primarily U.S.) re- 
search showing that social segregation hinders the school careers of those living in 
underprivileged areas (Hanushek, Kain and Rivkin 2009; Robertson and Symons 
2003; Willms 1986). At the same time, research shows that the variation in school 
results found between schools is considerably less than that within schools. This 
indicates that factors linked to the individual and/or his/her family are more 
important for school results than are factors found in the school environment per 
se. There is a related body of studies of neighborhood effects (see Borjas 1995; and 
reviews by Dietz 2002; Sampson, Morenoff and Gannon-Rowley 2002; Durlauf 
2004), showing that “middle-class” residential areas are associated overall with 
better school achievement, even after family-of-origin effects have been accounted 
for (although the causal nature of these associations is contested). 

However, as pointed out by several researchers (e.g., Ellen et al. 2002), we know 
rather little about the effects of ethnic concentration due to recent immigration.
The focus of the research (primarily carried out in the United States) has been
on effects of racial segregation. The main conclusion generated by this research 
is that a high concentration of a racial minority tends to depress school achieve- 
ment (Boozer et al. 1992; Cutler and Glaeser 1997; Hoxby 2000; Hanushek, 
Kain and Rivkin 2009). Racial concentration has a particularly strong negative
effect on the achievement of black students, and a noticeably smaller one on that of
whites and Hispanics (Hanushek, Kain and Rivkin 2009). Note, however, that
the empirical findings in this area are mixed, probably due to differences in meth- 
odology and in data (Durlauf 2004; Hanushek, Kain and Rivkin 2009). Studies 
of differences in school results between native-born students and those who have 
immigrated (first-generation immigrants) show that the former demonstrate, on 
average, better school results. The difference in school results between native 
students and the children of immigrants (second-generation immigrants) is less 
pronounced (Portes and MacLeod 1996; Similä 1994; see also Skolverket 2005).
In an analysis of ethnic school segregation and school results in Sweden (Szulkin 
and Jonsson 2007), it is shown that ethnic school segregation had negative con- 
sequences for first-generation immigrants. Schools with a relatively high number 
of children born outside Sweden seem to constitute a problematic social
environment. Further, Grönqvist (2006) finds that among immigrants in Sweden, the size
of the enclave in the municipality negatively affects the probability of graduating 
from higher education. 


Empirical Estimation of Social Interaction Effects 


Analyzing how the social environment affects individual outcomes entails several 
methodological problems (Dietz 2002; Manski 2000). Individuals living in different
residential environments differ in a number of important respects (such as school
performance, educational careers, labor market circumstances and criminality),
and these differences may broadly be attributed either to selection effects or to
effects generated in the neighborhoods. Selection effects arise
because a prior sorting process causes people in the same social environment to 
have similar observed and unobserved individual characteristics. One example of 
this is that children with the same educational or class background tend to grow 
up in the same neighborhoods. Because social background correlates with school 
performance, school performance varies between neighborhoods. The differences 
between neighborhoods with regard to future outcomes may thus be due to stu- 
dents’ (pre-selection) social background, not due to (post-selection) neighborhood 
effects. In order to capture the effects of the social environment, it is therefore 
necessary to net out potential effects of selection. 

Variation in outcomes between neighborhoods may also arise as a consequence 
of processes occurring after the sorting of individuals to neighborhoods has taken 
place. First, after moving into a neighborhood, everyone there is subject to
influence from all common characteristics of the neighborhood. In the literature, this
has been variously termed an effect of the environment (Hedström 2005), context
(Manski 2000), composition or place (Dietz 2002). Factors common to all
individuals residing in a neighborhood may make all individuals more or less likely to
pursue education. For example, a neighborhood’s school quality, local labor
market conditions and distance to the nearest university are factors that presumably
affect all youngsters in their educational choices. Second, within neighborhoods,
there may also be social interaction effects. Individuals in the same social
environment influence each other. In a neighborhood where many individuals have high
educational aspirations, the aspiration level can be raised even among children
who originally had a less ambitious attitude. Inversely, a negative attitude toward
school performance can “contaminate” children who have been accustomed to
producing good school work, drawing them into an oppositional culture of low
school effort (cf., e.g., Fordham and Ogbu 1986). These kinds of processes may
give rise to social multiplier effects, where initial small changes in behaviors can,
through chain-reactions, give rise to disproportionately large changes in behavior
at the group level.


Hypotheses and Empirical Model 


The aspect of the neighborhood environment analyzed here is how different 
properties of ethnic groups within neighborhoods influence individuals within 
the group in terms of educational pursuits. There are substantial difficulties in 
attempting to distinguish between selection effects and effects of the neighbor- 
hoods’ resources, on the one hand, and effects of the social interplay between 
the individuals in the neighborhoods, on the other. However, our dataset yields 
good possibilities for dealing with these difficulties. A “naive” analysis that does 
not take these problems into consideration risks resulting in upwardly biased 
estimates, and the effects of the social environment may be a result of mis-specified
models. On the other hand, netting out potential selection effects is likely to be 
accompanied by a reverse risk. Fixed effects of neighborhoods and ethnic groups 
are probably also proxies for characteristics of the social environment that are 
actually causally connected to the future educational career (see Bertrand, Luttmer 
and Mullainathan 2000; Cutler, Glaeser and Vigdor 2007). This may cause our 
estimated effects to be overly “conservative.” That is, in our attempts to rule out 
the influence of selection effects, we run the risk of underestimating the actual 
significance of social influence within ethnic groups on educational careers. 

Our analyses are based on the assumption that a common ethnic background 
raises the interaction frequency, and that persons with the same background repre- 
sent “significant others” in a social environment. However, social ties with coethnics 
cannot be assumed a priori to be good or bad for an individual. It is only when social
ties give access to rich social resources that positive effects can be expected (cf., Lin 
1999). Thus, we hypothesize that the quality and quantity of the ethnic environment
are not independent entities in the creation of ethnic capital. Presumably, the higher
the number of coethnics, the larger the effect of the qualitative side on educational
attainment. Our empirical model can thus be written as:


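$$
\mathrm{EDUC}_{ijk} = \alpha + \beta_1 \mathrm{QUANT}_{jk} + \beta_2 \mathrm{QUAL}_{jk} + \beta_3 (\mathrm{QUANT}_{jk} \times \mathrm{QUAL}_{jk}) + \gamma X_{ijk} + \delta_j + \theta_k + \varepsilon_{ijk} \qquad (1)
$$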

The dependent variable $\mathrm{EDUC}_{ijk}$ is a measure of the highest educational level
(in years) in 2003 for individual i with background in country j and who lived
in neighborhood k from 1990 to 1995. The central variables whose effects on
the educational career are analyzed are the variables measuring the quantity,
$\mathrm{QUANT}_{jk}$ (the number of coethnics), and quality, $\mathrm{QUAL}_{jk}$ (educational
resources of coethnics), of the ethnic group j in neighborhood k. Because the number of
coethnics for different ethnic groups as well as their quality varies within
neighborhoods, $\beta_1$ through $\beta_3$ are identifiable. Whether size ($\beta_1$) of the ethnic enclave per se is
related to educational outcomes of its members is an open question. If we are
to expect any relationship, it is a negative one, as growing up in a neighborhood
with a large ethnic enclave may hamper acquisition of host country language
skills. For the qualitative side of the ethnic enclave ($\beta_2$), we hypothesize that a
“high quality” environment results in positive effects for members, and a “low
quality” environment in negative effects. The main hypothesis of the present
study is, however, that effects of the quantitative side of the ethnic enclave are
conditioned by the quality of coethnics in the neighborhood. This hypothesis
rests on the assumption that in neighborhoods with many coethnics, the
probability is higher that social influence processes endogenously reinforce behaviors
and attitudes within the group of coethnics, simply because competing
identity-based groups of influence are relatively smaller. To capture the joint effect of
quality and quantity we introduce the interaction term ($\mathrm{QUANT}_{jk} \times \mathrm{QUAL}_{jk}$)
into the equation, and our expectation is that its associated coefficient $\beta_3$ is
greater than zero.
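The interaction specification can be illustrated with a minimal sketch. It fits a model with an intercept, a quantity term, a quality term and their product to noise-free synthetic data, so that the fit simply recovers the coefficients used to generate the data. The coefficient values, the helper `ols` and the function `marginal_effect_of_quantity` are hypothetical and chosen only for illustration; they are not the estimates reported in Table 3, and the controls and fixed effects of equation (1) are omitted.

```python
def ols(rows, y):
    """Least-squares coefficients from the normal equations (X'X) b = X'y."""
    k = len(rows[0])
    # Augmented matrix [X'X | X'y].
    a = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
         + [sum(r[i] * yi for r, yi in zip(rows, y))] for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        p = a[col][col]
        a[col] = [v / p for v in a[col]]
        for r in range(k):
            if r != col:
                f = a[r][col]
                a[r] = [v - f * w for v, w in zip(a[r], a[col])]
    return [a[i][k] for i in range(k)]


# Hypothetical coefficients: negative main effect of quantity,
# positive interaction with quality.
ALPHA, B1, B2, B3 = 11.0, -0.6, 0.1, 0.2

# Synthetic grid of (log quantity, quality) combinations.
data = [(q, u) for q in (0.0, 1.0, 2.0, 3.0) for u in (2.0, 3.0, 4.0)]
X = [[1.0, q, u, q * u] for q, u in data]
y = [ALPHA + B1 * q + B2 * u + B3 * q * u for q, u in data]

alpha, b1, b2, b3 = ols(X, y)


def marginal_effect_of_quantity(qual):
    """Change in predicted education per unit of log quantity at a given quality."""
    return b1 + b3 * qual
```

With these illustrative values, `marginal_effect_of_quantity` is negative at low quality and positive at high quality, which is the sign pattern the main hypothesis describes.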

$X_{ijk}$ is a vector including individual or family background characteristics, which
we believe play an important role in the individual’s educational performance and
educational attainment. This vector encompasses the individual’s sex, whether the
individual is born in Sweden, the education of the parents, whether the parents
receive unemployment benefits, whether the parents receive social assistance, and
whether the family is intact. To account for the fact that ethnic or national groups
may differ with regard to educational traditions and the value attached to higher
education, we include fixed effects $\delta_j$ of the individual’s country of origin in the
model. Further, neighborhoods may differ systematically with regard to factors
affecting the educational attainment among inhabitants, which is why we include
a set of neighborhood fixed effects $\theta_k$. All neighborhood-specific effects on
educational attainment common to individuals within the neighborhood are thereby
accounted for. Finally, in order to analyze whether ethnic environment has a direct
impact on the educational career, or is mediated through children’s school success,
we include in $X_{ijk}$ the grade point average of the individual in the 9th grade.
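How neighborhood fixed effects absorb everything common to a neighborhood can be shown with a small sketch. It uses the within transformation (subtracting neighborhood means), which for a single set of fixed effects gives the same slope as including one dummy per neighborhood. The data, the function names and the single-regressor setup are assumptions made for illustration; the model in the text also includes country-of-origin effects and controls, which are left out here.

```python
from collections import defaultdict


def pooled_slope(x, y):
    """Slope of y on x ignoring neighborhoods (the "naive" estimate)."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    num = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    den = sum((xi - mx) ** 2 for xi in x)
    return num / den


def within_slope(x, y, groups):
    """Slope of y on x after removing neighborhood means from both."""
    sums = defaultdict(lambda: [0.0, 0.0, 0])
    for xi, yi, g in zip(x, y, groups):
        sums[g][0] += xi
        sums[g][1] += yi
        sums[g][2] += 1
    num = den = 0.0
    for xi, yi, g in zip(x, y, groups):
        sx, sy, n = sums[g]
        dx, dy = xi - sx / n, yi - sy / n
        num += dx * dy
        den += dx * dx
    return num / den


# Hypothetical data: neighborhood "B" has both higher x and an
# unobserved advantage of 2 years; the true slope is 0.5.
groups = ["A", "A", "A", "B", "B", "B"]
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
effect = {"A": 0.0, "B": 2.0}
y = [10.0 + 0.5 * xi + effect[g] for xi, g in zip(x, groups)]

naive = pooled_slope(x, y)
fixed = within_slope(x, y, groups)
```

In this constructed example the pooled slope is roughly twice the true value because the neighborhood advantage is correlated with x, while the within estimate returns the slope used to generate the data.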


Data 


We used a dataset including information on a cohort of all students in the Swedish
comprehensive schools who attended the 9th grade in 1995. The 9th grade is the final
year in the compulsory Swedish school system, and it is the year in which students
(usually) turn 16. The data were obtained from an official school register of 9th
graders, and include information on their final grades. We selected those individuals for
whom both parents were born abroad. We matched these data with information on
(1) the students’ highest education in 2003 (i.e., the year most of them turned 24),
(2) their parents’ country of birth, education, civil status, social assistance benefits
and unemployment benefits, and (3) the country of birth and education of all
individuals above the age of 16 living in their residential area in the years 1990-1995.
We matched the data using the individual identification numbers that are used in
all official Swedish individual-level registers. Thus, data on students’ achievements
were combined with information on their families and their neighborhoods. Below,
we describe in greater detail how we constructed the variables.


Dependent Variable 


Our dependent variable is the highest educational level achieved in 2003. We 
used the Swedish standard classification of education (Statistics Sweden 1988) 
and recoded 38 categories of education into their corresponding years of educa- 
tion. In our data, this variable has 10 unique values, ranging from 8 (incomplete 
compulsory education) to 17 (five years of postsecondary education). We treated 
this variable as continuous in the statistical models. 


Children and Their Parents 


Because of concerns about the protection of privacy, the variable measuring na- 
tional origin is not a direct indicator of country of birth, but has been collapsed by 
Statistics Sweden into 28 categories, where relatively small neighboring countries 
and countries with language similarities have been grouped. The categorization 
is not ideal, as some groups are very heterogeneous with regard to the countries 
of origin (e.g., “Asia, other”). In a sensitivity analysis, we included only those 
individuals for whom the exact country of birth is reported, and the results of this
analysis did not deviate substantially from the results reported.4 If the parents
were born in two different countries, the mother’s country of birth is used to designate
the child’s background. The children of immigrants are either born abroad or in Sweden, and
we distinguished between these groups using the variable Born in Sweden with a
value 1 if the child was born in Sweden and 0 otherwise. Adopted children were
not included in the dataset. The variable Girl has value 1 if the child is a girl, 0 if
the child is a boy. The variable Grade point average refers to the child’s average of
the grade points for 20 subjects, given on a 1-5 scale in the final (9th) compulsory
year of education. Individuals whose grade was incomplete (usually because of
absence) received a 0, which means that the GPA can range from .0 to 5.0. We
distinguished between children living in intact and non-intact families, where we
assigned the value 1 to the variable Intact family if the child is living with both
biological parents in 1995.5 Furthermore, we have data on parents’ level of
education, unemployment experience, and experience of receiving social welfare. We
measured these variables in the year when the child left compulsory school, that 
is, 1995. We used Years of education, highest educated parent to measure the educa- 
tion of the parents jointly. We assigned the value 1 to the variable Parent receives 
social assistance if the household to which the child belongs received means-tested 
social assistance during 1995, otherwise 0. We assigned the value 1 to the variable 
Parent unemployed if at least one of the parents received unemployment benefits 
during 1995, otherwise 0. 


Neighborhoods and Ethnic Environments 


We defined a neighborhood using Statistics Sweden’s detailed geographical SAMS 
codes (Small Area Marketing Statistics). These codes have been developed in 
collaboration with Swedish municipalities, to designate relatively small and
socially homogeneous neighborhoods.6 To avoid conflating family characteristics
with neighborhood characteristics, we excluded the individual’s family members 
from the computation of all aggregate neighborhood measures. In the analyses, 
we distinguished between the quantitative and qualitative side of the student's 
ethnic environment (cf., Bertrand, Luttmer and Mullainathan 2000). We also 
distinguished between the older (23 years +) and younger coethnics (16-22 years) 
in the neighborhood. We defined the quantitative side of the neighborhood ethnic 
surrounding to be the natural log of (1) the average number of coethnics 16-22
years of age, and (2) the average number of coethnics 23 years + residing in the
neighborhood (or neighborhoods, if they have moved between them) during the
years 1990-1995.7 For the younger coethnics, we defined the qualitative side to be
their average grades from compulsory school. For the older coethnics, we defined 
the qualitative side to be their average years of education. In order to investigate 
the possibility of multiplicative effects of the quality and quantity of the ethnic 
environment, we created interaction terms for them. After exclusion of 2,638 
individuals with missing values on any of the variables, most commonly because 
no younger coethnics resided in the neighborhood, we had a sample of 6,560 
individuals. We compared the highest education in 2003 for the excluded cases 
with those included, and the excluded cases had, on average, .3 years more educa- 
tion, suggesting that the immigrant children who are included in our analytical 
sample fared somewhat worse than did those immigrant children who grew up in 
neighborhoods without non-family coethnics.8 Descriptive statistics are presented
in Table 1. One may note the high levels of unemployment and social assistance
among the immigrant children’s parents. These are a reflection of the precarious
labor market situation of immigrants in Sweden, which was aggravated by the
recession at the beginning of the 1990s.
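The construction of the quantitative measure can be sketched as follows: count coethnics in an age band in the child's neighborhood for each year, leave out members of the child's own family, average over the years, and take the natural log. The record layout (`Resident`), the function name and the toy rosters are hypothetical; returning `None` when the average is zero is an assumption that mirrors the exclusion of cases with missing values described above.

```python
import math
from typing import NamedTuple


class Resident(NamedTuple):
    family_id: int
    neighborhood: str
    origin: str
    age: int


def log_avg_coethnics(family_id, origin, homes, rosters, min_age, max_age=None):
    """Log of the yearly average number of non-family coethnics in an age band.

    homes maps year -> the child's neighborhood that year;
    rosters maps year -> list of Resident records for that year.
    """
    yearly = []
    for year, neighborhood in homes.items():
        n = sum(
            1
            for r in rosters[year]
            if r.neighborhood == neighborhood
            and r.origin == origin
            and r.family_id != family_id  # exclude the child's own family
            and r.age >= min_age
            and (max_age is None or r.age <= max_age)
        )
        yearly.append(n)
    mean = sum(yearly) / len(yearly)
    return math.log(mean) if mean > 0 else None  # log undefined for zero


# Toy rosters for two years (hypothetical values).
rosters = {
    1990: [
        Resident(1, "A", "X", 40),  # the child's own parent: excluded
        Resident(2, "A", "X", 18),
        Resident(3, "A", "X", 20),
        Resident(4, "A", "Y", 19),  # different origin: not a coethnic
        Resident(5, "A", "X", 30),
    ],
    1991: [
        Resident(2, "A", "X", 19),
        Resident(5, "A", "X", 31),
    ],
}
homes = {1990: "A", 1991: "A"}

young = log_avg_coethnics(1, "X", homes, rosters, 16, 22)
old = log_avg_coethnics(1, "X", homes, rosters, 23)
```

Because `homes` is keyed by year, a child who moved contributes counts from each neighborhood lived in, in line with the parenthetical remark about movers in the definition above.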


Results 


To provide an overview of the different fates of the immigrant groups, we list in
Table 2 average values of crucial variables for these groups. The groups are ordered
according to their size in the sample. There are some noteworthy differences in
grades between the ethnic groups. For ethnic groups with at least 100 individuals
in our data, we can see that children from Iran, Poland, South-East Asia and Eastern
Europe 1 (Albania, Bulgaria, Romania and the former Soviet Union excluding the Baltic
states) are relatively successful at the compulsory school level. Results for years of
schooling (in column two) generally repeat this pattern. High grades in compulsory
school translate into future success at higher levels of the educational system. There
is a correlation between the size of the immigrant group and the average number of
coethnics in the neighborhood, but it is far from perfect (r = .58).

[Table 2 continued. Columns: Country of Origin; Average GPA, 9th Grade; Average Years of Education; Average Number of Coethnics in ...]

In Table 3, we report the results from six 
models with estimates of the effects of ethnic 
environments in neighborhoods on the high- 
est educational level achieved eight years after 
compulsory school. The first model in Table 3 
is the “naive model,” which assumes that the 
quantitative and qualitative sides of the ethnic 
environment in neighborhoods are the only 
factors influencing the children’s educational 
careers. In Model 2, the effects of ethnic en- 
vironment are estimated controlling for indi- 
vidual and family-related variables. In Model 3, 
the fixed effects for neighborhoods and ethnic 
groups are added. For these fixed effects, we 
defined the neighborhood of an individual to 
be his or her modal residential neighborhood 
during the period 1990-1995 (1,797 neighbor- 
hoods). In Model 4, in order to analyze how 
much of the neighborhood effect is mediated 
by success in compulsory school, we added the 
GPA from compulsory school. Finally, in mod- 
els 5 and 6, the analyses from models 3 and 4 
are re-run on a sample restricted to individuals 
who did not move between neighborhoods 
during the period 1990-1995. 
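Two of the sample definitions used for the models, the modal neighborhood and the restriction to children who did not move, can be sketched in a few lines. The neighborhood codes below are made up, and the tie-breaking rule (the earliest code among equally frequent ones) is an assumption, since the text does not say how ties were resolved.

```python
from collections import Counter


def modal_neighborhood(yearly_codes):
    """Most frequent neighborhood code over the observation years.

    Ties go to the code that appears first (an assumption; the
    source does not describe a tie-breaking rule).
    """
    return Counter(yearly_codes).most_common(1)[0][0]


def is_stayer(yearly_codes):
    """True if the child lived in one neighborhood throughout the period."""
    return len(set(yearly_codes)) == 1


# Hypothetical neighborhood codes for 1990-1995.
mover = ["N01", "N01", "N02", "N02", "N02", "N02"]
stayer = ["N07"] * 6

mover_home = modal_neighborhood(mover)
stayer_home = modal_neighborhood(stayer)
```

Under these definitions, models 3 and 4 would assign the mover to `N02` for the fixed effects, while models 5 and 6 would keep only records for which `is_stayer` is true.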

The first analysis investigates whether there is a correlation between the number of
years of education achieved by the individual eight years after compulsory school and
the character of the ethnic environment of the childhood neighborhoods. Model 1 shows that
individuals growing up in neighborhoods with a relatively high number of coethnics
(both young and adult) have lower performance, in terms of educational careers,
than do individuals who lived in neighborhoods with relatively few coethnics. The
main effect of the qualitative side of the ethnic environment is small and
insignificant. However, the joint effect of the quantitative and qualitative sides of the ethnic
environment (the interaction term) is positive and significant. Individuals brought
up in neighborhoods with high numbers of coethnics who are well educated (the
adult group) or reach high levels of achievement in school (the young group) are
characterized by relatively high levels of educational attainment. A joint analysis
of the effects of the quantitative and qualitative sides of the ethnic environment is
presented in figures 1 and 2. We restrict our predictions to actually observed
combinations of quality and quantity. They indicate that there is a critical average level of
grades/educational level that conditions whether the increase in number of coethnics
is negative or positive for the future educational career. As can be seen in Figure 1, for
individuals growing up in neighborhoods where young coethnics have low average
grades, the increase in number of coethnics is negative for individuals’ educational
attainment. If the average level of grades is equal to or higher than 3.2, the increase
in number of coethnics is positive for educational careers. Similarly, as can be seen
in Figure 2, there appears to be a critical level of about 14 years of education above
which increasing numbers of older coethnics have a positive impact on educational
careers. However, individuals residing in these kinds of neighborhoods constitute a
minority of the immigrant children in our sample: 23 percent grew up in
neighborhoods where coethnic youth had average grades equal to or above 3.2, and only 1.2
percent grew up in neighborhoods where adult coethnics had an average education
of 14 years or more. It is also worth mentioning that Model 1 accounts only for
3.4 percent of the variance in individuals’ years of education, indicating that ethnic
environment is not the major source of variation in educational careers of young
people of immigrant origin in Sweden.
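The critical levels described here follow from the interaction specification in equation (1). As a sketch, holding all other terms fixed and writing $\beta_1$ for the coefficient on quantity and $\beta_3$ for the coefficient on the interaction term, the effect of an increase in (log) quantity and the quality level at which it changes sign are:

$$
\frac{\partial\,\mathrm{EDUC}_{ijk}}{\partial\,\mathrm{QUANT}_{jk}} = \beta_1 + \beta_3\,\mathrm{QUAL}_{jk},
\qquad
\beta_1 + \beta_3\,\mathrm{QUAL}^{*} = 0 \;\Longrightarrow\; \mathrm{QUAL}^{*} = -\frac{\beta_1}{\beta_3}.
$$

The symbol $\mathrm{QUAL}^{*}$ is introduced only for this illustration. With $\beta_1 < 0$ and $\beta_3 > 0$, an increase in the number of coethnics lowers predicted education when quality is below $\mathrm{QUAL}^{*}$ and raises it when quality is above. The thresholds of 3.2 (grades) and about 14 years (education) would correspond to this ratio for the young and adult coethnic measures, respectively.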

In Model 2, we add controls for individual-level variables and family back- 
ground. The idea behind this model is that endogeneity causes an upward bias 
in estimates of neighborhood effects in Model 1. Families group themselves in
residential areas, and children from affluent families have a high probability of
ending up in affluent social environments. The opposite can be assumed to be
the case for children from families with relatively low resources. Thus, it can be
expected that controls for family background will lower the association between
ethnic environment and educational career among the individuals studied. The
results from the analysis in Model 2 do not unambiguously confirm this
presumption. Even though the associations between our two measures of the quantitative
side of the neighborhood and the educational careers of the individuals studied
are somewhat attenuated, the joint effect of the quantitative and the qualitative
sides remains almost unchanged.

Figure 1. Predicted Years of Education for Different Levels of Grades of
Coethnics Ages 16-22
Note: the values are computed using coefficients in column 1, Table 3 (no controls).


Thus, the results still indicate that there are some negative externalities of the 
quantitative side of the ethnic environment and some positive externalities of the 
joint measures of quantity and quality of the ethnic environment for the indi- 
vidual educational careers. Adding measures of individual and family background 
substantially increases the variance explained. Model 2 accounts for 11.5 percent 
of the variance in individuals’ years of education. 

As mentioned previously, our estimates of ethnic surroundings may be biased
due to several processes: population sorting (or unobserved neighborhood
heterogeneity) and environmental characteristics that are common to a neighborhood and
that influence all young people and their long-term aspirations and educational
careers. Distinguishing effects of the ethnic environment from these alternative 
sources of influence entails considerable methodological problems. In the analysis 
above, we have used a relatively “rich” individual-level model. However, we can- 
not claim that all neighborhood heterogeneity is controlled for in this model. 
For example, there may be unmeasured neighborhood variables that are related 
to immigrant children’s educational careers, but are unrelated to the measures of 
social background in the model. If some families with relatively low human capital
and/or social problems are characterized by high levels of educational aspiration
for their children, they may move to neighborhoods with “positive” social
characteristics. In this case, the estimates of the ethnic environment in Model 2 may be
upwardly biased. Introducing neighborhood dummies into the equation (Model
3) is an efficient way of dealing with this problem. This procedure allows us to
account for all unobserved neighborhood specific effects common for the children
residing there. Introducing dummy variables for ethnic groups similarly avoids
the problem that ethnic environments may simply mirror varying educational
traditions in immigrant groups from different countries rather than the effects of
social interaction in an ethnic group in a neighborhood.

Figure 2. Predicted Years of Education for Different Levels of Education of
Coethnics Ages 23+
Note: the values are computed using coefficients in column 1, Table 3 (no controls).

Model 3 shows the results from the analysis of the effects of ethnic environ- 
ment, taking into consideration these extensive controls for individual-, ethnic 
group- and neighborhood-level variables. The effects of ethnic environment are 
modified with this model specification. The estimates indicating the effects of 
quantitative and qualitative characteristics of young coethnics in a neighborhood 
(and their joint effect) are somewhat larger than the corresponding estimates in 
the previous model. The joint effect of the quantitative and qualitative sides of
the ethnic environment indicates that sharing an ethnic environment with many
young people of the same origin, who are successful in school, creates favorable 
conditions for a future educational career. The estimates indicating the charac- 
teristics of adult coethnics in the same neighborhood are all attenuated towards 
zero and become statistically insignificant. Thus, the characteristics of the younger 
group of coethnics seem to matter more in the process of educational attainment. 
Adding fixed effects for neighborhoods and ethnic groups further increases the 
variance explained. Model 3 accounts for 15 percent of the variance in individuals’ 
years of education. 

In Model 4, we add average grades from the 9th grade into the equation. The
main result is that all parameters indicating the effects of the quantitative and 
the qualitative side of the ethnic environments (and their joint effect) are statisti- 
cally not different from zero. Thus, the effects that the characteristics of young 
coethnics in the neighborhood have on future educational career arise on lower 
educational levels, and disappear when differences in grades from compulsory 
school are accounted for. It is worth mentioning that the variance explained in 
Model 4 increases substantially. In this model, about 43 percent of the variance in
individuals’ years of education is accounted for, indicating that the grades from compulsory
school are a very important predictor of educational careers. 
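The logic of this mediation test can be illustrated with a constructed example in which a neighborhood characteristic affects years of education only through the 9th-grade GPA. The regression of education on the characteristic alone then shows an effect, and adding GPA drives the coefficient to zero. All variable names and numbers below are hypothetical and chosen so that the arithmetic is exact; they are not taken from the data.

```python
def cross(a, b):
    """Sum of cross-products of deviations from the means."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    return sum((x - ma) * (y - mb) for x, y in zip(a, b))


def slope_simple(x, y):
    """Coefficient on x in a regression of y on x (with intercept)."""
    return cross(x, y) / cross(x, x)


def slope_controlling(x, m, y):
    """Coefficient on x in a regression of y on x and m (with intercept)."""
    sxx, smm, sxm = cross(x, x), cross(m, m), cross(x, m)
    sxy, smy = cross(x, y), cross(m, y)
    return (smm * sxy - sxm * smy) / (sxx * smm - sxm ** 2)


# Hypothetical neighborhood characteristic (e.g., the joint
# quantity-quality measure) and an unrelated influence on grades.
env = [0.0, 1.0, 2.0, 3.0, 0.0, 1.0, 2.0, 3.0]
other = [1.0, 1.0, 1.0, 1.0, -1.0, -1.0, -1.0, -1.0]

# GPA depends on the environment; education depends only on GPA.
gpa = [2.0 + 0.5 * e + o for e, o in zip(env, other)]
educ = [6.0 + 2.0 * g for g in gpa]

total_effect = slope_simple(env, educ)             # without GPA
direct_effect = slope_controlling(env, gpa, educ)  # with GPA added
```

A coefficient that vanishes once GPA is included, as in Model 4, is what full mediation through compulsory school grades looks like in this simplified setting.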

One problem with many neighborhood effects studies is that neighborhood 
affiliation is measured for a very limited period of time. Short-term fluctuation 
and incorrect measurement of neighborhood affiliation then attenuate any true 
effect of neighborhood. We tried to circumvent this problem by extending the 
period of time in which we measure neighborhood characteristics from 1990 to 
1995. Still, one may argue that children who move during the period in which we 
measure their neighborhood affiliation should be less affected by the coethnics living
there, simply because they fail to get to know the people living there. If this is
a real problem, our estimates of neighborhood effects would be biased downwards. 
To assess the magnitude of this problem, we restricted the sample to include only 
those children who did not move between neighborhoods during the period in 
question, and reestimated models 3 and 4. By restricting the sample to this group, it 
turns out that the coefficients associated with the young ethnic surrounding in the 
childhood neighborhood increase in both size and significance (Model 5). However, 
the parameter estimates associated with the adult coethnic surrounding remain low 
and insignificant. In other words, characteristics of young coethnics growing up in 
the same neighborhood seem to be important for educational success in this group. 
The main effects of the quantitative and qualitative sides of the ethnic environment
ages 16-22 are negative, and their joint effect is positive.9 Thus, growing up in a
large coethnic community (of young people) seems to be detrimental to educational
attainment. This effect is, however, conditional on the group’s average school
success. The joint analysis of the effects of the quantitative and qualitative sides of the
ethnic environment presented in Figure 3 indicates that there is a critical average
level of grades that conditions whether the increase in number of coethnics is
negative or positive for future educational attainment. We can see that for children
growing up in ethnic surroundings with low average grades, an increase in the
number of coethnics is negative for their educational attainment. As we found in
our “naive” Model 1, if the average level of grades is equal to or higher than the
threshold of 3.2, the increase in number of coethnics is positive. A minority of just
23 percent of the individuals in our sample grew up in neighborhoods where the
young coethnics had achieved this or a higher level of grades. The joint effect of
quality and quantity is nontrivial, around a six-month to one-year difference in
education depending on circumstances in the childhood ethnic enclave.

Figure 3. Predicted Years of Education for Different Levels of Grades of Coethnics
Ages 16-22, Net of Controls

Adding average grades from the 9th grade into the equation (Model 6) indicates
that the results in Model 5 are fully mediated by success in the compulsory school. 
That is, all of the effect of the quality and quantity of the ethnic environment in the 
childhood neighborhood is channeled through grades in the compulsory school. 

In summary, the pattern of an increasing effect of coethnic characteristics
with an increasing quantity of these coethnics is consistent with our theoretical
expectations: that the joint effect of these factors is greater than zero. The
enclave effects that we were able to identify were fully accounted for by final
school results in 9th grade.


Concluding Discussion 


The question asked in this study is whether and how ethnic residential segregation 
influences the educational careers of immigrant children. The central premise 
behind the analyses is that a shared ethnic background facilitates social interac- 
tion between individuals, and that ethnic enclaves in neighborhoods constitute 
important socialization arenas for children growing up there. Among other things, 
interaction within an ethnic group in a neighborhood may influence levels of aspi- 
ration, study habits and norms concerning the value of education for an individual. 
To understand this social process, we need information on the characteristics of 
ethnic enclaves. In the empirical analysis, we distinguish between the quantity 
and quality of ethnic groups within neighborhoods. The quantitative side is the 
number of coethnics living in the neighborhood. The qualitative side measures 
the educational resources of the ethnic group in the neighborhood. We expected 
an environment with many coethnics and relatively limited educational resources 
to affect educational careers negatively, and an environment with many coethnics 
and relatively high educational resources to affect educational careers positively. 
Thus, we believe the direction of the effect of ethnic concentration in a residential
area to be conditional on the qualities of the ethnic enclave studied (cf.,
Glaeser, Cutler and Vigdor 2007).

We used a dataset comprising a cohort of immigrant children who graduated from
compulsory school in 1995. We followed these children retrospectively to 1990
to measure neighborhood characteristics during late childhood, and through to 
2003 to measure their educational attainment thus far. Our empirical analysis 
showed that growing up in an ethnic enclave reduces the immigrant children’s 
educational attainment if they reside in the enclave for an extended period of 
time and if the average educational resources of the young coethnics in their 
residential area are medium to low. The larger the number of young coethnics 
in the neighborhood, the more pronounced the negative effect. The effect of the 
neighborhood is, however, fully captured by the school results from compulsory 
education. That is, children who grow up in neighborhoods with many young 
coethnics who have limited educational resources obtain relatively low average 
grades from compulsory school, and do not continue, on average, to levels of
education as high as those reached by children who grow up elsewhere. Although the effects
we find are nontrivial, we would like to underscore that the ethnic environment 
during childhood is not the major source of variation in educational careers of 
young people of immigrant origin. 

Thus, even though its influence on educational careers is limited compared to
other factors (e.g., family-related), the downside of the ethnic enclave is visible in
our analyses. We also expected that relatively large and resourceful ethnic environments
would promote the educational achievement of inhabitants. We do find such
a positive effect, but it pertains only to the minority of immigrants who grew up 
in neighborhoods with educationally successful young coethnics. Also in this case, 
the effect of the ethnic environment is mediated by results in compulsory school. 

Thus, the effect of the ethnic surrounding on later educational attainment
builds exclusively on school results achieved in childhood. This result is in line with 
Heckman’s (2006) observation that educational attainment primarily seems to rest 
on foundations put in place early in life. Heckman’s conclusion is based on an analy- 
sis of individual achievement tests of very young children. Our data on school grades 
come from 16-year-old students. Although we lack data on earlier achievements, 
the result indicates that the direct effect of the childhood environment is relatively 
limited in time. It is interesting to note that other studies (e.g., Erikson et al. 2005) 
have shown that the influence of students’ social background on their educational 
careers operates both in terms of school results, “primary effects,” and in terms 
of propensities for transition to higher educational levels, “secondary effects” (cf.,
Boudon 1974). Our results indicate that the influence of the local ethnic surrounding
can be characterized exclusively as a primary effect. Beyond this primary effect,
we do not find any remaining effect of the ethnic surrounding that can be classified
as secondary (see Åslund et al. 2006 for similar findings).

Our empirical model builds on the fact that the quantity and quality of ethnic
groups vary within neighborhoods. This variation allows us to eliminate the
influence of unobserved characteristics of neighborhoods and ethnic groups on 
the analyzed outcome by utilizing fixed effects for neighborhoods and ethnic 
groups. We believe that this gives us a sound analytical model that can be used 
for causal analysis of the impact of different characteristics of the ethnic environ- 
ment in childhood on stratification outcomes later in life. However, the analyses 
presented here may still have some limitations. The value of continuing to higher 
education may covary with some (unknown) characteristic within cells defined 
by ethnic groups and neighborhoods. For example, the better the opportunities 
for employment in an ethnic enclave, the less essential higher education may 
be for its inhabitants. If this is the case, our estimated effects may be biased. 
We would not like to discard this alternative hypothesis altogether, but find it 
implausible to assume that this kind of mechanism completely confounds our 
results. Another limitation of the dataset is that the actual interaction frequency 
between individuals living in an ethnic enclave is unknown. An alternative to the 
strategy employed here would be to collect representative interview data on the 
ethnic groups’ characteristics, frequency of contact between group members, and 
group members’ attitudes and performances. This type of data, however, does not 
exist. The, admittedly imperfect, method used here is common in studies where 
conclusions about possible social interaction effects are drawn from large-scale 
data (e.g., Bertrand, Luttmer and Mullainathan 2000), but future studies in this 
area would benefit from using data that provide more accurate measures of the
underlying hypothesized process of social influence. 

Is it good or bad to grow up in an ethnic enclave? We conclude that, for the 
educational career, the effect of the ethnic enclave is, as a rule, negative. For 
the majority of immigrant children, the larger the coethnic community in their 
neighborhoods, the lower their grades in compulsory school, and as a consequence, 
the shorter their educational careers. Only a minority of immigrant children who 
grow up in neighborhoods with a high level of education among coethnics benefit,
in terms of educational attainment, from growing up in an ethnic enclave. 

Are the results of the present study unique to the Swedish context? We would 
argue that our results, in an international comparison, are lower bound estimates 
of the effects of growing up in an ethnic enclave. One of the major goals of the 
Swedish educational policy has been to decrease the correlation between social 
background and educational attainment, and inequality of educational opportunity
and social inequality are relatively limited in Sweden (Björklund et al. 2005,
Erikson and Jonsson 1996). The variation in school quality between neighbor- 
hoods is fairly low, and higher education is heavily subsidized. The role of ethnic 
isolation, resulting from residential segregation, in the process through which the 
social advantage of parents translates to the educational career of children, should 
therefore be rather small in Sweden. Still, our estimates of the effect of growing 
up in a low-resource ethnic enclave are nontrivial. This suggests the significance of 
ethnic enclaves, for the future educational career of immigrant youth from such 
areas, to be rather high in countries and institutional contexts in which there is 
more ghetto-ization, more variation in standards across neighborhoods, and where 
there are higher costs associated with higher education. 


Notes 


1. We use the concept “ethnicity” to designate the individual’s or his/her parents’ country
of birth. Naturally, we are aware that persons with the same country of birth may 
belong to different ethnic groups. For example, individuals who were born in Turkey 
may be Kurds, Assyrians, Roma, Arabs, Greeks, Armenians or ethnic Turks. 


2. American researchers ask the following question as well: assimilation to what?
Children of non-white immigrants may lack channels for being assimilated into
society's better-off segments. Contacts with native-born persons who reside in
disadvantaged neighborhoods may instead cement norms and cultures of exclusion
(Portes 1995; Zhou 1997).


3. In our data, there is a small group of students (n = 21) whose grades in 9th grade were
incomplete. We assigned these children eight years of education to follow the practice 
of how these cases are treated in the Swedish educational register. 


4. To further check the robustness of our results, we reestimated the models excluding
immigrants from (1) Finland and (2) the “Western hemisphere” (the Nordic countries,
Western Europe, USA and Canada). The results were very similar to those reported
in this article.


5. Previous research has shown that children who have one Swedish-born and one
foreign-born parent are similar to those who have two Swedish-born parents, as far as
school results are concerned. The real divide is between these groups and those who do
not have a Swedish-born parent and thus have a non-Swedish language as their mother
tongue (Similä 1994).
We only include those who do not have a Swedish-born parent. 


6. The number of SAMS areas in Sweden is around 9,000, with an average population 
of 1,000 persons. A SAMS area is identical to an electoral area in densely populated 
areas. In less densely populated areas, it is identical to a so-called NYKO area. NYKO 
areas (nyckelkodomraden) are used by the municipalities for descriptive purposes, 
e.g., to keep track of the population size of different neighborhoods. 


7. Most empirical research on neighborhood effects uses data at a single point in time. 
As noted by Ellen and Turner (1997), this approach poses at least two potential 
problems: (1) measurement error because of random between-year fluctuations in
neighborhood characteristics, and (2) insensitivity to the length of time an individual
has been exposed to a neighborhood environment. In our study, we try to circumvent 
these problems by making use of six consecutive yearly observations of neighborhood 
membership during the period the children are 10 to 16 years of age. 


8. To assess whether we could account for the selection process into the sample with our 
observables, we estimated a regression model including those individuals who were 
excluded from the analytical sample, for whom we had information on variables other 
than those measuring the local ethnic community. We could account for the selection 
effect on educational attainment with controls for ethnic group, parents’ education, 
and dummies for social assistance and whether the family was intact. Thus, we judge 
the risk of selection bias in our estimated neighborhood-related effects to be low. 


9. The negative main effect of average grades is largely a consequence of the coding
of the quality/quantity variables, which effectively makes the main effects of the 
quality*quantity interaction theoretical starting values. None of the children had zero 
values on either the quality, quantity or interaction variables, which is why the main 
effect coefficient for quality (and quantity) refers to non-existing children. 


Neighborhood Social Influence and Welfare Receipt in
Sweden: A Panel Data Analysis 


Carina Mood, Stockholm University 


This article places the choice to claim welfare benefits in a social context by studying 
how neighborhood welfare receipt affects welfare receipt among couples in Stockholm, 
Sweden. It is expected that the propensity to claim welfare should increase with welfare 
use in the neighborhood, primarily through stigma reduction and increasing availability 
of information. I use individual-level panel data (N = 1,595,843) for the Stockholm 
County population during the 1990s, data that contain a wide range of information and 
allow extensive controls for observed and unobserved confounding factors. The results 
from pooled and fixed-effects logistic regressions suggest that welfare receipt among 
people in the same neighborhood substantially increases the number of households 
entering the welfare system (inflow), but the effects on outflow are negligible. 


Introduction 


Despite economic incentives to do so, poor people often fail to claim welfare 
benefits. Substantial non-take-up of such benefits has been found in Western 
countries with different welfare state organizations and with different poverty lev- 
els, such as the United States (U.S. Department for Health and Human Services 
2005), the United Kingdom (U.K. Department for Work and Pensions 2004), 
Sweden (Gustafsson 2002), Germany (Kayser and Frick 2001; Riphahn 2001) 
and Australia (Mood 2006). 

Non-take-up suggests that benefit decisions are not determined by economic 
incentives alone, but that information and/or non-economic incentives are also 
important. Both these factors are likely to depend on social context, which may 
affect the amount and content of information that reaches potential recipients, 
and non-economic incentives such as social status and approval. In this article, 
my perspective is that welfare dynamics result from individual choices, embedded 
in a social context. I place the choice to claim welfare benefits in a social context 
by studying whether the propensity to claim Social Assistance increases with SA 
receipt in the neighborhood. Neighborhood is a social context that has received 
much academic and policy attention, and the potential impact of neighborhoods 
has been studied for a number of individual outcomes (see reviews in Brooks- 
Gunn, Duncan and Aber 1997; Dietz 2002; Jencks and Mayer 1990; Sampson, 
Morenoff and Gannon-Rowley 2002). There are strong theoretical reasons to 
believe that neighborhood welfare receipt can affect the choice to claim welfare. A
high density of welfare receipt in a neighborhood should ease the transmission of 
information about welfare benefits, increase the visibility and perceived normality 
of welfare receipt, and decrease the normative pressures against claiming welfare, 
making households more likely to claim it in a given economic situation. 

This research is of inherent sociological relevance not only because neighbor- 
hoods and welfare receipt are sociologically relevant topics, but also because they 
are linked by a social influence mechanism. The study is also of social policy 
interest because a better understanding of the factors systematically affecting the 
choice to claim benefits will provide better grounds for interpreting increases in 
welfare receipt as well as for predictions about welfare caseloads and the effects of 
interventions. The kind of social influence studied here, in which a behavior is
affected by the prevalence of the same behavior among others, is of particular policy
interest because, when present, an endogenous effect functions as a social multiplier
that reinforces exogenously caused fluctuations in welfare receipt.
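
The multiplier idea can be sketched with a deliberately simple linear model. This is an assumption made for illustration, not the model estimated in this article: the neighborhood receipt rate is taken to equal a baseline plus a fraction `peer_weight` of the prevailing rate, and both function names are hypothetical.

```python
def equilibrium_rate(baseline, peer_weight, tol=1e-12, max_iter=10_000):
    """Iterate rate <- baseline + peer_weight * rate until it settles."""
    if not 0 <= peer_weight < 1:
        raise ValueError("peer_weight must lie in [0, 1) for convergence")
    rate = baseline
    for _ in range(max_iter):
        new_rate = baseline + peer_weight * rate
        if abs(new_rate - rate) < tol:
            return new_rate
        rate = new_rate
    return rate


def social_multiplier(peer_weight):
    """Factor by which an exogenous shift is amplified in equilibrium."""
    return 1.0 / (1.0 - peer_weight)


# Hypothetical numbers: an exogenous shock raises the baseline by 1 point.
before = equilibrium_rate(baseline=0.05, peer_weight=0.2)
after = equilibrium_rate(baseline=0.06, peer_weight=0.2)
amplified_change = after - before
```

Under these assumed values the one-point exogenous increase ends up as a change of about 1.25 points in the equilibrium rate, which is what `social_multiplier(0.2)` predicts.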

However theoretically plausible, it is notoriously hard to estimate empirically 
the causal effect of a contextual variable on individual outcomes (see Dietz 
2002; Hauser 1970; Manski 1993, 2000). Recurrent concerns include the risk 
of reverse causality (Manski’s “reflection problem”), spurious correlations due to 
omitted variables or population sorting, and inadequate operational definitions 
of the context in question. Although researchers have used much ingenuity, 
problems remain even in the most influential studies in the field (see the critical 
views of Dietz 2002; Durlauf 2004; Rivkin 1999). One fundamental obstacle is 
data availability: For reliable estimation of contextual effects with observational 
data, we need large sample panel data with appropriate operationalizations of 
the relevant context, and relevant observed variables for these contexts as well 
as for individuals or households. 

This article provides a uniquely reliable estimation of a contextual effect. I 
use population data on all individuals ages 18 to 64 years who ever resided in 
Stockholm County (Sweden) during the 1990s to construct a household-level 
dataset with a total of 1,595,843 observations from 903 neighborhoods and eight 
years. The data were compiled from different population registers, and contain a 
wide range of information on these households, which allows for extensive con- 
trols. I restrict my study to married couples and couples with common children, 
because these are the only categories that can be reliably identified as households 
in Swedish register data. The neighborhoods, in turn, are geographical areas 
purposefully defined by Statistics Sweden to represent meaningful and socially 
homogenous neighborhoods. Using population data rather than merely a sample 
from each neighborhood also enables construction of reliable neighborhood 
composition variables by aggregating individual variables. In addition, the panel
structure of the data allows the use of fixed-effects models to avoid biases due to
time-invariant unobserved characteristics.
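
How fixed effects remove time-invariant confounding can be seen in a linear simplification. The analyses in this article use fixed-effects logistic regressions, so the sketch below only illustrates the underlying within-household idea; the panel is made up and all names are hypothetical.

```python
from collections import defaultdict


def slope(x, y):
    """OLS slope from regressing y on x (with an intercept)."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    return sxy / sxx


def demean_by_unit(unit_ids, values):
    """Subtract each unit's own mean (the within transformation)."""
    totals, counts = defaultdict(float), defaultdict(int)
    for unit, value in zip(unit_ids, values):
        totals[unit] += value
        counts[unit] += 1
    return [value - totals[unit] / counts[unit]
            for unit, value in zip(unit_ids, values)]


# Hypothetical panel: 3 households observed over 4 years.
TRUE_EFFECT = 0.5
household_ids, exposure, outcome = [], [], []
for household in range(3):
    fixed_trait = 5.0 * household      # unobserved, constant over time
    for year in range(4):
        x = 2.0 * household + year     # exposure correlates with the trait
        household_ids.append(household)
        exposure.append(x)
        outcome.append(fixed_trait + TRUE_EFFECT * x)

pooled = slope(exposure, outcome)
within = slope(demean_by_unit(household_ids, exposure),
               demean_by_unit(household_ids, outcome))
```

Here the pooled slope is inflated by the unobserved household trait, while the within-household slope recovers the assumed effect of 0.5.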


Social Influence and Neighborhood Effects


This study lies in the intersection of two important sociological areas: social influence 
research and the study of neighborhood effects. These areas overlap, though not com- 
pletely: Social influence effects can be mediated by people who are not neighbors, 
and neighborhood effects need not be social influence effects. The neighborhood 
may also causally affect the outcomes of individuals by the presence and quality of 
local social institutions such as schools and churches, or even the physical environ- 
ment. These are factors that are shared by, and potentially affect in a similar way, all 
those living in a specific neighborhood (cf., Jencks and Mayer 1990; Sampson et 
al. 2002). The focus of this article is on neighborhood social influence effects, that 
is, effects on an individual outcome of interpersonal influence in the neighborhood. 
The degree to which studies of neighborhood effects, or studies of effects of 
other contexts (such as schools), focus on interpersonal influence effects, or even 
distinguish these from other contextual effects, varies. Many social scientists fol- 
low Manski (1993, 2000), who distinguishes between two basic types of social 
influence effects. He uses the term endogenous effects for cases in which a behavior 
is affected by the prevalence of the same behavior among others, while contextual 
(exogenous) effects on behavior are due to the prevalence of other behaviors or 
characteristics among others. Apart from the two types of social influence effects, 
Manski also distinguishes correlated effects as an explanation of people in a group 
behaving in the same way. These are effects not mediated by interpersonal social 
influence. Importantly, Manski treats these effects as a source of bias and does not 
distinguish between such correlated effects that are non-causal, such as those due 
to differential selection of people into contexts, and such effects that are causal but 
not due to social influence, such as the effects of societal institutions that affect 
all people in a group in a similar way. Sociologists may be genuinely interested 
in this kind of effect, and would not generally treat it merely as a source of bias. 
Sociologists are often interested in studying the overall causal effect of belong- 
ing to some context, and the term “contextual effects” is often taken to include 
what Manski classifies as endogenous and contextual effects, as well as the causal 
subset of the correlated effects. A distinction between endogenous and other social 
influence effects is seldom made. This may be entirely reasonable if the aim is to 
study the presence and to estimate the strength of some contextual effect in toto. 
However, isolating the different mechanisms behind a contextual effect can
improve our sociological understanding of the process, and different mechanisms
may also have different implications for aggregate dynamics and policy inter- 
ventions. For example, interventions that change a neighborhood’s composition 
will alter neighborhood effects if these are mediated through the social influence 
of neighbors, but not if they are mediated through existing local institutions. 
Crucially, an intervention to reduce welfare receipt will be particularly effective 
when social influence effects are endogenous because of the multiplier effect. 
The present study deals with an endogenous social influence effect, namely
whether welfare receipt is affected by the prevalence of welfare receipt among 
neighbors. Few studies have specifically addressed such effects on welfare use. 
Bertrand, Luttmer and Mullainathan (2000) found results supportive of social in- 
fluence effects among immigrants in the United States by using census data. They 
used extensive controls to reduce omitted variable bias, but were limited by a lack 
of data over time. For a related subject, unemployment, Hedström and Åberg
(2005) have found evidence of neighborhood social influence effects in Sweden. 

What, then, are the reasons to expect social influence effects on welfare use? 
Obviously, information is important. The higher the welfare receipt is in an indi- 
vidual’s neighborhood, the greater the probability that the individual will get informa- 
tion about welfare benefits and eligibility criteria. More and better knowledge about 
benefit eligibility criteria makes it easier to make a claim, thus lowering transaction 
costs. In addition, increased knowledge should yield more correct expectations of the 
probability of receiving benefits, making eligible people more likely to claim them. 
Improved information about benefits can also be coupled with a decrease in informa- 
tion about job opportunities, which can affect the household’s economic prospects. 

Related to information are mechanisms of actualization. Our cognitive capaci- 
ties are limited, and we do not actively consider all alternatives we know to be 
open to us. Here lies an interesting asymmetry: claiming welfare benefits can only 
follow from an active decision, while not claiming them can be the result of an 
active decision or a non-decision. One way by which increased welfare receipt in 
the neighborhood can affect the propensity to claim benefits is that it can make 
this alternative more visible. 

One factor commonly assumed to be an important explanation of non-take-up of 
welfare benefits is the stigma often associated with them. In most Western countries, 
norms prescribe that people should earn a living, and economically successful people 
are viewed positively. Welfare receipt can be perceived as a violation of these norms, 
and eligible people can be deterred from claiming welfare for fear of disapproval, a 
decrease in social status or some other social sanction. The strength of the social pres- 
sure against welfare receipt is likely to vary with its perceived normality: the more 
people who behave in a certain way, the more normal such behavior will appear. 
In the case of a stigmatized behavior such as claiming welfare, this means that an 
increase in the prevalence of the behavior is likely to reduce the social costs associated 
with it, thereby increasing the propensity to adopt the behavior. Norms, however,
can constrain behavior beyond the fear of social consequences. Norms may become
internalized and shape our preferences, so that we avoid violating norms even when
we believe that no one will know. If we learn that many of the people we identify
with receive welfare, our internalized norms against it may weaken.

It is plausible that social influence matters more when starting to claim welfare
than when choosing to continue it. Obviously, those who receive welfare have
passed the threshold of information and actualization. Furthermore, transaction
costs are typically higher when starting to claim welfare because it requires more
extensive interviews and documentation. It is also plausible that any internalized 
stigma against welfare weakens with the duration of receipt. The upshot is that 
one should expect welfare receipt in the social context to have a positive effect on 
the propensity to claim welfare, but a stronger effect on inflow than on outflow. 
This would be consistent with the theoretical view that one is influenced more by 
the behavior of others in making decisions about which one has little experience 
(Cialdini 1984; Coleman, Katz and Menzel 1966). 

The theoretical mechanisms discussed above—information, actualization and 
stigma—provide a rationale for expecting an endogenous effect. It is not the aim of 
this research to study these theoretical mechanisms directly and unravel their relative 
importance, but to distinguish social influence effects from other sources of cor- 
relations between neighborhood welfare receipt and household welfare receipt. The 
empirical section tests two hypotheses: (1) whether neighborhood welfare receipt
affects the choice to claim welfare; and (2) whether it affects inflow more than outflow.
In addition, I seek reliable estimates of the substantive importance of these effects. 


The Neighborhood as an Arena for Social Influence 


The literature on neighborhood effects is extensive, and many kinds of effects can 
be denoted by the term (Jencks and Mayer 1990). Although theoretical accounts 
of neighborhood effects commonly delineate several potential mechanisms, em- 
pirical estimates of neighborhood effects tend to be of a black-box type, suggesting 
little about the underlying mechanisms. In this study, I do not model neighbor- 
hood effects in broad strokes, but rather seek to get the most valid and precise 
estimates possible for one potentially important mechanism: endogenous social
influence effects. In contrast to much of the literature on neighborhood effects,
I do not focus on disadvantaged neighborhoods, but see the neighborhood as a
potential arena for social influence in all kinds of neighborhoods. In addition, I
am interested in the effect of the neighborhood of residence on adults, not the 
effect of the neighborhood during childhood. 

Social influence effects operate through different mechanisms. However, the others
whose behavior can influence a person must be persons to whom that person relates,
through interaction (e.g., friends) and/or identification (e.g., role models). It is
an open question how important the neighborhood is in terms of such relations. 
Research in social psychology has long emphasized geographical proximity as a 
powerful determinant of social relations (Festinger, Schachter and Back 1963). It 
is clear that in modern societies residential areas are far from the only source of 
social relations, but as suggested by Wellman (1996), neighborhood contacts still 
make up a sizeable proportion of people's contacts. 

Small physical distances need not be the only thing that engenders interaction 
among neighbors. People are not normally randomized into neighborhoods, but 
tend to cluster with others like themselves. Such population sorting is not only a 
source of bias, but can also fuel genuine social influence effects because people are
more likely to interact with others like them. This is an important insight, because 
it means that experiments are not necessarily ideal for the study of neighborhood 
social influence effects. When people are placed in neighborhoods where they 
otherwise would not have lived, the processes of social influence may well be 
different compared to what they would be in real, non-experimental, situations. 

Even if the behavior of neighbors affects people, we will not find genuine neigh- 
borhood effects unless we manage to capture the real neighborhood. However, this 
is a hypothetical entity, and it is therefore hard to defend the validity of any opera- 
tionalization (cf., Openshaw and Taylor 1981). The definitions of neighborhoods 
can be better or worse, however. Studies in the United States, for example, are 
often criticized for equating neighborhoods with census tracts, which are not pur- 
posefully defined and are relatively large (3,000-5,000 inhabitants). Fortunately, 
I can use areas that are comparatively small (1,092 inhabitants on average) and
purposefully designed by Statistics Sweden to delimit socially and economically
homogenous neighborhoods, areas that have been found to correspond well to
people's own perceptions of their neighborhoods (Andersson 2001). 


Swedish Social Assistance 


In Sweden, the bulk of the economic assistance to individuals is given as universal 
benefits and general social insurance. The means-tested Social Assistance benefit is 
the very last resort of formal economic assistance, intended only to give short-term 
relief to households in need. Municipalities are obligated by law to provide SA, 
and the National Board of Health and Welfare (Socialstyrelsen) establishes guidelines
concerning the amounts of support. Since 1998, these guidelines also regulate
the minimum economic standards that municipalities must guarantee the SA
recipients. In an international perspective, the economic standard guaranteed by 
SA is rather high. However, the general economic standard for SA recipients is 
still significantly below that of a full-time worker in a low-paying job in Sweden 
(Ministry of Health and Social Affairs 2004). 

SA claims are made at a welfare agency on a monthly basis. Support is given 
only if the welfare agency finds that all individuals in the household have made 
sufficient efforts to improve their economic situation, e.g., by actively seeking em- 
ployment. Any income in the household reduces the benefit by the same amount, 
and the general rule is that assets should be realized. 

The period studied in this research is one with large variations in aggregate SA
receipt. Sweden entered a severe recession at the beginning of the 1990s, and the
subsequent increase in SA receipt was dramatic. In fixed-money terms, the sum
of SA transfer payments doubled from 1990 to 1997 (SOU 1999); SA receipt
increased from 6 percent to 8.5 percent (Bergmark 2000); and the number of
long-term recipients (defined as those who received SA at least 10 months in a
year) almost tripled (Socialstyrelsen 1999).1


Data


The dataset is longitudinal, consisting of annual observations from 1990 to 1999, 
and containing detailed information merged from various registers about all in- 
dividuals ages 18-64 who ever resided in Stockholm County during this period 
(the matching was carried out by Statistics Sweden and is based on unique per- 
sonal identifiers). Because entitlement to SA is judged on the basis of household 
needs and incomes, I cluster individuals into households and use households as 
observation units throughout the analyses. After delimitations of data, the number 
of observations in the empirical analyses is 1,595,843, consisting of 297,250 
households observed for an average of 5.4 years. 

These data are well-suited to the purpose of studying neighborhood social 
influence effects for several reasons. First, they allow the operationalization of 
neighborhoods at a fine geographical level, the Small Area Market Statistics area 
of residence. In the dataset, there are 903 different SAMS areas. The average 
population size for ages 18-64 in a SAMS area in Stockholm County is 1,092 
(median 660), and an average household lives in a SAMS area with a population 
of 2,845 (median 1,868). Second, because the dataset contains information for 
the entire population, it allows for the construction of reliable neighborhood 
variables. Third, the data contain a large amount of information on individuals 
and their households, which makes possible extensive controls for potentially con- 
founding variables. Fourth, the panel structure of the data allows the use of lags 
of independent and dependent variables, and of fixed-effects models. In addition, 
using register data eliminates panel attrition, misreporting of benefit receipt, and 
non-response from benefit recipients, problems that can severely bias estimates of 
benefit receipt from survey data (Riphahn 2001; Terracol 2002). As entitlement 
to SA is based on the household situation, reliable identification of households is 
vital. Thus, I study married couples and couples with common children, who are 
reliably identified as households in Swedish register data. All other individuals are 
classified as singles, but can actually be either singles or part of a couple that is 
neither married nor has common children.² While it would have been preferable 
to include all households, an analysis of couple households is interesting in itself 
and is likely to give an indication of the existence of a more generic social influence 
effect (although the size of this effect may be different for single households). 

While SA is claimed and received monthly, no monthly records exist in the 
data. Thus, the flows in and out of receipt within a year must be ignored, and the 
dependent variable must measure whether the household received SA at any time 
during the year in question. The household identification refers to the household 
composition at the end of the year, while SA receipt and many other variables 
refer to the whole year. Variables observed during a year may be ascribed to the 
wrong household if the household composition has changed during the year. 
Because of this, I do not include households that have formed during the year of 
analysis. As a consequence, a household remains part of the studied population 
only if it consists of the same couple on December 31 in year t as on 
December 31 in year t-1. Because lagged variables are used in the analysis, the models 
are estimated using only households that consisted of the same adults on Dec. 31 
for three consecutive years (i.e., the couple must have lived together for at least 
two full years preceding the observation year). This means two things. First, due 
to censoring, the historical observation window becomes 1992-1999, and no 
households that are formed in 1998 or 1999 are included. Second, when taking a 
given observation year as a point of departure, around 9 percent of the identifiable 
couple households will not be included because they are recently formed. This group 
includes those who will stay together for a longer time (in which case they will 
be included at a later time point) as well as a small group that will not form a 
household for more than two years. 
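
The selection rule described above can be illustrated with a short filter. This is only a sketch in Python, not the procedure actually used for the analyses; the record layout (a mapping from year to a couple identifier as of Dec. 31) and the function name `stable_household_years` are assumptions made for the example.

```python
def stable_household_years(couple_by_year, first_year=1990, last_year=1999):
    """Return the observation years t in which the same couple is registered
    on Dec. 31 of years t-2, t-1 and t.

    couple_by_year: dict mapping year -> couple identifier (hypothetical
    layout); a missing year means no identifiable couple household.
    """
    keep = []
    for t in range(first_year + 2, last_year + 1):
        ids = [couple_by_year.get(year) for year in (t - 2, t - 1, t)]
        if ids[0] is not None and ids[0] == ids[1] == ids[2]:
            keep.append(t)
    return keep


# A couple living together throughout 1990-1999 is observed in 1992-1999.
always_together = {year: "A+B" for year in range(1990, 2000)}
print(stable_household_years(always_together))

# A couple first registered together on Dec. 31, 1998 never qualifies.
formed_1998 = {1998: "C+D", 1999: "C+D"}
print(stable_household_years(formed_1998))
```

Under these assumptions the first data years (1990 and 1991) serve only as history, which is why the observation window starts in 1992.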


Identification of Neighborhood Social Influence Effects 


The data that I use overcome several common obstacles in the estimation of social 
influence effects. One obvious advantage is that the panel structure of the data 
makes it possible to use lagged neighborhood SA receipt as the independent variable, 
to avoid problems of causal order. A less obvious, but equally important, advantage 
is the ability to control for the lagged dependent variable. The state one is in (receipt 
or non-receipt) can affect attitudes, knowledge and opportunities and therefore the 
probability of future receipt. Such a causal effect of the lagged household SA receipt 
on current household SA receipt is called state dependence. A bias in the estimated 
effect of neighborhood SA can result because households with a high value of neigh- 
borhood SA at t-1 are also more likely to have received SA themselves at t-1. I will 
therefore take account of the lagged value of the dependent variable by estimating 
separate models for SA inflow and outflow. This approach avoids confusing the 
endogenous effect with household state dependence, and it allows the estimation of 
effects of different magnitudes for inflow and outflow. It also effectively eliminates 
a large fraction of the bias due to unobserved variables, because many variables that 
affect a household’s risk of SA receipt at t affected it also at t-1. 
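
As a rough illustration of splitting the analysis on the lagged dependent variable, the following Python sketch separates household-year records into an inflow risk set (no receipt at t-1) and an outflow risk set (receipt at t-1). The field names `sa` and `sa_lag` and the toy records are hypothetical; the sketch shows only the sample split, not the regression models.

```python
def split_by_lagged_receipt(rows):
    """Split household-year rows by SA receipt in the preceding year.

    rows: list of dicts with 'sa' (receipt in year t, 0/1) and
    'sa_lag' (receipt in year t-1, 0/1). Hypothetical field names.
    """
    inflow_risk_set = [r for r in rows if r["sa_lag"] == 0]
    outflow_risk_set = [r for r in rows if r["sa_lag"] == 1]
    return inflow_risk_set, outflow_risk_set


def receipt_rate(rows):
    """Share of rows with SA receipt in year t."""
    return sum(r["sa"] for r in rows) / len(rows)


rows = [
    {"sa_lag": 0, "sa": 0},
    {"sa_lag": 0, "sa": 1},
    {"sa_lag": 0, "sa": 0},
    {"sa_lag": 0, "sa": 0},
    {"sa_lag": 1, "sa": 1},
    {"sa_lag": 1, "sa": 0},
]
inflow, outflow = split_by_lagged_receipt(rows)
print(receipt_rate(inflow))   # share entering SA among previous non-recipients
print(receipt_rate(outflow))  # share remaining in SA among previous recipients
```

Because the outcome is coded 1 for receipt in both sets, the second rate measures remaining in receipt rather than leaving it.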

However, bias in the estimated effect of neighborhood SA may remain. Even 
within groups defined by lagged SA receipt, there may be heterogeneity that is 
related to the likelihood of receiving SA. For example, non-recipients in neighbor- 
hoods with different levels of SA receipt may have different economic situations, 
so I control for household economic resources. This is a very powerful control, 
because many factors, observed and unobserved, affect the probability of SA 
receipt by affecting economic resources. This control means that I neglect any part 
of the endogenous effects on SA receipt that is mediated by economic resources, 
and concentrate on the endogenous effects on the take-up of SA given the 
economic resources of the household. Thus, this research does not address the issue of 
whether increases in welfare receipt in the neighborhood increase the probability 
of becoming poorer, which may be the case if it tends to weaken work norms and 
discourage people from working. 

While the theories concern the claiming of SA, the data contain information 
only on receipt of SA. If the probability of receiving SA upon claiming it is system- 
atically related to the lagged neighborhood SA receipt, and this is not controlled 
for, the estimated effect of this variable will be biased. This could result from 
systematic variations across welfare agencies or over time in criteria for granting 
SA. There is also a possibility that some welfare agencies are more restrictive during 
particular years. To avoid biases due to these factors, I include welfare-agency*year 
dummies (N = 42*8 = 336). 

In research on contextual school effects, a common concern is that pupils sort 
into schools based on the outcome under study per se (e.g., achievement) and not 
only on factors related to the outcome (e.g., parents’ socio-economic status): that 
is, more high-achieving students choose better schools, given family characteristics. 
This mechanism is unlikely to be a concern here. People probably do not move to 
their neighborhoods based on their propensity to claim SA per se,* but on other 
characteristics that make them more likely to claim SA. The most obvious such 
factor is income, which is now controlled for. Remaining bias must thus be due to 
factors that can potentially affect sorting into different neighborhoods by people 
with different propensities to claim or receive SA given their income. I include several 
socio-demographic variables in the analysis to reduce the risk of any such biases. In 
the analysis of outflow, I also include a variable measuring the lagged ratio of SA to 
total disposable income. (Note that if the social influence effect of interest is medi- 
ated to some extent by the control variables, it will be underestimated.) 

This is as far as we can go with a pooled model. The estimation of separate 
models for inflow and outflow eliminates bias due to state dependence and any 
factors that affected a household similarly during the preceding year. Bias due to 
population sorting is addressed by a large range of controls for household charac- 
teristics, including income/need. In addition, bias due to period-specific factors 
or factors specific to welfare-agency districts (or to their interaction) is eliminated. 
It is difficult to think of any factor that is not controlled for and that would have 
more than marginal impact in practice.⁵ 

The nature of the data, however, allows me to go further in the identification of 
social influence effects by using fixed-effect models (i.e., models with fixed effects for 
observation units, here households). Such models exploit only the variation within 
units over time, and hence eliminate all bias due to time-invariant unobserved 
characteristics of households. Thus, if unobserved household characteristics affect 
the probability of SA receipt (given income and other controls), and the type of 
neighborhood in which the household lives, the estimates from the pooled model 
are biased. Estimates from household fixed-effects models, however, are not.⁶ 

In linear regression, using fixed effects is analogous to adding unit dummy 
variables (i.e., one dummy for each unit in a panel). However, I will use logistic 
models, and in such models the simple dummy fixed effects can cause severe 
inconsistency in all estimates (the so-called Neyman-Scott (1948) incidental 
parameter problem).⁷ For the fixed-effects analyses, I will therefore use a method 
proposed by Chamberlain (1980) to condition the household fixed effects out of 
the likelihood. This procedure, normally referred to as conditional logit, calculates 
likelihood relative to each unit by conditioning on the number of observed 
positive outcomes. Units that have a constant value on the dependent variable drop 
out, so using this method is difficult in analyzing rare events unless one has very 
large datasets. A great advantage of my data is that conditional logit models are 
possible to fit even though only 4 percent of the population in the pooled analyses 
remains. Although they have many advantages, fixed-effects models have their 
shortcomings. First, they control only for time-invariant effects on estimates, so 
other controls must still be included to control for time-varying factors that may 
bias estimates. In the fixed-effects models, I target this problem by adding controls 
for year and household economic resources (and, as before, by conditioning on 
lagged SA receipt). Second, the great strength of fixed-effects models is also their 
weakness. Because these models use only the longitudinal, within-unit variation 
and disregard cross-sectional variation, we must assume that the estimates also 
hold for the non-observable relationship between the dependent and independent 
variable between units. This assumption may be defensible, but is not necessarily 
correct. In my study, the fixed-effects estimates are most likely the best estimates 
of the true endogenous effect for the part of the population that experienced both 
receipt and non-receipt. However, because the fixed-effects models include only a small 
(and non-random) subsample of the studied population, inference from this group 
to the population is not straightforward. In order to get an estimate somewhat 
closer to the population effect, I also estimate a fixed-effects model that includes 
households only until they have received SA for the first time.⁸ This sample is likely 
to be less different from the non-recipient households than the sample including 
households that have many observations with SA receipt. 
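
To make the conditioning step concrete, the sketch below computes the conditional log-likelihood contribution of a single unit for a model with one covariate. It is the generic textbook form of the conditional logit, written in Python purely for illustration; it is not the Stata code used for the analyses, and the function name and toy values are assumptions.

```python
from itertools import combinations
from math import exp, log


def conditional_loglik_unit(y, x, beta):
    """Log conditional likelihood of one unit's outcome sequence,
    given the unit's total number of positive outcomes.

    y: list of 0/1 outcomes over time; x: list of covariate values
    (one scalar covariate for simplicity); beta: coefficient on x.
    """
    n_periods, n_positive = len(y), sum(y)
    # Numerator: the observed allocation of positive outcomes.
    numerator = exp(beta * sum(xt for yt, xt in zip(y, x) if yt == 1))
    # Denominator: all ways to place the same number of positives.
    denominator = sum(
        exp(beta * sum(x[t] for t in idx))
        for idx in combinations(range(n_periods), n_positive)
    )
    return log(numerator / denominator)


# A unit that changes state contributes information about beta ...
print(conditional_loglik_unit([0, 1], [0.0, 1.0], beta=0.0))
# ... while units with constant outcomes contribute zero for any beta.
print(conditional_loglik_unit([0, 0, 0], [1, 2, 3], beta=0.5))
print(conditional_loglik_unit([1, 1, 1], [1, 2, 3], beta=0.5))
```

A unit-specific intercept would multiply the numerator and every term of the denominator by the same factor, so it cancels. For the same reason, units with all-zero or all-one outcomes contribute log(1) = 0 regardless of beta, which is why they drop out of the estimation.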

The estimated social influence effects could in principle be both endogenous 
and exogenous (what Manski would call “contextual”) social influence effects. 
Here, an endogenous social influence effect means that a given household’s pro- 
pensity to enter or exit SA is influenced by changes in neighborhood SA receipt, 
while an exogenous social influence effect means that a given household’s pro- 
pensity to enter or exit SA is influenced by changes in some other behavior or 
characteristic of neighbors that tends to change when SA receipt changes, and 
that does not operate through income (because this is controlled for). For example, 
if unemployment and SA in the neighborhood tend to increase simultaneously, 
and households’ SA decisions—given their economic situations—are stimulated 
by neighbors’ unemployment rather than by their SA receipt, the estimated effect 
of neighbors’ SA could reflect also an effect of neighbors’ unemployment. To the 
degree that this is the case, the estimated effect is a social influence effect but not 
an endogenous one. However, considering that this effect would have to hold 
net of the household economic situation, it is theoretically less plausible than the 
endogenous social influence effect.⁹ 


Neighborhood Social Influence and Welfare Receipt • 1341 


Variables 


The dependent variable takes the value 1 if the household received SA at any 
occasion during the year and 0 otherwise. In a few cases, only one in the couple 
is registered as an SA recipient (probably due to brief separations), and these 
observations are excluded from the analysis. A household in which only an adult 
child received SA is classified as a non-recipient household. The neighborhood 
SA receipt variable gives the percentage of SA recipients in the preceding year among 
all individuals ages 18 to 64 (except those in the household itself) in the SAMS 
area where the household was located. 
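
The exclusion of the household's own members from the neighborhood measure can be sketched as follows. This Python fragment is illustrative only; the function name, argument names and the example counts are assumptions, and all counts are taken to refer to the preceding year.

```python
def neighborhood_sa_pct(area_recipients, area_adults, own_recipients, own_adults):
    """Percentage of SA recipients among residents ages 18-64 in the area,
    leaving out the members of the household itself.

    All arguments are counts of individuals ages 18-64 (hypothetical inputs).
    """
    others = area_adults - own_adults
    if others <= 0:
        raise ValueError("no other residents ages 18-64 in the area")
    return 100.0 * (area_recipients - own_recipients) / others


# An area with 660 adults, 40 of them recipients, seen from two couples:
recipient_couple = neighborhood_sa_pct(40, 660, own_recipients=2, own_adults=2)
other_couple = neighborhood_sa_pct(40, 660, own_recipients=0, own_adults=2)
print(recipient_couple, other_couple)
```

Leaving the household out means that two households in the same area can face slightly different values, and that a household's own receipt does not enter its neighborhood measure.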

To control for the household’s economic situation, I create an income/need 
variable by the formula I/(N+HC), where: 

I   is the yearly pre-SA disposable income for the household; 

N   is the official (pre-housing cost) eligibility limit for SA for 
    households with this composition;¹¹ and 

HC  is the average housing cost among households with this 
    composition in Stockholm County (estimated using data 
    from Statistics Sweden’s Survey of Housing and Rents). 


This variable thus indicates approximately how far a household’s yearly income 
is from the SA eligibility limit. In the analysis, I include this variable as a control, 
but I retain all households regardless of their value on this variable. Excluding 
ineligible households would be desirable, but this is impossible because I have 
yearly data while eligibility for SA is judged monthly. Consequently, even though 
those with low values on the yearly income/need measure have a much higher risk 
of SA receipt, households with relatively high values can also receive SA.¹² To allow 
for non-linear effects of the income/need variable, it is categorized and represented 
by a set of dummies (see Table A1 in Appendix). Households with zero or negative 
registered incomes, and households in which at least one adult receives most of his 
or her income from self-employment, are excluded from the analysis because their 
incomes are less reliable as estimates of the households’ living standards. True zero 
incomes are very unlikely in Sweden due to the many other available benefits. An 
unknown number of households with no registered incomes are households that 
have moved from the country without notifying the authorities. 
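
A small numerical sketch may help fix the construction of the income/need variable and its dummy coding. The Python code below is an illustration only: the amounts and the cut points are hypothetical (the categories actually used are those listed in Table A1), and the function names are assumptions.

```python
from bisect import bisect_right

# Hypothetical cut points for the income/need categories.
CUT_POINTS = [0.5, 1.0, 1.5, 2.0, 3.0]


def income_need(income, sa_limit, housing_cost):
    """I / (N + HC): yearly pre-SA disposable income relative to the SA
    eligibility limit plus average housing cost."""
    return income / (sa_limit + housing_cost)


def income_need_dummies(ratio, cut_points=CUT_POINTS):
    """One-hot coding of the income/need ratio into categories."""
    category = bisect_right(cut_points, ratio)
    return [int(i == category) for i in range(len(cut_points) + 1)]


# Hypothetical household: income 180,000, limit 60,000, housing cost 40,000.
ratio = income_need(180_000, 60_000, 40_000)
print(ratio)                       # income is 1.8 times the estimated limit
print(income_need_dummies(ratio))  # falls in the 1.5-2.0 category
```

Coding the ratio as a set of dummies rather than a single linear term lets the risk of receipt vary freely across income/need levels.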

Access to informal economic help should lessen the perceived need for money 
given by SA. For each adult in a household, the equivalized disposable family 
income of the individual’s parents, expressed per adult in units of base amounts 
(see note 11), is used to operationalize the availability of such economic help. The 
equivalized income is calculated using the same consumption weights as for the SA 
eligibility limit (i.e., family income in base amounts*adult consumption weight/ 
family consumption weight). If an individual’s parents do not live together and 
both can be found, the average of their household incomes is used. If one parent 
is missing I use the information for the non-missing one. The couple's average of 
this variable defines the household variable, and if one value is missing the non- 
missing value is used. Information on parents’ income is only available for those 
years during the 1990s that an individual’s parents lived in Stockholm County 
and were under 64 years of age. This means that the population is reduced when 
this variable is included, and I will therefore only include it in additional models. 

It is not evident which factors may affect sorting into different neighbor- 
hoods by people with different propensities to claim SA given their observed 
economic resources, but I include several variables in the analysis to reduce the 
risk of such bias: the highest level of education in the couple (see Table A1 in 
Appendix for categories); whether at least one in the couple received student 
loans or benefits during the year (1 = yes); the couple’s average age; the presence 
of children in the household (1 = yes); and whether someone in the couple has 
immigrated to Sweden within the past 20 years (0 = no immigrant, 1 = one im- 
migrant, 2 = both immigrants). In the analysis of outflow, I include a variable 
that measures the degree of SA dependency, defined as the percentage of total 
disposable income at t-1 that consisted of SA. I also use dummy variables for 
the interaction of welfare-agency district (N = 42) and year (N = 8). 


Descriptive Information 


Of the almost 1.6 million observations (households*years) included in the analy- 
ses, 60,987 pertain to the analysis of outflow and the rest to the analysis of inflow. 
The overall rate of SA receipt in the studied population is 3.6 percent. This is a 
rather low rate compared to the Swedish population as a whole during the same 
period, which is to be expected since SA receipt is higher among single persons, 
who are excluded from this analysis. The neighborhood SA receipt that the ana- 
lyzed households face varies from 0 to 90 percent, the average value being 6.6 
percent (std. dev. = 7.5 percent). The average annual inflow is .9 percent, and the 
average outflow is 29.1 percent. Recall that the population at risk of inflow is far 
bigger than the population at risk of outflow. 

The median value of the income/need variable is 1.8; that means the median 
household has a yearly income corresponding to 1.8 times the amount of the 
estimated SA limit, including housing costs. However, the distribution of in- 
come varies markedly between the populations in the two analyses — between 
those who received SA the preceding year and those who did not. For this reason, 
the categorization of the income/need variable into dummies differs in the two 
analyses. Descriptive statistics for all included variables are given in Table A1 
in the Appendix. 


Results 


The results from the logistic regressions are presented in terms of log-odds ratios. 
Neither LnOR nor odds ratios are strictly comparable across groups and models 
(see Allison 1999; Mood 2009). However, all comparisons made across the pooled 
models here are roughly similar when coefficients are Y-standardized (rescaled to 
a constant variation in the logit, see Winship and Mare 1984). 

There are various ways to translate the results from logistic regressions to mea- 
sures that reflect the substantive effects of the independent variables. A popular 
procedure is to measure changes in the probability of Y for discrete changes in 
X, holding other variables constant at certain levels. Such measures are not com- 
parable in a straightforward manner across models with different independent 
variables. Importantly, I would not be able to compare such measures from the 
pooled and the conditional logit models, because several independent variables are 
conditioned out by the fixed effects in the conditional logit model. 

To allow comparability across models with regard to the substantive effects, I 
have chosen a procedure exemplified in Petersen (1985): I estimate the change in 
the probability of SA receipt following a standard deviation change in lagged NSA 
taking the average probability of SA receipt as the starting point. Technically, this is 
done by first translating the average probability of SA receipt into odds, multi- 
plying these odds by the estimated odds ratio for a standard deviation’s increase 
in neighborhood SA receipt, translating these new odds into a probability, and 
subtracting the original probability from the new probability. 
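
The four steps can be written out as a short calculation. The Python sketch below is an illustration, not the original computation: the function name is an assumption, and the inputs are figures reported elsewhere in this article (an average inflow of .87 percent, a standard deviation of neighborhood SA receipt of 7.5 percentage points, and log-odds ratios of .019 and .027 per unit of NSA from the fixed-effects models). Combining these particular figures is itself an assumption of the example.

```python
from math import exp


def prob_change(p0, lnor_per_unit, sd):
    """Change in probability after a one-standard-deviation increase in X,
    starting from the baseline probability p0."""
    odds0 = p0 / (1.0 - p0)                   # step 1: probability -> odds
    odds1 = odds0 * exp(lnor_per_unit * sd)   # step 2: apply the odds ratio
    p1 = odds1 / (1.0 + odds1)                # step 3: odds -> probability
    return p1 - p0                            # step 4: difference


p0 = 0.0087  # average inflow, .87 percent
for lnor in (0.019, 0.027):
    delta = prob_change(p0, lnor, sd=7.5)
    print(lnor, round(100 * delta / p0))  # relative increase in percent
```

With these inputs the relative increases come out at roughly 15 and 22 percent, in line with the figures reported for the fixed-effects models.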

All models are estimated in Stata SE 9.0. For pooled models, I use the logit 
procedure with standard errors robust to correlation between individuals with the 
same lagged neighborhoods (Huber-White sandwich estimates, option cluster in 
Stata). The fixed-effects models are estimated using the clogit procedure. Note that 
household SA receipt is coded 1 in the analyses of inflow as well as outflow, so the 
coefficients should be interpreted as the effects on the decision to start claiming 
in the inflow analyses, and to continue claiming (remain in SA receipt) in the 
outflow analyses. Because the data do not come from a sample but from an entire 
population, the reported Z statistics do not have the traditional interpretation, i.e., 
they do not provide grounds for inference from a sample to a population. Instead, 
they should be taken as estimates of the variability of the effect. 

The full results are reported in Table A2 in Appendix, and I concentrate here on 
the estimated endogenous effect in different model specifications, as shown in Table 1. 

Model 1 is the “naive” bivariate model that disregards lagged household SA re- 
ceipt and includes no control variables. When I split the model on the lagged depen- 
dent variable (models 2a and 2b), the estimated effect of NSA shrinks substantially. 
Models 2a and 2b control for genuine state dependence and much unobserved 
heterogeneity, yet it is obviously premature to draw any conclusions about 
endogenous effects. For example, those living in neighborhoods with higher SA receipt 
may have characteristics associated with a higher risk of SA receipt even though this 
did not lead to SA receipt during the previous year. Most obviously, they may be at a 
higher risk of becoming poor. This is confirmed in models 3a and 3b: when control 
variables for income/need are added, coefficients for NSA are further reduced. 


1344 • Social Forces 88(3) 


Table 1: Pooled Logistic Regression Estimates of Effects of Neighborhood SA Receipt 

Columns: Model 1, then Models 2a/2b through 6a/6b (a = inflow, b = outflow). 
[Coefficient rows and the parents' income control row are illegible in the source.] 

Notes: Estimates are log-odds ratios. Z statistics in parentheses (robust to 
correlation within lagged neighborhood). 
Population is couples in Stockholm County 1992-1999. 
Household SA receipt is coded 1 for both inflow and outflow (i.e., the outflow 
analysis concerns the risk of remaining in SA receipt). 
The socio-demographic controls in models 4 to 6 are: level of education, children 
present, immigrant household, age, student household, and in the outflow models 
also the lagged dependency ratio (SA/income). 

The socio-demographic controls are included in models 4a and 4b, and in the 
case of outflow (4b) I also control for the lagged ratio SA/income. The LnOR 
shrink further, indicating that people with higher propensity to claim or receive 
SA given income tend to sort into high receipt neighborhoods.¹³ 

Models 5a and 5b control for parental income, information on which is avail- 
able only for a subset of the population. The estimated effect on inflow is slightly 
higher than in Model 4a, but tests (not shown here) demonstrate that this is 
due to the change in population rather than to the control for parents’ income. 
The effect of parents’ income is itself in the expected direction (see Table A2 in 
Appendix). In models 6a and 6b, dummies for year by welfare agency district 
are added, and the LnOR increase. 

We can summarize the results given in Table 1 in two general conclusions. First, 
the estimated endogenous effect is, as expected, stronger for SA receipt inflow than 
for outflow.¹⁴ Second, these effects are weaker when we take account of the sorting 
of people with different characteristics into different neighborhoods. 

To assess the substantive importance of the estimated endogenous effects in 
models 6a and 6b, Table 2 (inflow) and Table 3 (outflow) show the observed aver- 
age inflow/outflow and the estimated inflow/outflow per income/need category 
after a standard deviation’s change in NSA. 

We can see in Table 2 that, naturally, the change in percentage units is largest 
for those with lowest income/need, while the relative change is roughly similar at 
different income/need levels. Taking the departure point in the average inflow in 
each income/need category, one standard deviation’s increase in NSA is related to 
an overall increase in inflow of 29 percent (17,239/13,388 = 1.29). For outflow, 
Table 3 shows that the estimated change resulting from a standard deviation’s 
change in NSA is very modest at all income levels, corresponding to a net decrease 
in outflow (i.e., an increase in the risk of remaining in SA receipt) of 3 percent 
([17,653 - 17,178]/17,653 = .03). 

We now turn to the models with household fixed effects. To repeat, these 
analyses include only those households that had at least one change in the 
dependent variable during the period under study. The results are presented in 
Table 4 (full results in Table A3 in Appendix). In models 7a and 7b, all bias due 
to time-invariant differences between households is eliminated by the fixed- 
effects specification. I also add annual dummies and control for income/need, 
to control for time-varying factors that can change a household's propensity to 
receive SA and that are likely to occur when this household's lagged NSA is high. 
The results suggest that the LnOR of inflow increase by .019 per unit increase 
in NSA, and that the LnOR of remaining in SA increase by .008. The expected 
increase in inflow following a standard deviation’s change in NSA, calculated 
(as in Tables 2 and 3) from the departure point of the average inflow in the 
population (.87 percent), is 15 percent. 

[...] have no previous SA receipt, but some SA receipt at this observation point or later. 
The LnOR for the NSA variable increases to .027, which corresponds to a net effect 
on inflow of 22 percent per standard deviation change in NSA, departing from the 
average probability of SA receipt. This indicates that those without personal 
experience of SA receipt are more influenced by NSA than those who have received 
SA previously, a result consistent with the smaller effect found on outflow above.¹⁵ 


Table 4: Conditional Logistic Regression Estimates of Effects of Neighborhood 
SA Receipt 

                                     Model 7a   Model 7b   Model 8 
Observations (households*years)      40,832     23,786     [illegible] 
[Coefficient rows and the control rows (income/need, lagged dependency ratio) 
are illegible in the source.] 

Notes: Estimates are log-odds ratios. Z statistics in parentheses: *p<.05 **p<.01 
Population is couples in Stockholm County 1992-1999 who experienced both receipt and 
non-receipt. 
Household SA receipt is coded 1 for both inflow and outflow (i.e., the outflow analysis 
concerns the risk of remaining in SA receipt). 




Conclusions and Discussion 


In a neighborhood in which welfare receipt is common, the norms against wel- 
fare are likely to be weaker, and information about welfare benefits more readily 
available, than in a neighborhood where few receive such benefits. Norms and 
information about welfare are, in turn, likely to affect the welfare choices of other 
households, suggesting that there is a self-reinforcing component in welfare receipt 
dynamics. In this article, I have investigated whether the propensity to claim 
welfare is affected by welfare receipt in the neighborhood by studying couples’ 
entries to and exits from welfare in Stockholm during the 1990s. 

This study makes concrete use of a sociological perspective on economic choices 
in that it operationalizes a social context and analyses how social influence within 
it systematically affects a choice concerning the household’s economic welfare. The 
analysis of longitudinal data for the entire population of couple households (ages 
18 to 64) who lived in Stockholm County during the 1990s overcomes traditional 
problems in the identification of social influence effects. I estimate two types of 
models: pooled models with control variables, including fixed effects for welfare 
agency district and year, and models with household fixed effects. The first analysis 
exploits the variation between households and within households over time, while 
the second analysis is restricted to the variations within households who have some 
variation in the dependent variable, i.e., households that had years both with and 
without welfare receipt during the 1990s. 

Both analyses suggest that neighborhood SA receipt has a substantial impact on 
SA inflow. In the pooled model with the maximum number of controls (including 
income) a standard deviation’s increase in neighborhood SA is associated with an 
increase in inflow of 29 percent (estimated from the average level of inflow). The 
corresponding estimated decrease in outflow is very small, only 3 percent. The fact 
that households are more affected by others in their initial choice to claim than in 
continuing to claim SA is anticipated. Those already receiving SA obviously have 
information about it, and the transaction and stigma costs are likely to be lower. 

The fixed-effects analysis suggests a corresponding increase in inflow of 15 
percent, and a negligible effect on outflow, but it is estimated only for the sub- 
population of households that have years with SA as well as years without SA, 
comprising only 4 percent of the population in the pooled model. Households 
in this group have more personal experience of SA receipt, which can make them 
less prone to influence by others. A fixed-effects analysis of only those who had 
no previous experience with SA suggests that this may be the case, because the 
estimated increase in inflow at one standard deviation’s change in neighborhood 
SA is clearly stronger at 22 percent. 

How should these different estimates be judged? The first fixed-effects estimate 
(15 percent) can be considered a lower bound on the overall effect. It is a valid 
estimate of the effect among those who have experienced both SA receipt and 
non-receipt, but it is likely to understate the effect in the rest of the population. 
My suggestion is that the pooled estimate, reflecting an effect of 29 percent, or a
figure slightly below this, is likely to be the more valid estimate of the true endog- 
enous neighborhood effect, because it is hard to think of factors that can bias this 
relation more than marginally. 

The data used here come from an entire population of couple households in 
Stockholm during the 1990s. How can these results be generalized beyond this 
particular family type in this particular area, to other family types, other areas 
and even other countries with different welfare systems? The obvious answer is 
that they cannot, at least not in the statistical sense. Of course, it is seldom pos- 
sible to generalize studies beyond specific countries or years of investigation, and 
problems such as non-response often limit this possibility even further. The value 
of individual studies need not, however, lie in their broad generalizability, but in 
their contribution to a more general picture that can be achieved through the 
accumulation of high quality research from multiple sources. 

What speaks in favor of the existence of social influence effects on welfare receipt 
in other populations is that the mechanisms supposed to mediate the endogenous 
effect—information, actualization and stigma reduction—are likely to be universal. 
Naturally, the exact magnitude of the effect can vary across populations, but it is 
hard to say whether Stockholm, couple households or the 1990s would be exceptional
in this regard.16 The general stigma of welfare may differ between countries,
but it is not obvious how this would affect the endogenous effects. On the one hand, 
if stigma is low, the threshold to start claiming in response to social influence should 
be lower. On the other hand, low stigma should mean less sensitivity to whether oth- 
ers claim SA or not, so the net effect on the importance of social influence is unclear. 

Single households, especially with children, are at high risk of poverty and
thus of welfare receipt in Sweden as in most other Western countries, yet it is
unclear whether poor single households would be more or less susceptible to
neighborhood social influence than poor couple households. It is possible that 
welfare receipt is less stigmatized among single households, and though this would 
probably lead to higher baseline take-up, it is again not evident how the level of 
stigma affects the importance of social influence. 

Immigrant households are strongly over-represented among the SA recipients 
(i.e., among those at risk of outflow). The estimated endogenous effects were
somewhat smaller for immigrants than for native Swedes. It is possible that immigrants
have fewer neighborhood ties, and that their networks would be better
operationalized as consisting of those from the same country of origin, or from 
the same language group, in a larger area (cf., Bertrand et al. 2000). For welfare 
use as for other outcomes, research that compares the effects of social influence 
for different populations and in different social contexts that individuals belong 
to would be an important step forward in the study of social influence effects. 

While more research on social influence effects is certainly needed before 
drawing conclusions for policy, the results reported here are nevertheless of 
policy interest. They suggest that a neglect of endogenous effects will lead policy 
makers to underestimate changes in welfare receipt following, for example, a 
labor market shrinkage. If the number of claimants starts to rise, we can expect 
an even sharper increase in SA due to a multiplier effect, so a recession will 
be more expensive for the government than otherwise expected. Because the 
estimates given here are net of economic resources, neighborhood endogenous 
effects mediated by eligibility for SA and size of entitlement are not addressed. 
In high-SA neighborhoods, networks may provide less information about jobs, 
and norms against unemployment may be weaker, increasing the likelihood of 
poverty and eligibility for SA. To the degree that this is the case, the potential
multiplier effects are even stronger.


Notes 


1. The rate of SA receipt is highly sensitive to the number of non-working people 
who are not covered by work-based unemployment benefits and social insurance. 
Immigrants and young people are especially vulnerable in this respect, and the fact 
that the labor market shrinkage coincided with a high inflow of refugee immigrants 
during the 1990s contributed strongly to the aggregate increase in SA receipt. 

2. A couple needs only to have common children to be identified as a household; the
children need not be part of the household (they may have moved out, for example). 
Married, non-cohabiting couples are not included. 

3. If it is the case that neighborhood SA negatively affects household stability, and leads
to more single households, who have a higher risk of SA receipt, the social influence
effect is underestimated when singles are excluded from the analysis. I thank a Social
Forces reviewer for pointing this out.


4. It is possible that more welfare-prone households sort into generous welfare-agency
districts (cf., the discussion about welfare magnets in the United States), which in my
data comprise several neighborhoods (SAMS areas). This is controlled for by use of
welfare-agency*year dummies.


5. One factor about which I have no information, and which can potentially give rise to
some bias, is household wealth. Still, household wealth should largely be correlated with 
several controls used such as income, age, education, parent's income and the dummies 
for welfare-agency district, so the remaining bias due to wealth is likely to be minor. 


6. The variation in lagged neighborhood SA for a household over time is mainly due
to variation within a neighborhood over time, but if a household moves it is also 
due to variation across neighborhoods. A bias can thus result if households’ moves 
between neighborhoods are caused by some change that is related to households’ 
risk of receiving SA given their economic situation (and given their lagged SA receipt). 
However, when excluding those who have moved between neighborhoods at any 
occasion in the data, the effect of neighborhood SA is clearly stronger (i.e., the bias is
negative). As suggested by Hanushek et al. (2003), who found a similar pattern for 
peer effects in schools, this is consistent with social influence being stronger for those 
who live in a neighborhood for a longer period of time. 


7. The bias caused by the incidental parameter problem decreases rapidly with the
number of observations within units (Greene 2004). The dummies for welfare-agency 
district by year in the pooled models will not generate biased estimates, because 
within each welfare-agency district and year, the number of observations (households) 
is very large. 


8. My data are left-censored in that I do not know whether households received SA
before 1990. I exclude households observed for only two years from this analysis. 


9. Because other relevant characteristics of neighborhoods are highly collinear with
neighborhood SA receipt (for example, neighborhood unemployment has a partial 
correlation of more than .90 with neighborhood SA receipt in models 6a and 6b), it 
is difficult to separate their effects empirically. It is also rather pointless, because the 
results would rely strongly on a few atypical neighborhoods. 


10. SA entitlement criteria changed in 1998 and more components became subject
to individual assessment. The previous eligibility limit (expressed in so-called base
amounts, which are amounts adjusted by the government according to the consumer
price index each year to account for inflation) is probably the best estimate of the
eligibility limits used in practice, so I use this limit throughout. It is based on the sum
of the household members’ so-called consumption weights, i.e., single adult: 1.16;
partnered adult: .96; child 0-3: .56; child 4-10: .66; child 11-17: .76; child 18+: .96.
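
The sum of consumption weights described in this note can be sketched in Python as follows. The weights are those listed above; the function names, the treatment of ages as whole years, and the restriction to one or two adults are assumptions made for illustration, and the conversion from the weight sum to an eligibility limit in base amounts is left out because the note does not spell it out.

```python
ADULT_WEIGHTS = {1: 1.16, 2: 0.96}  # single adult; each partnered adult


def child_weight(age):
    """Consumption weight for a child of the given age in whole years."""
    if age < 0:
        raise ValueError("age must be non-negative")
    if age <= 3:
        return 0.56
    if age <= 10:
        return 0.66
    if age <= 17:
        return 0.76
    return 0.96  # child aged 18 or older


def household_weight(n_adults, child_ages):
    """Sum of consumption weights for a household with one or two adults."""
    if n_adults not in ADULT_WEIGHTS:
        raise ValueError("this sketch handles one or two adults only")
    adults = n_adults * ADULT_WEIGHTS[n_adults]
    return adults + sum(child_weight(age) for age in child_ages)


# Example: a couple with children aged 2 and 12.
print(round(household_weight(2, [2, 12]), 2))
```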


11. I would be able to better approximate eligibility with monthly data. However, the
problem would remain because there is no such thing as objective eligibility. Eligibility
is defined by the welfare agency, and it has a great deal of discretion within the limits
of the general regulations.


12. Y-standardized coefficients cannot be estimated in the conditional logit models due
to the lack of an estimate for the standard deviation in the logit. 


13. More elaborate controls for income/need, country of origin, time in Sweden, number
of children, unemployment status and the second lag of SA receipt only marginally
influence the effects of interest.


14. Coefficients must be compared with caution across groups in logistic regression
(Allison 1999; Mood 2009). However, the NSA coefficients for outflow and inflow
differ by an order of magnitude, and other coefficients in the model generally do not,
which suggests that the difference is genuine.


15. I have performed several tests to check the robustness of results on different model 
specifications. Notably, I have in both the pooled and the fixed effect models 1. 
excluded observations with very high (more than 40%) and very low (less than 2%) 
neighborhood SA receipt; 2. excluded the years 1998 and 1999 to assess whether 
changes in the SA rules in 1998 affected the estimates; 3. included only households 
with income/need below 1.5; and 4. estimated the models separately for immigrant 
and native households. In the pooled models, the estimated effects on both inflow 
and outflow are somewhat lower for households in which both are immigrants, and 
slightly lower when excluding neighborhoods with very low or very high SA rates. 
Otherwise, the estimated effects are very similar to the original ones. 


16. With regard to generalizability to the rest of Sweden, it is important to recognize 
that Stockholm County is quite diverse and does not consist of the Stockholm 
metropolitan area alone. I have done separate analyses for the different parishes 
and municipalities in Stockholm County, some of which are small villages or even 
countryside farming communities at the outskirts of the county, while others are 
suburbs to the city of Stockholm, and yet others are part of the city center. Of course, 
the strength of the neighborhood social influence effect estimates differed across 
municipalities and parishes, although not wildly and not according to any easily 
discernable pattern. 
Table A1: Descriptive Statistics

                                                  Received SA at t-1
Variable                                          No            Yes           All
Percent SA recipients                             .9            70.9          3.6
Neighborhood SA Receipt at t-1
  Mean                                            6.1           18.8          6.6
  Standard deviation                              6.8           12.2          7.5
Income/Need
  Mean                                            2             .8            2.0
  Median                                          1.8           .8            1.8
Percent in Income/Need Categories
  0-.5                                            [illegible]   22.4          [illegible]
  .5-1                                            [missing]     44.9          6.3
  1-1.5                                           22.9          27.4          23.1
  1.5-2                                           31.5          4.7           30.5
  2-2.5                                           20.3          [missing]     19.6
  2.5-3                                           9.7           .6            9.3
  3-                                              9.9           [missing]     9.5
Highest Education in Household (%)
  Compulsory, < 9 years                           3.4           10.6          [illegible]
  Compulsory, 9 years or more                     4.5           15.0          4.9
  Lower vocational (comp + 1-2 year)              23.8          [illegible]   23.9
  Upper secondary (comp + 3 years)                [illegible]   17.6          17.2
  Lower tertiary (1-2 years post-secondary)       20.4          13.0          20.1
  University (at least 3 years post-secondary)    30.7          15.6          30.2
Household Immigrant Status (%)
  Both Swedish                                    85.8          26.2          83.6
  One immigrant                                   [illegible]   13.5          7.9
  Both immigrants                                 6.5           60.5          8.6
Households with Child(ren) (%)                    73.0          90.7          [illegible]
Student Households (%)                            5.2           24.9          6.0
Mean Age (Household Mean)                         45.2          [illegible]   44.9
Lagged Dependency Ratio (SA/Income*100)                         [illegible]

N                                                 1,534,856     60,987        1,595,843
Table A3: Conditional Logistic Regression Estimates of Effects of Neighborhood
SA Receipt on Inflow to and Outflow from SA Receipt

                            Model 7a        Model 7b        Model 8
                            Inflow          Outflow         Inflow
NSA at t-1                  .019            .008            .027
                            (6.04)**        (1.72)          (1.67)
Income/Need Inflow
  < 1 (reference)
  1-1.5                     -1.044                          -.538
                            (25.89)**                       (3.52)**
  1.5-2                     -2.003                          -1.238
                            (30.54)**                       (5.09)**
  2-2.5                     -2.505                          -1.630
                            ([illegible])                   (3.69)**
  2.5-3                     -2.420                          -.359
                            (9.73)**                        (.40)
  ≥ 3                       -2.356                          -3.185
                            ([illegible])                   (3.98)**
Income/Need Outflow
  (reference < .5)
  .5-1                                      -.153
                                            (1.45)
  1-1.5                                     -1.500
                                            (13.02)**
  1.5-2                                     -2.248
                                            ([illegible])
  > 2                                       -1.929
                                            (6.47)**
Dependency Ratio                            .073
                                            (33.41)**
Year Dummies                Yes             Yes             Yes
First-Time Receipt          [illegible]     [illegible]     [illegible]
N                           40,839          23,786          [illegible]


Gun Cultures or Honor Cultures? Explaining Regional and 
Race Differences in Weapon Carrying 


Richard B. Felson, Pennsylvania State University 
Paul-Philippe Pare, University of Western Ontario 


We use the National Violence against Women (and Men) Survey to examine the ef- 
fects of region and race on the tendency to carry weapons for protection. We find that 
Southern and Western whites are much more likely than Northern whites to carry 
guns for self-protection, controlling for their risk of victimization. The difference 
between Southern and Northern whites is particularly strong for women. We do not 
find much evidence for regional/race differences in carrying knives or mace. These 
findings provide support for the idea that regional differences in weapon carrying 
reflect a gun culture rather than an honor culture. We see more evidence of an honor 
culture among blacks: they are more likely than whites to carry knives as well as guns, 
controlling for their risk of victimization. 




Research shows that whites from Southern and Western states have higher ho- 
micide rates than whites from the Northern states (e.g., Nelsen, Corzine and 
Huff-Corzine 1994; O’Carroll and Mercy 1989; Parker and Pruitt 2000). The 
difference between the Northern and Southern homicide rates is often attributed 
to a Southern subculture of violence or honor culture (e.g., Gastil 1971; Hackney 
1969; Nisbett and Cohen 1996; Reed 1971).1 Research also shows that, regardless
of region, blacks have higher rates of homicide and other violent crime than whites 
(e.g., Bureau of Justice Statistics 2005; Hawkins et al. 1998; see also Sampson 
and Lauritsen 1994). This pattern is also sometimes attributed to a subculture 
of violence (Wolfgang and Ferracuti 1967). In this research we examine the cultural
argument by analyzing regional and race differences in carrying weapons for
protection. We attempt to determine whether these differences reflect an honor 
culture, a gun culture or an adaptation to current threat. 


Region and Honor Cultures 


According to the Southern subculture of violence thesis, Southern whites have 
an honor culture where violent retaliation is normative behavior when there is 
adequate provocation. In honor cultures, men are expected to defend themselves 
when threatened and to respond to insults with aggression (e.g., Greenberg 1997; 
Wyatt-Brown 1986). Nisbett and Cohen (1996) attribute the development of 
an honor culture in the South to its settlement by Scotch and Irish immigrants 
whose economy was based on herding (see also, Nisbett 1993; Nisbett et al. 1995). 
They claim that herding cultures around the world tend to be more approving of
defensive forms of violence because of the vulnerability of herds to theft (see also 
Campbell 1965). Other scholars have attributed high white Southern homicide 
rates to the history of slavery and racial domination in the South (Ayers 1991; 
Cash 1941; Vandal 2000). According to this perspective, toleration of the use of 
violence to control black slaves may have led to its use in other circumstances. 

The evidence is clear that Southerners have more favorable attitudes toward de- 
fensive or retaliatory forms of violence than Northerners (Cohen and Nisbett 1994; 
Dixon and Lizotte 1987; Ellison 1991; Hayes and Lee 2005). Experimental research 
on college students has shown that Southern men react more strongly to insults 
than Northern men on behavioral, attitudinal and even biological measures (Cohen, 
Nisbett, Bowdle and Schwarz 1996). For example, Southerners are more likely to 
behave aggressively in a game of “chicken,” and when provoked, they are more upset 
(as indicated by a rise in cortisol levels). In addition, they are more prepared for ag- 
gression (as indicated by a rise in testosterone levels) and display more dominance 
behavior (they give firmer handshakes).2 Cohen (1996) has shown that Southern
laws are more permissive toward the use of violence for self-defense and defense of
home and property. Finally, Rice and Goldman (1994) found that homicides in the
South are more likely to result from arguments than homicides in the North. 

A slightly different conceptualization of the honor subculture emphasizes the 
tendency for Southerners to use “self-help” to redress their grievances (Black 1983; 
Chilton 2004). A Southern tradition developed in which disputes were handled 
privately, without the involvement of formal authorities (Reed 1971). Private 
retribution was prominent in the South because law enforcement was either in- 
adequate or corrupt (Cohen et al. 1996; Courtwright 1996; Cusson 1999; Vandal 
2000). The argument that a self-help tradition developed on the frontier could 
explain the higher rates of homicide in the West as well (e.g., Courtwright 1996; 
but see McGrath 1984). High Western rates have not received as much scholarly 
attention (Kowalski and Petee 1991; Limerick 1987). 


Race and Honor Cultures 


Some discussions of violence in black communities also imply an honor subculture.
For example, Wolfgang and Ferracuti (1967:153) argue that members of
some disadvantaged groups are more likely to value a “quick resort to physical 
combat as a measure of daring, courage, or defense of status.” A slight deroga- 
tory remark is likely to be seen as an occasion for violent retaliation. While race 
differences in violence can ultimately be attributed to racism and the historic 
oppression of blacks (e.g., Hawkins 1995; McCord 1997; Sampson and Wilson
1995), honor cultures may be a more proximate causal factor. High rates of violence
in the black community have also been attributed to a self-help tradition
that developed due to lack of access to the legal system (Black 1983). For example, 
Messner et al. (2005) argue that the code of honor among Southern whites diffused
to Southern blacks and that the black experience with lynching led them to
distrust the legal system and rely more upon self-help (see also Butterfield 1995). 

The evidence is unclear as to whether blacks and whites differ in their at- 
titudes toward violence (e.g., Blumenthal 1972; Cao et al. 1997; Erlanger 1974; 
Markowitz and Felson 1998; Rossi et al. 1974). On the other hand, race differ- 
ences in violence have been observed at colleges and prisons where blacks and 
whites are living in similar circumstances (e.g., Harer and Steffensmeier 1996; 
Volkwein et al. 1995). These results suggest that there may be cultural differences. 


The Role of Guns 


There is a fundamental problem with the honor culture thesis as applied to re- 
gional differences: Southerners do not have higher rates of violence than Northerners. 
While Southerners have higher rates of homicide and aggravated assault, they do 
not have higher rates of simple assault (McCall et al. 1992). Simple assaults are 
much more frequent than aggravated assault and homicide so the overall rates of 
violence are not higher in the South. Erlanger (1974) reported national data showing
that Southerners have lower rates of fist fighting in conflict situations than
Northerners. Finally, analyses of the National Crime Victimization Survey (Felson 
and Pare 2010) show that Southern and Western whites are much more likely than 
Northern whites to be victims of gun assaults but they do not have higher rates 
of knife and unarmed assault victimization. Only 10 percent of assaults in the 
data set involved guns (see also Baumer et al. 2003; Bureau of Justice Statistics 
2005). This study found a similar regional pattern for homicide: Southern whites 
are more likely than Northern whites to engage in gun homicides, but there is not 
much of a regional difference in homicides without guns.

An honor culture should lead to all types of assault, not just the atypical gun
fight. It appears that research in this area is attempting to explain a non-existent 
pattern. This evidence is, therefore, a serious challenge to the honor culture expla- 
nation of the regional differences that do exist. It suggests that regional differences 
among whites may be due to the prevalence of guns rather than honor cultures. 

Research generally supports the idea that the carrying of guns for protection var- 
ies across region. For example, Kleck and Gertz (1995) found that Southerners and 
Westerners were much more likely than Northerners to carry guns for protection (see 
also Bankston et al. 1990; McAnney 1993). They did not examine whether regional 
effects differed by race; the theoretical argument for a regional honor culture applies 
mainly to whites. Nor did they examine regional differences in carrying other weap- 
ons. Finally, studies of youth do not show consistent race or regional differences in 
gun carrying (Cook and Ludwig 2004; Lizotte et al. 2000; Sheley and Wright 1995). 

Research has also examined the relationship between region and owning weapons 
for protection. Ownership is a much more limited response than carrying a gun for 
protection. Most of these guns are kept at home and reflect the desire to protect 
self and family from an intruder. At any rate, these studies generally show regional 
differences consistent with the gun hypothesis. For example, Cohen and Nisbett
(1994) found that rural Southern white men were twice as likely to report having 
guns for the purpose of protection as rural Midwestern white men (see also McAnney
1993; Smith and Uchida 1988; Weil and Hemenway 1992; Young 1986).

Comparisons of violent crime rates for blacks and whites also suggest that fire- 
arms play an important role in understanding race differences. Blacks have much 
higher homicide and aggravated assault rates than whites but only somewhat 
higher rates of simple assault (Bureau of Justice Statistics 2005). Black offend- 
ers are also more likely than white offenders to use guns during violent offenses 
(Harlow 2001). In addition, studies of the general population show that blacks are 
also more likely to carry guns than whites (Kleck and Gertz 1995). Finally, results 
from studies of race differences in victimization rates show the importance of 
weapons. While blacks have higher homicide and aggravated assault victimization 
rates, they have slightly lower rates of simple assault victimization (e.g., Rennison
2001). Recent analyses of the NCVS show that blacks have much higher rates of 
gun assault victimization, similar rates of assaults with knives and other weapons, 
and lower risks of assault victimization by unarmed offenders (Felson and Pare
2010). These results are surprising, given the relatively high rates of criminal
victimization among blacks. 


Culture vs. Adversary Effects 


An important issue in the literature is whether a group differs from other groups 
because of a cultural tradition or because the group is responding to different cir- 
cumstances (Ellison 1991). The concern with honor and the practice of carrying 
guns for protection may be cultural traditions passed down for generations. Honor 
or gun cultures might persist in the South and West even after the disappearance of 
frontiers, herding economies or whatever circumstances led to their development. A
“cultural lag” could be involved, whereby a behavior continues to be prevalent when
circumstances have changed and the behavior is no longer adaptive (Ogburn 1957). 
A cultural tradition of using or carrying guns for protection might continue today, 
long after the disappearance of the frontier and the honor code. 

Alternatively, people may carry weapons because of their current circumstances: 
they believe the environment is dangerous, and they are concerned for their safety. 
Prior research has shown that weapon carrying is associated with fear of crime, 
prior violent victimization and county robbery rates (Cook and Ludwig 2004; 
McAnney 1993; Williams, Singh and Singh 1994; Ziegenhagen and Brosnan 
1990; but see Bankston et al. 1990). The current threat might be described as an 
adversary effect. The influence occurs because of spatial proximity to potentially 
dangerous adversaries; membership in a subculture or long-term residence is un- 
necessary. As a result of adversary effects, violent crime (and weapon carrying) may 
be more contagious than other crime. When potential adversaries have weapons, 
actors may acquire guns in response, leading to an “arms race.” 
Anderson's (1999) description of the “code of the streets” in black neighborhoods
implies an adversary effect. He argues that blacks in inner city communities
often adopt an aggressive posture to avoid victimization. Even youth who are 
not otherwise prone to use violence—the “decent kids” —follow the code of the 
streets. Youth living in these communities are more likely to carry firearms or 
other weapons for protection (Nielsen, Martinez and Rosenfeld 2005). From this 
perspective, their behavior is primarily adaptive and tactical although it may reflect 
the indirect effects of culture. 


The Current Study 


In this research we examine cultural effects by analyzing regional and race dif- 
ferences in carrying weapons for protection. We use data on carrying of firearms, 
knives and mace from the National Violence Against Women (and Men) Survey. 
Our assumption is that those who carry weapons for protection—whether on 
their person or in their automobile—reveal a willingness to use self-help and to 
respond to provocations with violence. They are prepared to handle the situation 
themselves rather than rely upon the police. In fact, Gastil (1971) identifies the 
carrying of weapons as one manifestation of the Southern subculture of violence. 

Our measure of cultural differences is based on reports of a behavior with a 
specific motivation, as opposed to the more typical method of relying on attitude 
measures. We ask whether some groups are more likely than other groups to carry 
particular tools for a particular purpose. We think that a behavioral measure is 
a better measure of the propensity to defend oneself if attacked than an attitude 
measure. The measurement follows Swidler’s (1986) well-known conceptualiza- 
tion of culture as a “tool kit.” In our case, we consider tools in a literal sense: 
weapons are the tools of violence. 

To isolate cultural effects, we include extensive controls for the current threat 
of violence to the respondent. If regional differences in weapon carrying disappear 
when the threat of violence is controlled, it will suggest that regional effects reflect 
current circumstances—an adversary effect—rather than a cultural tradition. If they 
remain with these controls, it suggests a cultural effect. 

Unfortunately, the idea of a subculture exaggerates the extent to which mem- 
bers of a group share a particular characteristic that differentiates them from 
outsiders (Markowitz and Felson 1998). We use the term, as others do, to suggest 
that members of one group are more likely than members of another group to 
have certain attitudes or behave in particular ways, even when their circumstances 
are similar. For example, a cultural argument is implied if white Southerners are 
more likely to carry weapons than white Northerners, controlling for threat, even 
though most white Southerners do not carry weapons. 

Based on the honor culture thesis we predict a statistical interaction between 
region and race for all three types of weapons. Specifically, we hypothesize that 
Southern and Western whites will be more likely to carry guns, knives and mace 
than Northern whites, controlling for threat. They carry weapons because they
are more highly motivated to respond to provocations with violence. On the 
other hand, if we only find regional differences in gun carrying, controlling for 
threat, it will imply a gun culture explanation. It will suggest that, because of the 
prevalence of guns, Southern and Western whites are more likely to carry firearms. 
It will challenge the idea that Southern and Western whites are more prepared 
to respond to provocations than Northern whites. Their current behavior reflects 
the prevalence of a particular tool, not a difference in motivation. Perhaps honor 
cultures increased the prevalence of guns historically, but they no longer operate. 

We make similar arguments for race differences. The honor culture thesis im- 
plies that blacks are more likely than whites to carry weapons for protection, 
controlling for threat. If race effects are only observed for gun carrying, then a 
gun culture is implicated. If the effects disappear when threat is controlled, it 
suggests that race differences in weapon carrying are an adaptation to current 
circumstances, i.e., an adversary effect. 
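
The predictions in the preceding paragraphs amount to a small decision rule: which group differences survive the threat controls determines which explanation is implied. A minimal Python sketch follows. The function name, the boolean inputs and the labels for outcomes the text does not discuss ("no group difference", "mixed pattern") are assumptions for illustration, not part of the authors' analysis.

```python
WEAPONS = ("gun", "knife", "mace")


def implied_explanation(before_controls, after_controls):
    """Map the pattern of group differences in weapon carrying to the
    explanation it implies.

    Each argument is a dict mapping "gun", "knife" and "mace" to True when
    a group difference in carrying that weapon is observed, before and
    after controlling for threat respectively.
    """
    remaining = {w for w in WEAPONS if after_controls.get(w, False)}
    if not remaining:
        if any(before_controls.get(w, False) for w in WEAPONS):
            # Differences vanish once threat is controlled.
            return "adversary effect"
        return "no group difference"
    if remaining == set(WEAPONS):
        # Differences persist for all three weapon types.
        return "honor culture"
    if remaining == {"gun"}:
        # Differences persist for guns only.
        return "gun culture"
    return "mixed pattern"


print(implied_explanation(
    {"gun": True, "knife": False, "mace": False},
    {"gun": True, "knife": False, "mace": False},
))
```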

We also examine statistical interactions involving gender, race and region. An
honor culture implies that we should find stronger regional and race effects on
men because violence is usually associated with male honor. Felson and Pare
(2010), however, found that regional and race differences in gun violence were just 
as strong for female offenders as for male offenders. They suggested that Southern 
white female offenders use guns because they must contend with adversaries who 
are mostly men (see also, Nisbett and Cohen 1996). 

Our examination of different outcomes involves a form of theory testing 
that experimental psychologists call “control construct design” (Shadish, Cook
and Campbell 2002). They compare the effect of a treatment on the predicted
outcome to its effect on related outcomes that should not be affected. A similar
method, “discriminant prediction,” has been used in criminology (Felson
2002). Theories are tested by determining whether variables predict everything 
they should and not what they shouldn't. 


Methods 


The NVAW survey includes data collected in 1995-1996 from a nationally representative
sample of 8,000 women and 8,000 men, ages 18 and older (see Tjaden
and Thoennes 2000). A computer-assisted telephone interview was conducted in 
which the respondents were asked about their weapon carrying and their experi- 
ences with violence. We omitted cases with missing data on the weapon carrying 
variable. We handle missing data on explanatory variables by including a missing 
data dummy variable. For example, if a respondent did not reveal his or her annual
income, this information is coded 1 on the “unknown income” dummy variable 
instead of the other income groups. Our final sample includes 7,511 men in 1,762 
counties and 7,587 women in 1,734 counties. 
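
The missing-data dummy approach described above can be sketched in Python as follows. The income group names and the choice of reference group are hypothetical, since the text does not list the survey's income categories; only the "unknown income" dummy comes from the description.

```python
# Hypothetical income groups; the first one serves as the reference category.
INCOME_GROUPS = ("low", "middle", "high")


def income_dummies(income_group):
    """Return dummy variables for income, coding a missing value as 1 on
    the 'unknown income' dummy and 0 on every income-group dummy."""
    if income_group is not None and income_group not in INCOME_GROUPS:
        raise ValueError(f"unexpected income group: {income_group!r}")
    dummies = {f"income_{group}": 0 for group in INCOME_GROUPS[1:]}
    dummies["income_unknown"] = 0
    if income_group is None:
        dummies["income_unknown"] = 1
    elif income_group != INCOME_GROUPS[0]:
        dummies[f"income_{income_group}"] = 1
    return dummies


print(income_dummies(None))
print(income_dummies("high"))
```

Coding the data this way keeps respondents with unreported income in the sample instead of dropping them.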
Measurement of Weapon Carrying


Our measure of weapon carrying was based on two questions. Respondents were 
asked: “Do you ever carry something with you to defend yourself or to alert other 
people?” Those who gave an affirmative answer were asked: “What do you carry?” 
We use the responses to these questions to construct a five-category variable: (1)
firearm; (2) knife or sharp object; (3) mace or other spray; (4) any other weapon;
(5) no weapon (the reference category). The category of other weapons includes a
variety of weapons and devices that were rarely used for protection (e.g., bats, keys, 
cellular phones). Since the meaning of this category is ambiguous, we do not pres- 
ent these coefficients. Note that the survey allowed respondents to report carrying 
more than one type of weapon. When respondents carried more than one weapon, 
they were coded as carrying the more lethal weapon. However, those respondents 
who carried mace and a weapon other than a firearm or a knife (n = 39) were 
coded as carrying mace. We did not analyze each weapon independently, because 
we wanted our comparison category to be people who did not carry any weapon. 
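The coding rule for respondents who reported more than one weapon can be expressed as a simple priority order. The sketch below is an assumption-laden illustration: the function name and labels are hypothetical, and ranking mace above "other" is just one way to reproduce the stated exception for respondents who carried mace together with another non-gun, non-knife weapon.

```python
# Priority order assumed for this sketch: the first match wins.
PRIORITY = ("firearm", "knife", "mace", "other")


def code_weapon(carried):
    """Collapse a set of reported weapon types into a single category."""
    for category in PRIORITY:
        if category in carried:
            return category
    return "none"  # reference category: no weapon carried
```

For example, `code_weapon({"mace", "other"})` returns `"mace"`, matching the treatment of the 39 respondents described above.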

We assume that most of the guns respondents carry are handguns, although 
some presumably carry long guns in their vehicles. Note also that some citizens 
who carry guns or other weapons for protection are criminal offenders. Most 
people who carry guns do so illegally, according to data comparing the number of 
people who carry guns to the number of people who have permits (Kleck 1997); 
however, most are probably not committing other offenses. Less is known about 
the characteristics of people who carry knives and mace for protection. We suspect 
that those who carry mace—a defensive weapon that does not produce injury—are 
unlikely to be offenders. 


Measurement of Independent Variables 


The main independent variables are the respondent's race/ethnicity, region of 
residence and gender. Race/ethnicity is coded as either White, Hispanic, Other 
race (mixed race, Asian, Native American, Pacific Islander or unknown), or Black 
(the reference category). Region is coded as South, West or North (the reference 
category). A list of the states included in each region is presented in the Appendix. 

Note that we do not have information about how long the respondents have lived 
in the region. Because some people may have moved across regions, the regional ef- 
fects we observe may be conservative estimates. On the other hand, if these relation- 
ships reflect adversary effects rather than socialization, they should not depend on 
length of residence. The evidence regarding whether current residence or residence 
when growing up predicts gun ownership is somewhat inconsistent. O’Connor and 
Lizotte (1978) found that current region of residence, but not residence at age 16, 
predicted gun ownership. Dixon and Lizotte (1987), on the other hand, found that 
both current region and region at age 16 predicted gun ownership. Finally, Young 
(1986) found no effect of current region of residence on whether non-hunting men
owned guns but that region at age 16 affected whether non-hunting women owned
guns. He assumed that non-hunters who owned guns used them for protection.
Note that these studies focus on gun ownership, not carrying guns for protection.

Current threat is measured by the respondents’ concern for their personal 
safety, their history of personal victimization and characteristics of the county 
in which they reside, including the violent crime rate, percent black, percent in 
poverty, and whether the county is in an urban area. We use multiple measures, 
and subjective as well as objective measures, so as to capture as completely as 
possible the threat experienced by the respondent. Respondents may feel more 
threatened if they live in urban areas with high percentages of blacks and poor 
people (Liska, Lawrence and Sanchirico 1982). 

Concern for safety is based on the question “How concerned are you about 
your own personal safety?” Respondents are coded 1 for “not really concerned;” 
2 for “just a little concerned;” 3 for “somewhat concerned;” and 4 for “very con- 
cerned.” Note that, while concern for safety may encourage weapon carrying, 
weapon carrying may alleviate concern for safety. It is unlikely, however, that any
reciprocal effect would affect the coefficients reflecting regional and race differences,
since concern for safety is not related to region and only weakly related to
race (analyses not presented).

The number of prior violent victimizations is based on the total number of 
physical assaults, sexual assaults, threats and stalking incidents reported by the re- 
spondent during adulthood. Respondents were asked, for example, whether anyone 
had pushed, slapped or hit them, threatened or attacked them with a weapon, forced 
them to have sex, or followed or spied on them (see Tjaden and Thoennes 2000). 
Multiple victimizations by the same offender were counted as a single victimization. 

To measure the county-level control variables, we linked respondents to their 
county of residence based on the FIPS code. We used data from the 1990 U.S. 
Census to determine the percentage of residents who are black, the percentage of 
households living in poverty, and the percentage of respondents who live in rural 
or urban areas. The percentage of households living in poverty is based on the 
proportion of families with less than $20,000 annual income (in 1995 dollars). 
Counties were coded as “rural” when 80 percent or more of their population 
live in rural areas based on the 1990 census coding, as “urban” when 80 percent 
or more of their population live in urban areas (the reference category), and as 
“mixed” in the remaining cases. 
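The 80 percent rule for classifying counties can be written as a small function. This is a sketch under the assumption that each county's rural and urban population shares sum to 100 percent; the function name is hypothetical.

```python
def code_urbanicity(pct_rural):
    """Classify a county from the percentage of its population living in rural areas."""
    pct_urban = 100.0 - pct_rural   # assumption: shares are complementary
    if pct_rural >= 80.0:
        return "rural"
    if pct_urban >= 80.0:
        return "urban"              # reference category in the models
    return "mixed"
```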

We used the Uniform Crime Reports for the years 1990-1995 to construct an 
index of criminal violence in their county. The index is based on the log of the 
total annual rate of homicide, rape, robbery and aggravated assault. Note that 
some scholars have questioned the validity of county-level UCR data (see Lott 
and Whitley 2003; Maltz and Targonski 2002, 2003 for discussions). For example, 
some police departments, especially in smaller counties, do not report their crime 
statistics every year. We address this problem by ignoring data from years in which
counties report no incidents of simple assault, based on the assumption that a
county must have had at least one simple assault. We then computed an average 
based on the number of years reported. Also, some states, such as Mississippi 
and Vermont, have especially large gaps of UCR coverage. We replicated our 
analyses omitting the nine states with the greatest gaps in UCR coverage (see Lott 
and Whitley 2003). Results were similar to those we present. Finally, note that 
in the examination of the effects of county crime rate we control for individual 
differences in personal victimization and concern for safety. These variables may 
mediate the effect of the aggregate crime rate. 
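One way to read the construction of the county violence index is sketched below. Several details are assumptions rather than facts from the text: the rate is expressed per 100,000 residents, a single population figure is used for all years, the average is taken before the logarithm, and the log is base 10 (as the Table 1 label suggests). The function and field names are hypothetical.

```python
import math


def violent_crime_index(yearly_records, population):
    """Log10 of the mean annual violent crime rate per 100,000 residents.

    yearly_records: list of dicts with counts for 'homicide', 'rape',
    'robbery', 'aggravated_assault' and 'simple_assault'.
    Years with zero simple assaults are treated as unreported and skipped.
    """
    rates = []
    for rec in yearly_records:
        if rec["simple_assault"] == 0:
            continue  # assume the county did not report that year
        violent = (rec["homicide"] + rec["rape"]
                   + rec["robbery"] + rec["aggravated_assault"])
        rates.append(violent / population * 100_000)
    if not rates or sum(rates) == 0:
        return None  # no usable years, or log undefined for a zero rate
    return math.log10(sum(rates) / len(rates))
```

Averaging only over the years that pass the simple-assault check mirrors the statement that the average is based on the number of years reported.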

Control variables at the individual level include the respondent’s age, annual 
income and level of education. The respondents’ annual income is in 1995 dollars, 
from all sources, before taxes. It is coded as either low income (less than $20,000, 
the reference category), medium income ($20,000 to $49,999), high income 
($50,000 or more), or unknown income. The level of education is treated as 
continuous, coded from 1 to 7, where 1 is “no schooling,” 2 is “1st-8th grade,” 3 is
“some high school,” 4 is “high school graduate,” 5 is “some college,” 6 is “4-year
college degree,” and 7 is “postgraduate.”


Analytic Strategy 


Because of the clustered nature of the data, we use a multilevel modeling strategy (see
Raudenbush and Bryk 2002). The multilevel multinomial logistic regression models
are estimated with HLM software. The multinomial logistic approach is useful for
assessing the likelihood of carrying a specific weapon vs. no weapon, while taking
into account the likelihood of carrying a different weapon. The individual-level
variables are entered at level 1 and the county-level variables are entered at level 2.
We include five multiplicative terms to test our predicted interactions. First, we
include white x South and white x West terms to determine whether regional effects
are stronger among whites than nonwhites. Nonwhites in this comparison include
blacks, Hispanics, and people of other or unknown race. We also include multiplicative
terms to test our predicted gender interactions: male x South; male x West; and
male x white. The hypothesis is that region and race differences should be stronger
among men since honor codes should affect men more than they affect women.
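The five multiplicative terms are products of 0/1 indicators. The sketch below shows how they could be built for a single respondent; the function and key names are hypothetical, and the models themselves were estimated in HLM, not in Python.

```python
def interaction_terms(white, male, region):
    """Build the five product terms from 0/1 indicators and a region label."""
    south = int(region == "South")
    west = int(region == "West")    # North is the reference region
    return {
        "white_x_south": white * south,
        "white_x_west": white * west,
        "male_x_south": male * south,
        "male_x_west": male * west,
        "male_x_white": male * white,
    }
```

A white man living in the South scores 1 on white x South, male x South and male x white, and 0 on the two West terms.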


Results 


Descriptive statistics for weapon carrying and the explanatory variables are presented
in Table 1. The table shows that the favorite weapon for men is a firearm (12.4 percent)
while the favorite weapon for women is mace (16.4 percent). Women are more
likely than men to carry some weapon for protection (30.1 percent vs. 23.6 percent;
p < .001) and to be concerned about their safety (mean scores of 2.7 vs. 2.4 on the
four-point scale; p < .001).

Table 1: Descriptive Statistics

                                      Full %        Men %         Women %
Individual Variables                  N = 15,098    N = 7,511     N = 7,587
Weapon Carrying
  Firearm                              7.8          12.4           3.2
  Knife                                3.9           4.9           2.9
  Mace                                 9.4           2.3          16.4
  Other weapon                         5.8           3.9           7.6
  No weapon                           73.1          76.4          69.9
North                                 44.8          44.2          45.5
South                                 34.4          33.4          35.5
West                                  20.7          22.4          19.0
White                                 77.8          78.0          77.7
Black                                  8.5          [illegible]    9.2
Hispanic                               5.8           5.2           6.4
Other/Unknown race                     7.8           8.9           6.7
Low income                            36.7          26.0          47.3
Medium income                         33.4          39.8          [illegible]
High income                           12.1          18.6           5.7
Unknown income                        17.2          15.6          19.8
Age (mean)                            43.2          42.4          44.0
Education (mean)                       4.8           4.9           4.7
Concern for safety (mean)              2.6           2.4           2.7
Victimization (mean)                    .8          [illegible]     .8
County Variables                      N = 2,208     N = 1,762     N = 1,734
Violent crime index (Log 10, mean)     2.2           2.2           2.2
% Black (mean)                         8.7           8.4           8.9
% Poor (mean)                         39.7          39.6          39.8
Urban                                 18.1          17.9          18.2
Rural                                 17.9          18.6          17.2
Mixed urban-rural                     64.1          63.5          64.6


The results from the multilevel multinomial logistic regressions for weapon carrying
are presented in Table 2. Equation I includes the threat variables and equation
II does not. The results from equation I reveal a significant Southern region x race
interaction for gun carrying, but not for knife or mace carrying. White Southerners
are more likely to carry guns for protection than white Northerners.* The lack of
statistically significant main effects or interactions involving Southerners on knife
or mace carrying is contrary to the honor culture thesis. The fact that the effects
are similar when the threat variables are omitted from the equation (equation II)
supports the idea of a gun culture among Southern whites.

We also observe main effects of Southern residence on carrying guns but not car- 
rying knives or mace. In analyses not presented we re-estimated the equations with- 
out the non-significant interaction terms. When these terms are omitted from the 
equation we find that Southerners are slightly more likely than Northerners to carry 
knives and mace (b = .28; t = 2.6; p = .01 and b = .20; t = 2.6; p = .01). These effects
are much weaker than the relationship between Southern residence and gun carrying. 

We also observe a strong West x white interaction for gun carrying. Western
whites are more likely than Northern whites to carry guns. We observe some
evidence of interactions for knife and mace carrying, but they are much smaller
effects and they are only statistically significant when the threat variables are omitted.
In general, effects are similar when the threat variables are omitted from the
equation. We also observe main effects of Western residence on mace carrying:
Westerners are less likely than Northerners to carry mace. On the other hand,
Western residence is not significantly related to gun or knife carrying.

Table 2: Multilevel Multinomial Logistic Regression for Weapon Carrying
(Unstandardized coefficients, t-ratio in parentheses)

[Coefficients not recoverable from the source. Columns: Firearm, Knife and Mace,
each with equations I and II.]
Individual Level: South; West; White; Hispanic; Men; Medium income; High income;
Age; Education; Concern for safety; Victimization; South x White; West x White;
Male x White; Male x South; Male x West
County Level: Violent crime index; % Black; % Poor; Rural; Mixed urban-rural
Intercept

*p < .05  **p < .01
Note: Variables for Other/unknown race and Unknown income were included in the
equation but not presented.

Table 3: Percentage Weapon Carrying by Gender, Race and Regions

                   Firearm     Knife          Mace
White Men
  Northern           7.16       4.20           2.28
  Southern          19.11      [illegible]     2.64
  Western           13.81       4.74           2.69
Black Men
  Northern          12.98       8.65           1.08
  Southern          17.02      [illegible]     2.43
  Western            4.05       8.11          [illegible]
White Women
  Northern            .86       1.89          16.47
  Southern           5.98      [illegible]    18.67
  Western            3.50      [illegible]    14.74
Black Women
  Northern           4.38       9.56          17.53
  Southern           6.02      [illegible]    19.30
  Western            2.04      12.24          12.24


Table 2 also shows main effects for race: whites are less likely than blacks to carry 
guns and knives. These effects are observed with and without controls for threat.
Hispanics are less likely than blacks to carry all three weapons, although the effect for 
mace is weaker and not statistically significant when the threat variables are included. 

Most of the interaction terms involving gender were not statistically signifi- 
cant and all were in the opposite direction to the honor culture prediction. Two 
terms were statistically significant. First, the difference between Southerners and 
Northerners in gun carrying was stronger for women. Second, the race difference in 
knife carrying was stronger for women. These patterns can be seen in Table
3. The evidence is not consistent with the idea of an honor culture among men. 

We also tested for three-way interactions between gender x race x region. None 
were statistically significant (for gun carrying, b = -.294; t ratio = -.805; p = .420 
for white x South x male and b = -.903; t ratio = -1.63; p = .103 for white x West 
x male). It may be that we do not have the statistical power to detect a three-way 
interaction, given the relatively few women who carry guns. Note, however, that 
in an analysis of gender subsamples (not presented) the coefficients for South x 
white and West x white were much stronger for women than for men (b = 1.51 
vs. .476 and 2.20 vs. .674, respectively).

Table 3 shows the percentages of black and white men and women in different
regions who carry firearms, knives and mace. The percentages confirm that, for both
men and women, regional differences are primarily observed in carrying firearms
among whites. One does not observe much regional variation in carrying knives
or mace. As indicated above, the table also reveals stronger regional differences in 
gun carrying among women. While Southern white men are more than twice as 
likely to carry firearms as Northern white men, Southern white women are more 
than six times more likely to carry firearms than Northern white women. Western 
whites are somewhere in between. One also observes some regional variation 
among blacks in gun carrying, but the differences are not as strong as the regional 
variation for whites. 
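The "more than twice" and "more than six times" comparisons follow directly from the firearm percentages for whites in Table 3. The short calculation below reproduces them, reading the Northern white women figure as .86 percent; the variable and function names are illustrative.

```python
# Firearm-carrying percentages for whites, taken from Table 3.
firearm_pct = {
    ("men", "Northern"): 7.16,
    ("men", "Southern"): 19.11,
    ("women", "Northern"): 0.86,
    ("women", "Southern"): 5.98,
}


def south_north_ratio(gender):
    """Ratio of the Southern to the Northern firearm-carrying percentage."""
    return firearm_pct[(gender, "Southern")] / firearm_pct[(gender, "Northern")]


print(round(south_north_ratio("men"), 2))    # 2.67
print(round(south_north_ratio("women"), 2))  # 6.95
```

These are ratios of raw percentages, not model-based estimates, so they do not adjust for the control variables in Table 2.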


Other Effects 


Most of the threat variables also affect weapon carrying (see equation I in Table
2). Prior victimization and concern for safety have strong positive effects on the 
carrying of any weapon. Respondents are more likely to carry guns if they live 
in counties where a high percentage of residents have low income or are black. 
On the other hand, respondents are slightly less likely to carry guns if they live 
in counties with high violent crime rates. With the exception of the finding on 
violent crime rates and gun carrying, the evidence is consistent with the idea of 
adversary effects.* 

Analyses not presented show that these relationships do not depend on the race 
or socioeconomic status of respondents. In other words, we find no evidence of a 
statistical interaction between race of respondent and percent black in the county 
or socioeconomic status of respondent and percent poor in the county. The fact that 
whites and blacks respond similarly to the racial composition of their communities 
suggests that stereotyping reflects statistical discrimination rather than prejudice. 

Finally, we observe a few other significant effects of demographic characteristics:
(1) young respondents are more likely than older respondents to carry all three
types of weapons; (2) respondents with higher income are more likely to carry
guns and mace, but less likely to carry knives; (3) educated respondents are less
likely to carry knives but more likely to carry mace; (4) men are much more
likely to carry firearms while women are much more likely to carry mace; and (5)
respondents who live in urban counties are less likely to carry firearms than those
who live in rural counties or counties that have an urban-rural mix.


Guns vs. Other Weapons 


We also performed an analysis in which we treated carrying other weapons as 
the reference category. This allowed us to evaluate the statistical significance of 
effects on gun carrying vs. carrying other weapons among those who carried 
weapons. For this analysis we estimated a multinomial logistic regression, using 
a three-category dependent variable: (1) gun carrying, (2) other weapon carrying
(including knives, mace and other weapons), and (3) no weapon carrying. The
results (not presented in tabular form) show that the main effect of race and the
interaction term involving white Southerners are both statistically significant (b
= -.54; t = -2.73; p = .007; and b = .50; t = 2.39; p = .017). The interaction term
involving white Westerners is of borderline significance (b = .52; t = 1.94; p =
.052). The results suggest that, among those who carry weapons, white Southerners
and Westerners are more likely to carry guns than white Northerners, and blacks 
are more likely to carry guns than whites. 
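Because these are logistic regression coefficients, exponentiating them gives multiplicative effects on the odds of carrying a gun rather than another weapon. The snippet below applies that standard conversion to the three coefficients reported above. It is only an aid to interpretation: the function name is hypothetical, and for the interaction terms the exponentiated value is the factor by which the regional odds ratio among whites differs from that among nonwhites, not an odds ratio for a group.

```python
import math


def odds_multiplier(b):
    """Convert a logit coefficient into a multiplicative effect on the odds."""
    return math.exp(b)


print(round(odds_multiplier(-0.54), 2))  # 0.58  (white vs. black main effect)
print(round(odds_multiplier(0.50), 2))   # 1.65  (white x South)
print(round(odds_multiplier(0.52), 2))   # 1.68  (white x West)
```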


Alternative Regional Classifications 


In the analyses above we relied on the census definition to define Southern states, 
rather than the self-identification of residents or Civil War participation. In a 
supplementary analysis, we separated West Virginia, Maryland, Delaware and 
Washington, DC from states that are clearly Southern. Using four regions, we 
then looked at the white x “pure South” interaction. We observed the same inter- 
action, and it was of similar strength. We did not have sufficient sample size to 
examine whether whites in the border areas were different. We should note that 
cross-tabulations showed that Maryland was more similar to the Northern states 
than the Southern states in gun carrying among whites.


Discussion 


By far, the strongest regional pattern we observe involves firearms. White 
Southerners and Westerners are much more likely to carry guns for protection 
than white Northerners. Regional differences are just as strong, and in some cases 
stronger, for women. In addition, the effects do not change much when we intro- 
duce controls for the threat of victimization, suggesting that regional effects are 
due to cultural differences. We do not observe the same regional variation when 
we examine the carrying of other weapons. 


Weapon Cultures 


Our evidence suggests that there is a gun culture among Southern and Western 
whites. The regional patterns may explain why Southern and Western whites have 
high rates of homicide and gun assaults but not high rates of unarmed assault.
Recall that the same region-race interaction was observed for gun assault victimization
with the National Crime Victimization Survey (Felson and Pare 2010). The
strength of the statistical interactions and the consistency of the pattern in different
data sets with different dependent variables are striking. The results increase our
confidence that guns are a key factor in explaining regional differences.

The nature of this gun culture, however, is not clear. We cannot determine 
whether our gun carrying measure reflects the prevalence of guns generally, the 
prevalence of handguns, or the tendency to carry guns for protection. We are not 
aware of any research that directly examines whether there is regional variation in 
guns used strictly for sport or hunting. Research has, however, examined regional
variation in ownership of handguns vs. long guns and handguns vs. all guns
(Kleck 1991). Handguns are more likely to be used for protection. That research 
suggests that regional differences are observed for ownership of long guns but 
that regional differences in ownership of handguns are stronger. Future research 
will have to determine the nature of gun cultures in Southern and Western states. 

Note that the prevalence of firearms does not necessarily indicate that guns are 
used for protection. The prevalence of guns in Israel and Switzerland, for example, 
is attributable to army service among citizens, and is not related to honor cultures 
or the use of guns for individual protection. In addition, the use of guns for pro- 
tection is different from the use of guns for sport (Wright, Rossi and Daly 1983). 
The tendencies to own guns for protection and for sport are correlated
to some extent, but at least some of the causes tend to be different. For example,
Lizotte and Bordua (1980) found that the use of guns for protection was associ- 
ated with the use of guns by friends for this purpose while owning a gun for sport 
was related to gun ownership by parents. 

We also observe evidence of a cultural effect in the use of mace. Mace is more 
likely to be the weapon of choice for young, educated, middle-class women. These 
demographic effects are just as strong when we control for threat, suggesting cultural
differences. In other words, mace is much more likely to be the weapon
of choice in the tool-kit of “yuppie” women.

Our claims for cultural effects depend on the assumption that we have adequately
controlled for current threat. One could argue, for example, that the
county-level measures do not adequately capture contextual effects. However, we
used a variety of measures at both the individual and county levels. Most of our
threat measures have substantial effects on weapon carrying, including county-level
measures of race and class. People are more likely to carry all three types of
weapons when they feel unsafe or they have been victims of violent crime, and 
those who live in proximity to poor people and blacks are more likely to carry 
guns for protection. Yet, controlling for measures of threat had little effect on the 
effects of region and race. 

Note that adversary effects as well as culture may play a role in the continued 
prevalence of guns. The notion of an arms race implies that once guns become 
prevalent, their usage spreads because of concerns that potential adversaries are 
armed with guns. A cultural difference thus becomes an adaptive response to
current circumstances.


Honor Cultures 


The results do not provide much support for a regionally-based honor culture. 
In some equations we were able to detect some regional differences in carrying 
knives and mace but they were slight and of borderline statistical significance. The 
results suggest that, if honor cultures are playing a role in regional differences, that 
role is limited. It may be that a Southern honor culture operated in the past, but
primarily regional differences in gun carrying remain. We observe more evidence
supporting an honor culture among blacks. Blacks are more likely than whites to 
carry both guns and knives for protection. 

The gender effects we observe, however, are not consistent with the honor culture
thesis. One would have expected stronger regional and race differences among
men, because courage and the display of toughness are more important for men in
an honor culture. The fact that regional and race differences are just as strong,
and probably stronger, for women is problematic for the honor culture argument.
In addition, the race difference in knife carrying is particularly strong among 
women: black women are much more likely to carry knives than white women. 

Perhaps adversary effects explain why race and regional differences in gun carry- 
ing among women are stronger in this study. Women must contend with adversar- 
ies who are mostly men. The advantage of carrying a gun may be even greater for 
them, given men’s advantage in physical strength (Felson 1996). If they interact 
with men who are quick to retaliate or well-armed then they may think they need 
a gun for protection. Thus, research shows that homicide and assault offenders are 
more likely to use guns when their victims are male (Felson and Messner 1996). 

It could be that a different type of adversary effect conceals the effects of a 
Southern honor culture by suppressing regional differences in knife and mace 
carrying. Perhaps the prevalence of guns reduces the prevalence of violence with- 
out firearms (see Cook and Ludwig 2004; Felson and Pare 2010; Griffiths and 
Chavez 2004). Citizens may be less likely to carry less lethal weapons because 
they are ineffective against gun-toting offenders. Violence or self-defense using
knives, mace, or fists may be too dangerous if adversaries might be armed with 
guns. Furthermore, offenders may be reluctant to engage in an assault without a 
gun if they think adversaries are armed and prone to retaliate. In sum, while seri- 
ous violence is contagious, it may be a deterrent to more minor forms of violence 
and aggression. Previous research suggesting that an honor culture continues to 
influence Southern and Western whites may be correct, but the prevalence of 
guns may suppress the tendency to carry other weapons and engage in fist fights. 

In conclusion, our research points to the importance of firearms in regional and 
race variation in violence. It also points to the theoretical importance of proper 
conceptualization of the dependent variable and the importance in research of 
examining multiple outcomes. Honor cultures cannot explain regional and race 
differences in weapon carrying and assault, if effects are mainly observed for be- 
havior involving firearms. Honor culture cannot explain why effects are stronger 
(or at least as strong) for women. Either honor cultures no longer operate or their 
effects on non-gun violence are suppressed due to adversary effects. Whichever 
explanation one prefers, our results show that the role of firearms is critical to an 
explanation of regional and race variation in violence in the United States.


Notes


1. Higher homicide rates in the South are due, to some extent, to higher levels of poverty 
and greater numbers of blacks in that region (Loftin and Hill 1974; McCall, Land 
and Cohen 1992; Parker 1989; Parker and Pruitt 2000; Smith and Parker 1980). 


2. On the other hand, regional differences in attitudes toward general violence are not 
observed (e.g., Dixon and Lizotte 1987). 


3. In analyses not presented we include a control for whether the state had laws 
restricting or forbidding the carrying of firearms. It could be that these laws reflect a
gun subculture, so we preferred not to include them in the main analysis. At any rate, 
the results were similar whether this variable was included or not. 


4. When concern for safety, victimization, and percent black and percent poor are left
out of the equation, the violent crime index has small and sometimes borderline 
effects on knife carrying and mace carrying but no effect on gun carrying.


Appendix 1. States by Regions

North            South                   West
Connecticut      Alabama                 Alaska
Illinois         Arkansas                Arizona
Indiana          Delaware                California
Iowa             District of Columbia    Colorado
Kansas           Florida                 Hawaii
Maine            Georgia                 Idaho
Massachusetts    Kentucky                Montana
Michigan         Louisiana               Nevada
Minnesota        Maryland                New Mexico
Missouri         Mississippi             Oregon
Nebraska         North Carolina          Utah
New Hampshire    Oklahoma                Washington
New Jersey       South Carolina          Wyoming
New York         Tennessee
North Dakota     Texas
Ohio             Virginia
Pennsylvania     West Virginia
Rhode Island
South Dakota
Vermont
Wisconsin

Note: Based on U.S. Census, with Northeast and Midwestern states classified as North.


The Effect of Minimum Wage Rates on High School Completion 


John Robert Warren, University of Minnesota 
Caitlin Hamrock, University of Texas at Austin 


Does increasing the minimum wage reduce the high school completion rate? Previous 
research has suffered from (1) narrow time horizons, (2) potentially inadequate measures
of states’ high school completion rates, and (3) potentially inadequate measures
of minimum wage rates. Overcoming each of these limitations, we analyze the impact
of changes in state and federal minimum wage rates on state high school completion 
rates for the graduating classes of 1982 through 2005. Our state-level analyses, which 
consist of a series of state and year fixed-effects models with controls for state-year 
time-varying covariates, provide no support for the argument that increasing the 
minimum wage reduces rates of high school completion. 


Proposals to increase minimum wage rates are typically met with two critical 
responses. First, detractors argue, raising the minimum wage raises employers’ 
costs. This, in turn, leads them to hire fewer workers—particularly lower-wage, 
entry-level workers—and to reduce the hours of the workers they already have. 
Second, raising the minimum wage induces some young people to drop out of 
high school. These two responses may seem contradictory—it is not clear why 
young people would leave high school when there are fewer lower-wage, entry- 
level jobs to be had. This apparent contradiction is resolved by considering that 
employers make hiring decisions while students make decisions about whether and 
when to drop out of school. 

In this research, we are centrally concerned with estimating the impact of 
changes in minimum wage rates on state high school completion rates. First, we 
place this question within theoretical perspectives on the processes that lead some 
young people to leave high school before obtaining a diploma. In short, while 
some economic theory may predict lower graduation rates because of higher mini- 
mum wage rates, most sociological, psychological and educational perspectives 
would posit few such effects. The latter perspectives would also suggest that any 
effects of changes in minimum wage rates on completion rates are heterogeneous 
across groups of students. Second, we review existing evidence about the empiri- 
cal relationship between minimum wage rates and high school completion rates; 
we are critical of extant research for methodological reasons. Third, we present 
analyses of the impact of minimum wage rates on high school completion rates 
for the graduating classes of 1982 through 2005. 


An earlier version of this article was presented at the 2008 annual meeting of the American Sociological 
Association in Boston. This research was supported by the University of Minnesota’s Undergraduate
Research Opportunities Program. We are also grateful to several anonymous reviewers for their useful
suggestions. Direct correspondence to John Robert Warren, University of Minnesota, Department of
Sociology, 909 Social Sciences, 267 19th Ave S., Minneapolis, MN 55455. E-mail: warre046@umn.edu.


Why Do Young People Drop Out of High School?


Research in sociology, psychology and education suggests that dropping out of 
high school is the end result of a developmental process that is influenced by 
“push” factors, “pull” factors and complex psychological processes. Researchers who 
emphasize the “push” factors have documented the effects of a variety of state edu- 
cation policies, including course graduation requirements, compulsory schooling 
laws and state exit examination policies (see Lillard and DeCicca 2001; Warren, 
Jenkins and Kulick 2006; Dee 2003), grade retention rates (Hauser, Frederick and 
Andrew 2007; Jimerson 2001), and suspension and discipline policies, among 
others. Researchers who emphasize the “pull” factors have documented the effects 
of labor market conditions (McNeal 1997; Stearns and Glennie 2006) and family 
and peer influences, among others. 

Research on students’ eventual decisions to leave school emphasizes that “push”
and “pull” factors must be considered within the context of various developmental
processes. A number of developmental models, including the frustration self-esteem
model and the participation-identification model, have been elaborated
and reviewed elsewhere (e.g., Finn 1989; Mahoney and Cairns 1997; Stearns et 
al. 2007). These models emphasize the roles of school success and failure, students’ 
psychological orientations and reactions to academics and the educational process, 
and students’ resulting actions and behaviors. From these perspectives, the conse- 
quences of “push” and “pull” factors for dropout decisions are highly contingent 
on students’ school success, their psychological orientations toward school, work 
and family, and other individual-level factors. 

This research on the processes that lead students to drop out of school has impor- 
tant implications for research on the effects of minimum wage rates on high school 
completion. One implication is that the effects of minimum wage increases may be 
conditional on various “push” and “pull” factors. It makes sense to think that the ad- 
ditional “pull” of higher wage rates might be more consequential in the presence of 
relatively large “push” factors. For example, higher minimum wage rates might most 
strongly influence high school completion rates when states impose more rigorous 
graduation requirements (Chaplin, Turner and Pape 2003; Ehrenberg and Marcus 
1982). In our analyses, we test the hypothesis that the effects of minimum wage 
rates vary as a function of states’ course graduation requirements, states’ compulsory 
school age laws and states’ exit examination policies. 

Another implication of this body of research is that relatively few students are 
likely making rational calculations about the relative economic advantages of con- 
tinuing to acquire human capital by staying in school vs. leaving school sooner to 
take advantage of higher wages. The research reviewed above suggests that perhaps 
only a minority of students who are at risk of dropping out are swayed by labor 
market conditions. For most students’ decision-making processes, labor market 
conditions are likely unimportant or at least only important under very specific 
conditions. For example, increased minimum wage rates may only matter for
students who have histories of school failure; who are not engaged in meaningful 
extracurricular, social or athletic activities in school; and, perhaps most impor- 
tantly, who are in a position to meaningfully increase their levels of labor market 
participation. If relatively few jobs are available, if those jobs are of poor quality, 
if students are competing with adults for those jobs, and/or if would-be dropouts 
are not able to increase their work hours in order to make a go at supporting 
themselves, they may not be “pulled” out of school by minimum wage increases. 
To consider these issues in our research, we formally test the hypothesis that the 
effects of minimum wages on high school graduation rates vary as a function of 
states’ unemployment rates.


Prior Evidence 


Previous research has come to mixed conclusions about whether raising the mini- 
mum wage reduces school enrollment. While some authors find such effects in 
the United States and elsewhere (Neumark and Wascher 1995a, 1995b; Pacheco and
Cruickshank 2007), others have found that the effect is limited to certain popula- 
tion subgroups (Ehrenberg and Marcus 1982), some have found positive effects on 
school enrollment (Mattila 1981), and still others have found no effects at all (Card 
1992; Campolieti, Fang and Gunderson 2005; Neumark and Wascher 2003). 

Most of this prior research suffers from three basic methodological problems. 
First, this research considers either a relatively narrow range of years/cohorts or 
else follows a single cohort over a relatively narrow range of time. Chaplin, Turner 
and Pape (2003), for example, considered only the eight years from 1979 through 
1986. Because minimum wage rates change gradually over time, such narrow 
time horizons may unnecessarily truncate variability in minimum wages. In our 
analyses, we model the impact of changes in minimum wage rates on high school 
completion rates for the graduating classes of 1982 through 2005. 

Second, prior research has suffered from unsatisfactory measures of the key 
dependent variable: high school completion rates. Frequently-used measures of 
high school completion are often biased by their failure to adequately account 
for grade retention, migration, mortality, alternate degree attainment and other 
factors (Warren 2005; Warren and Halpern-Manners 2007). This is especially 
true when school enrollment or high school completion is measured at the state 
level. For example, using simulations like those employed by Warren and Halpern- 
Manners, we find that the “high school continuation rates” used by Chaplin, 
Turner and Pape (2003) are biased to the extent that net migration differs from 
zero and to the extent that any students are made to repeat grades. Warren and 
Halpern-Manners demonstrate that school enrollment/completion rates based 
on data from the Current Population Survey like those used by Neumark and 
Wascher (1995a, 1995b) are seriously biased. Thus, we chose a measure of high 
school completion rates that is largely unbiased by these factors. 

Third, prior research has potentially suffered from unsatisfactory measures
of the key independent variable: minimum wage rates. Previous researchers have 
universally operationalized minimum wage rates as the minimum wage in one 
particular year. Most model the relationship between the minimum wage in 2005, 
for example, and the graduation rate at the end of the 2004-05 academic year. 
Thus, they consider only the minimum wage rate that is in effect for the second 
half of students’ senior years of high school. This operationalization departs rather 
markedly from behavioral theories of why students would leave high school as a 
result of changes in the minimum wage (and also ignores the fact that the majority 
of dropouts leave school before reaching the second half of the senior year). We 
might hypothesize, for example, that the dollar value of the minimum wage over 
the past two (or more) years of schooling is more influential in students’ decision
making about whether to leave school than the dollar value of the minimum wage
in the latter half of students’ senior years. We might also hypothesize that students
react to any change in the minimum wage, regardless of dollar value. Thus research
might do well to operationalize the key independent variable in terms of whether
any change in the minimum wage rate occurs in the last two (or even four) years
of high school. We estimate separate models that differ with respect to how we
operationalize minimum wage rates.


Research Design and Measures 


We are interested in estimating the association between minimum wage rates and 
high school completion rates, both of which vary across states and over time. To
do so we employ state and year fixed-effects models which can be written as: 


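\[
Y_{it} = \alpha + \beta\,\mathrm{MinimumWage}_{it} + \sum_{j=1}^{n} \lambda_j X_{jit} + \sum_{1}^{i-1} \mathrm{State}_i + \sum_{1}^{t-1} \mathrm{Year}_t + \varepsilon_{it} \qquad (1)
\]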


where $Y_{it}$ represents the outcome variable in state $i$ in year $t$; $\alpha$ is a constant; $j$
indexes the several time-varying covariates $X$ described below that vary over the
$i$ states and the $t$ years; $\lambda_j$ represents the coefficients for these $n$ time-varying
covariates; $\mathrm{State}_i$ and $\mathrm{Year}_t$ are state and year fixed effects, respectively; and $\varepsilon_{it}$ is
a random disturbance term. The state fixed effects in Equation 1 conceptually
account for all aspects of states that remain constant over time but vary across
states. The year fixed effects conceptually account for all aspects of years that are
constant across states but vary over time. This technique is built on the fact that
it is impossible to explicitly measure all aspects of particular states or of particular
years that might bias our estimates of the associations between minimum wage
rates and high school completion rates. To estimate the model in Equation 1 we
use the least-squares dummy-variable approach, in which dummy variables are
introduced for each of $i-1$ states and for each of $t-1$ years. All results are robust to
alternative specifications in which we more carefully account for serial correlation
(Bertrand, Duflo and Mullainathan 2004).

Table 1: Descriptive Statistics

                                                       Min     Max    X̄ or p    Standard Deviation
Dependent Variable
  Estimated high school completion rate (%)           49.0    97.6    75.0      (8.80)
Minimum Wage Variables
  In senior year - $6.25 (2005$)                       -.92    2.18     .00      (.55)
  Mean in junior & senior years - $6.23 (2005$)        -.82    2.16     .00      (.55)
  Mean in four high school years - $6.19 (2005$)       -.71    2.51     .00      (.58)
  Any change in junior or senior years? (yes = 1)        0       1    [illegible]
  Any change in four high school years? (yes = 1)        0       1    [illegible]
State Education Policy Variables
  Maximum Age of Compulsory Schooling
    16 or less (yes = 1)                                 0       1     .64
    17 (yes = 1)                                         0       1     .16
    18 (yes = 1)                                         0       1     .20
  Carnegie Units Required for Graduation
    No statewide requirement                             0       1    [illegible]
    Fewer than 18                                        0       1    [illegible]
    18 to 22                                             0       1     .45
    More than 22                                         0       1     .21
  State High School Exit Examination Policy
    No state exit exam                                   0       1    [illegible]
    “Minimum Competency” state exit exam                 0       1     .16
    “More Challenging” state exit exam                   0       1    [illegible]
State Unemployment Rate - 5.8% (%)                     -3.5    11.6     .0      (2.0)

Note: Sample of 1,224 state-years constructed by cross-classifying the 51 states
(including DC) by the 24 years from 1982 through 2005 (inclusive). See text for
description of variables.

We have obtained minimum wage rates for each state from 1982 through 2005 
and we have calculated state high school graduation rates for the graduating classes 
of 1982 through 2005. Consequently, the sample size for our analyses equals 24 
years (1982 through 2005) x 51 states (including D.C.) = 1,224. 
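
As a rough illustration of Equation 1, the sketch below estimates the minimum wage coefficient on a toy balanced panel. It is not the authors' code. Instead of explicit dummy variables it uses the within (two-way demeaning) transformation, which for a balanced panel with a single regressor yields the same slope as the least-squares dummy-variable approach. The function name, the toy data and the omission of the time-varying covariates are all assumptions made for brevity.

```python
def two_way_fe_slope(panel):
    """Slope of y on x net of state and year fixed effects.

    `panel` maps (state, year) -> (x, y) and must be balanced,
    i.e., every state is observed in every year.
    """
    states = sorted({s for s, _ in panel})
    years = sorted({t for _, t in panel})

    def demean(idx):
        # Subtract state means and year means, then add back the grand mean.
        v = {k: obs[idx] for k, obs in panel.items()}
        grand = sum(v.values()) / len(v)
        s_mean = {s: sum(v[s, t] for t in years) / len(years) for s in states}
        t_mean = {t: sum(v[s, t] for s in states) / len(states) for t in years}
        return {(s, t): v[s, t] - s_mean[s] - t_mean[t] + grand
                for s in states for t in years}

    x, y = demean(0), demean(1)
    sxx = sum(val * val for val in x.values())
    sxy = sum(x[k] * y[k] for k in x)
    return sxy / sxx


# Hypothetical toy data: completion = state effect + year effect - 1.5 * wage
state_fx = {"A": 70.0, "B": 75.0, "C": 80.0}
year_fx = {2003: 0.0, 2004: 1.0, 2005: 2.0}
wage = {("A", 2003): 0.0, ("A", 2004): 0.5, ("A", 2005): 0.5,
        ("B", 2003): 0.0, ("B", 2004): 0.0, ("B", 2005): 1.0,
        ("C", 2003): 0.2, ("C", 2004): 0.2, ("C", 2005): 0.2}
toy_panel = {k: (w, state_fx[k[0]] + year_fx[k[1]] - 1.5 * w)
             for k, w in wage.items()}
slope = two_way_fe_slope(toy_panel)  # recovers the built-in coefficient
```

Because the state and year effects are removed before the slope is computed, the large differences in baseline completion rates across the toy states do not contaminate the estimate.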


State High School Completion Rates 


Our measure of states’ high school completion rates, developed by Warren (2005), 
is based on Common Core Data and represents the cohort-specific, state-level 
percentage of incoming public school 9th graders who complete high school by
obtaining a diploma. This measure does not count GED recipients as high school 
completers, and approximates longitudinal completion rates. 

The National Center for Education Statistics produces a CCD-based, four-year 
high school completion rate using counts of high school dropouts and completers 
in each state and in each academic year. Beyond conceptual and technical problems 
with this measure (Warren 2005), the NCES-recommended four-year completion
measure is only available for 36 states and the District of Columbia in 2000, and is
not available at all prior to 1996. A number of researchers have proposed alternate
CCD-based state-level high school completion rates based on a comparison of the
number of high school completers (not counting GED recipients) in one year
to the number of students enrolled in some prior years (e.g., Greene and Winters
2002, 2005; Haney et al. 2004; Swanson 2003; Seastrom et al. 2005). Such measures
have the virtue of being available for all states and for a sufficient number
of academic years. Unfortunately, as described in detail by Warren (2005), these
measures are conceptually and technically flawed and empirically biased in serious
ways (see also Kaufman 2001; Pallas 1990). In particular, the measures developed
by these authors are systematically biased by interstate migration, grade retention
and/or changes over time in the size of incoming cohorts of high school students.
Unfortunately, the same biases are inherent in the graduation rate measures used
by Chaplin, Turner and Pape (2003) and other researchers who have estimated the
impact of minimum wage rates on state high school completion rates.

The measure that we use, Warren’s (2005) “Estimated Completion Rate,” is
based on a comparison of the number of public high school completers in one academic
year to the estimated number of first-time 9th graders three academic years
earlier after accounting for the interstate migration of students in the intervening
years. Warren (2005) demonstrates in a series of simulations that this measure is
not biased by interstate migration, grade retention or changes over time in the size
of incoming cohorts of students. Warren (2005) produces state-level high school 
completion rates for graduating classes through 2002; we have followed exactly 
parallel techniques to update these state-level graduation rates through 2005. As 
shown in Table 1, the average high school completion rate across our 1,224 state 
years is 75 percent. 
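
The logic of the Estimated Completion Rate can be sketched in a few lines. This is only an illustration of the ratio described above, not Warren's (2005) procedure: the function name is hypothetical, and representing the migration adjustment as a single multiplicative ratio is an assumption made here for simplicity.

```python
def estimated_completion_rate(diplomas, base_cohort, net_migration_ratio=1.0):
    """Percent of an entering cohort that earns a regular diploma.

    diplomas: regular diplomas issued by public schools in one academic year.
    base_cohort: estimated number of first-time 9th graders three academic
        years earlier.
    net_migration_ratio: hypothetical multiplier standing in for the
        migration adjustment (>1 for net in-migration, <1 for net
        out-migration over the intervening years).
    """
    if base_cohort <= 0 or net_migration_ratio <= 0:
        raise ValueError("cohort size and migration ratio must be positive")
    at_risk = base_cohort * net_migration_ratio  # students at risk of graduating
    return 100.0 * diplomas / at_risk
```

With 750 diplomas and a cohort of 1,000, the rate is 75 percent when there is no net migration; if net in-migration had enlarged the at-risk cohort by a quarter, the same 750 diplomas would correspond to a rate of 60 percent.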


Minimum Wage Rates 


As depicted in Figure 1, the federal minimum wage increased only four times 
from 1982 through 2005, but states’ minimum wages changed more frequently. 
Whereas only three states’ minimum wages exceeded the federal minimum wage 
in 1982, that number grew to 17 in 1990 and was at 15 in 2005. The “minimum
wage” is the higher of the federal minimum wage and each state’s own minimum
wage. As shown in Figure 1, the longer it has been since an increase in the federal 
minimum wage, the greater the number of states with minimum wage rates that 
exceed the federal minimum wage. 

Figure 1. The Federal Minimum Wage Rate and the Number of States with Minimum
Wage Rates Exceeding the Federal Minimum Wage

States’ minimum wage rates are reported for most years in the Council of
State Governments’ Book of the States (see 2007). Also, the January edition of the
Monthly Labor Review has traditionally summarized the preceding year’s changes
to states’ minimum wage (and other labor and compensation) laws (Nelson 1982).
Using these resources, we were able to ascertain each state’s minimum wage for
each year from 1982 through 2005. In some states the minimum wage is below
the federal minimum wage, but this lower wage pertains only to the small share
of workers who are exempt from the federal minimum wage (mostly farm workers,
tipped workers and salaried professionals). In our analyses we take the higher
of the federal and state minimum wage for each state-year; the resulting wage is
the lowest that the vast majority of workers could be paid per hour in that state
in that year. We have adjusted minimum wages for inflation using the National
Consumer Price Index; all wages are expressed in 2005 dollars.
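
The two steps just described (taking the higher of the federal and state wage, then deflating to 2005 dollars) can be sketched as follows. The function name and the price index arguments are hypothetical; the index values passed in would come from whatever CPI series is used.

```python
def effective_real_wage(federal, state, cpi_year, cpi_base):
    """Binding minimum wage for one state-year, in base-year dollars.

    federal, state: nominal hourly minimum wages; `state` may be None
        when a state has no minimum wage of its own.
    cpi_year, cpi_base: price index values for the observation year and
        for the base year (2005 in the article).
    """
    nominal = federal if state is None else max(federal, state)
    return nominal * cpi_base / cpi_year
```

A state minimum below the federal rate therefore never lowers the measure, which matches the treatment of sub-federal state minimums described above.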

Most analysts have modeled the relationship between minimum wages and 
graduation rates by regressing the graduation rate in year t on the minimum
wage in year t. For reasons outlined above, this operationalization seems inconsistent
with the behavioral theories that underlie hypotheses about why changes
in the minimum wage might affect students’ decisions about whether to persist in 
school. Consequently, we estimate separate models with the following alternate 
operationalization of the key independent variable, X: 


1. X = Minimum wage in the last year students are in
school (e.g., the minimum wage in 2005 for the class
of 2005)

2. X = Average of the minimum wage rates for the last
two years students are in school (e.g., the average of
the minimum wages in 2004 and 2005 for the class
of 2005)

3. X = Average of the minimum wages in the four
years students are in school (e.g., the average of the
minimum wages between 2002 and 2005 for the class
of 2005)

4. X = Dummy variable indicating whether the minimum
wage changed in the last two years students are in
school (e.g., whether the minimum wage changed in
2004 or 2005 for the class of 2005)

5. X = Dummy variable indicating whether the minimum
wage changed during the last four years students are
in school (e.g., whether the minimum wage changed
between 2002 and 2005 for the class of 2005)


For each of the first three measures we have expressed the minimum wage (or
average minimum wage) as a deviation from the mean across all observations.
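
The five operationalizations listed above can be sketched for a single state and graduating class. The function and key names are hypothetical. One further assumption is labeled in the code: a "change" in a given year is taken to mean that the nominal wage differs from the previous year's nominal wage, since inflation-adjusted wages change every year by construction.

```python
def wage_measures(real_wage, nominal_wage, class_year):
    """Five candidate minimum wage measures for one state and one class.

    real_wage, nominal_wage: dicts mapping calendar year -> minimum wage.
    Assumption: a change in year y means nominal_wage[y] != nominal_wage[y - 1].
    """
    def mean_last(n):
        # Average real wage over the last n years students are in school.
        return sum(real_wage[class_year - k] for k in range(n)) / n

    def any_change(n):
        years = range(class_year - n + 1, class_year + 1)
        return int(any(nominal_wage[y] != nominal_wage[y - 1] for y in years))

    return {
        "senior_year": real_wage[class_year],   # measure 1
        "mean_last_two": mean_last(2),          # measure 2
        "mean_four": mean_last(4),              # measure 3
        "change_last_two": any_change(2),       # measure 4
        "change_four": any_change(4),           # measure 5
    }


def center(values):
    """Express values as deviations from their mean (applied to measures 1-3)."""
    m = sum(values) / len(values)
    return [v - m for v in values]
```

For a class of 2005 whose state raised its wage only in 2003, the two-year dummy equals 0 while the four-year dummy equals 1, which is the kind of difference the separate models are meant to capture.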


Other Covariates 


Although we are principally interested in associations between minimum wage 
rates and high school completion rates, we also include a series of time-varying 
covariates in our models to consider the possibility that state education policies 
(as “push” factors) and unemployment rates (as a “pull” factor) condition the 
effects of minimum wage rates. First, we include a measure of states’ maximum 
compulsory age of school attendance; this allows us to test the hypothesis that 
the impact of minimum wage rates varies as a function of whether students may 
legally drop out before age 18 (Chaplin, Turner and Pape 2003). Second, based 
on evidence that course graduation requirements affect dropout rates (Lillard and 
DeCicca 2001), we include measures of the number of Carnegie units that states 
require students to complete in order to graduate. We initially operationalize this 
in terms of whether states require fewer than 18 Carnegie units, 18 to 22 Carnegie 
units, or more than 22 Carnegie units for graduation; we also initially include an 
indicator of whether states have no statewide policy regarding course graduation 
requirements. Third, we include indicators of whether states require students to 
pass an exit examination as a prerequisite for graduation. Based on evidence that 
the consequences of exit examination policies for high school completion rates 
vary as a function of the difficulty level of the test (Dee and Jacob 2006; Warren, 
Jenkins and Kulick 2006), we differentiate between “minimum competency” and 
“more challenging” state high school exit examinations. “More challenging” exit 
exams assess mastery of any curricular material introduced in grade 9 or later. 

Finally, we include a time-varying measure of states’ unemployment rates. Based
on considerable debate about whether minimum wage increases affect the unem- 
ployment rate for youth and adults (e.g., Williams and Mills 2001), we allow for 
the possibility that the relationship between minimum wage rates and high school 
completion rates may be moderated by the unemployment rate. If increasing the 
minimum wage simultaneously causes some students to leave high school in order 
to work more intensively and causes other students to remain in school because 
there are fewer employment opportunities, then analyses of the relationship between 
minimum wage rates and high school completion rates that ignore unemployment 
rates may be biased. We have repeated all of our analyses after omitting the unemployment
rate covariate, and find that the results are substantively the same. The results
from these supplementary analyses are available upon request from the authors. 


Results 


We begin by estimating a series of five regression models of the form shown in 
Equation 1. The five models differ only with respect to how we operationalize 
minimum wage rates. Each model includes a minimum wage rate measure, state 
and year fixed effects, and the time-varying covariates shown in Table 1. Then, we 
tested the assumptions of the regression model by considering a series of diagnostic
measures and plots. We determined that there were 28 cases with excessively
large studentized deleted residuals (absolute values greater than 3), high leverage values (greater
than 3p/n), and/or high values of Cook’s D (greater than .01). We also found that
some of the time-varying covariates were collinear. Our final models exclude these 
28 observations and include a more parsimonious set of time-varying covariates. 
After dropping these 28 observations and re-specifying the models, all of the 
model assumptions seem to be met. 
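
The screening rule described above can be written as a small filter over diagnostics that have already been computed from a fitted model. The function name and argument layout are assumptions; only the three cutoffs (3, 3p/n and .01) come from the text.

```python
def flag_cases(stud_resid, leverage, cooks_d, n_params):
    """Return indices of observations exceeding any of the three cutoffs.

    stud_resid, leverage, cooks_d: parallel lists of studentized deleted
        residuals, leverage values and Cook's D for each observation.
    n_params: number of estimated parameters p in the model.
    """
    n = len(stud_resid)
    leverage_cutoff = 3 * n_params / n
    return [i for i in range(n)
            if abs(stud_resid[i]) > 3
            or leverage[i] > leverage_cutoff
            or cooks_d[i] > 0.01]
```

An observation is flagged if it fails any one criterion, which corresponds to the "and/or" wording above.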

Table 2 includes results from a series of models in which we operationalize 
minimum wage rates in terms of their value in students’ senior year of high school. 
Most prior research has operationalized this focal independent variable in this 
way, but we believe this operationalization is not the only or best way to measure 
minimum wage rates for this purpose. It turns out, however, that our results are 
substantively the same regardless of which of the five measures of minimum wage 
rates we use. Consequently, we do not present results for the other four methods 
of operationalizing minimum wage rates. 

In Model 1 in Table 2 we regress state-level high school completion rates on 
minimum wage rates; a dummy variable indicating whether or not the maximum
compulsory age of schooling is 18; a dummy variable indicating whether
more than 22 Carnegie units are required for graduation; dummy variables
indicating whether states require “minimum competency” or “more challenging” 
high school exit exams; state unemployment rates; and state and year fixed ef- 
fects. High school graduation rates are significantly related to each of the state 
education policies, and in ways that we would expect based on prior research. 
Also in line with prior research, high school completion rates are, on average,
significantly higher in state-years with higher unemployment rates. However, 
the results of Model 1 indicate no statistically significant relationship between 
minimum wage rates and high school completion rates. 

As reviewed above, some researchers (Chaplin, Turner and Pape 2003; 
Ehrenberg and Marcus 1982) find that increases in the minimum wage reduce 
high school completion rates most appreciably in states in which students are 
permitted to drop out of high school at younger ages (in particular, before age 
18). Models 2 through 4 in Table 2 include interactions between state minimum 
wage rates and education policy variables. These results provide no evidence that 
the association between minimum wage rates and high school completion rates is 
more (or less) pronounced when students are compelled to remain in school until 
age 18 (Model 2), to earn 22 or more Carnegie units for graduation (Model 3), 
or pass a state high school exit examination (Model 4). Although we hypothesize 
above that increasing the minimum wage rate may be most consequential for high 
school graduation rates when students face more rigorous graduation require- 
ments, the results in Table 2 indicate otherwise. 

Finally, in Model 5 we allow the effect of minimum wage rates to vary as a 
function of unemployment rates. In all models, the unemployment rate is strongly 
related to high school completion rates, such that completion rates increase as 
the unemployment rate goes up. In other words, relatively high unemployment 
rates seem to keep young people in school while low unemployment rates may 
pull young people out of school. The statistically significant interaction term in 
Model 5 suggests that higher minimum wage rates moderate the relationship 
between unemployment rates and high school completion rates. While students 
generally persist longer in school when unemployment rates are high (perhaps
because there are fewer job opportunities for them if they leave school), they are
less inclined to persist if minimum wage rates are higher (perhaps because the few
jobs available to them pay higher wages).
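
One way to read the interaction in Model 5 is through the implied marginal effect of unemployment. The notation below is an assumption introduced for illustration ($U_{it}$ for the state unemployment rate, $\gamma$ and $\delta$ for its main-effect and interaction coefficients); the remaining covariates and fixed effects of Equation 1 are abbreviated by dots.

```latex
Y_{it} = \alpha + \beta\,\mathrm{MinimumWage}_{it} + \gamma\,U_{it}
       + \delta\,(\mathrm{MinimumWage}_{it} \times U_{it}) + \dots + \varepsilon_{it}

\frac{\partial Y_{it}}{\partial U_{it}} = \gamma + \delta\,\mathrm{MinimumWage}_{it}
```

The pattern described above corresponds to $\gamma > 0$ and $\delta < 0$: completion rates rise with unemployment, but less steeply in state-years where the (mean-centered) minimum wage is higher.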


Discussion 


Does increasing the minimum wage reduce the high school completion rate? 
Previous research has suffered from (1) narrow time horizons, (2) potentially inadequate
measures of states’ high school completion rates, and (3) potentially inadequate
measures of minimum wage rates. Overcoming each of these limitations,
we analyze the impact of changes in minimum wage rates on state high school 
completion rates for the graduating classes of 1982 through 2005. Our state-level 
analyses, which consist of a series of state and year fixed-effects models with con- 
trols for state-year time-varying covariates, provide no support for the argument 
that increasing the minimum wage reduces rates of high school completion. 

The presumption that raising the minimum wage reduces high school comple- 
tion rates is usually based on a particular perspective on how young people make 
decisions about their lives. From this viewpoint, young people weigh the benefits
of remaining in school vs. the opportunity costs of forgoing wage labor until after 
graduation. If the incentive structure shifts because of increased minimum wages, 
then more students will leave school before graduating. While not a definitive test 
of this theoretical position, our results suggest that this logic describes the deci- 
sion making of, at most, a small minority of students (or at least that the calculus 
that students use to weigh costs and benefits may vary across groups of students). 
In the end, raising the minimum wage does not result in detectable changes in 
high school graduation rates. These results are thus in line with other theoretical 
perspectives that emphasize the complex developmental, social and institutional 
factors that “push” or “pull” students out of school. From these perspectives, rais- 
ing the minimum wage would only induce students to leave school under very 
specific (and perhaps relatively rare) circumstances. 

Various economic, social and political considerations underlie decisions about 
whether to raise the minimum wage. There may be sound reasons to raise the minimum
wage, and there may be valid reasons for opposing such a move. However,
the empirical evidence does not support the argument that raising the minimum
wage reduces graduation rates.


Notes 


1. While this may seem like a purely technical critique of the literature, it is also supported 
by recent research on how young people make plans about their educational futures 
(e.g., Morgan 2005). This work supports the notion that changes to the minimum wage 
in one year will not be an overriding factor in students’ decision making in that year. 


2. The ECR is computed by dividing the number of regular diplomas issued by a state’s
public schools in the spring of academic year X by an estimate of the number of
students at risk of receiving those diplomas. The estimate in the denominator begins
with the number of 8th graders in academic year X-4 and then adjusts that figure to
account for migration into and out of the state over the intervening years. See Warren
(2005) for details.


3. As detailed by Warren (2005), despite the fact that it overcomes the major biases
inherent in alternate CCD-based measures, the ECR is nonetheless still systematically
biased by a number of factors. The magnitude of these biases is large only under
very unusual circumstances. These factors include 8th grade retention, the large scale
movement of young people into or out of private schools between the 8th and 9th
grades, and the immigration of large numbers of young people who never enroll in
American schools.


4. Of course these measures do nothing to account for within-state variation in districts’
or schools’ own course graduation requirements. There is nothing that we can do to
account for this limitation given the structure of our data.


5. While omitting these 28 observations modestly reduces generalizability, it improves 
our confidence that our results accurately pertain to the larger set of 1,196 cases. 
The results obtained from the full set of 1,224 cases are not valid for any state-years
because the assumptions of the regression model are not met. 


6. Technically, what we found is that increases in the minimum wage are not associated
with changes in high school completion rates within the range of minimum wage rates
that we have observed. It may be that at some higher minimum wage rate, students
might be pulled out of high school. There may be some tipping point at higher
wage rates (which exceed current minimum wage rates) that would lead substantial
numbers of students to leave school.


Latino Employment and Black Violence: The Unintended
Consequence of U.S. Immigration Policy 


Edward S. Shihadeh, Louisiana State University
Raymond E. Barranco, Louisiana State University 


U.S. immigration policies after 1965 fueled a rise in the Latino population and, thus, 
increased the competition for low-skill jobs. We examine whether Latino immigration 
and Latino dominance of low-skill industries increases black urban violence. Using 
city-level data for the year 2000, we find that (1) Latino immigration is positively
linked to urban black violence, (2) the link is most prevalent where blacks lost ground
to Latinos in low-skill markets, (3) not all low-skill sectors operate in unison; black
violence rises only when jobs in agriculture, manufacturing and construction are in
short supply, and (4) Latino immigration raises black violence by first increasing black
unemployment. We discuss the implications of these findings. 





Several schools of thought identified the structural conditions responsible for the 
decline of inner cities and the rise of black urban violence. Among them is Wilson's 
(1987) seminal work on how the flight of the black middle class away from center 
cities effectively isolated those residents left behind. The resulting social isolation 
contributes to a range of social problems from black infant mortality (Hummer 
1993) to black homicide (Shihadeh and Flynn 1996; Krivo and Peterson 2004; 
Shihadeh forthcoming; Shihadeh and Bisciglia forthcoming). During the same 
period, Massey and Denton (1988) identified racial segregation as a highly perni- 
cious form of structured inequality, one that effectively isolates blacks from white 
mainstream opportunities which, ultimately, raises the rates of black violence 
(Peterson and Krivo 1993; Shihadeh and Maume 1997). Indeed, an expanding 
literature links a variety of structural factors to black crime. Some of the causes, 
such as inequality (Blau and Blau 1982; Harer and Steffensmeier 1992), joblessness
and family structure (Sampson 1987; Shihadeh and Steffensmeier 1994),
have an established history in sociological thought. 

But another constraint to black mobility may be a consequence of U.S. im- 
migration policy. The Immigration Act of 1965 opened the door to non-European 
immigrants, particularly those of Asian and African descent (Daniels 2004). Over 
the years, this raised the concerns of an increasingly nativist U.S. population, lead- 
ing to the highly restrictive Immigration Reform and Control Act of 1986 (Massey, 
Durand and Malone 2003). While apprehension rates of violators increased as 
a result of intensified border control, paradoxically, many Latino immigrants 


This research was funded by the Manheim Research Fellowship at the Louisiana State University. 
The authors wish to thank the anonymous reviewers for their suggestions. Direct correspondence 
to Edward S. Shihadeh, 126 Stubbs Hall, Department of Sociology, Louisiana State University. 
E-mail: edsoc@lsu.edu. 


© The University of North Carolina Press Social Forces 88(3) 1393-1420, March 2010 




already in the United States felt compelled to settle here permanently. This fueled 
a major demographic shift in the ethnic/racial composition of the United States, 
one that looms potentially large for low-skill blacks. As the number of low-skill 
Latinos residing in the United States grew, this invariably increased the pressure 
on the supply of low-skill jobs which is critical for the mobility of blacks (Kasarda 
1993; Wilson 1996; Shihadeh and Ousey 1996, 1998; Parker and McCall 1999;
Wadsworth 2004; Weiss and Reid 2005). Because even a minor drop in the rela- 
tive supply of low-skill jobs may affect the economic prospects of blacks, Latino 
immigration and black mobility seem inexorably linked in the United States. The 
tenor of the public discourse seems mired in an unfortunate mix of controversy 
and silence as the respective groups feel put upon. On the one hand, Latinos 
consider themselves unfairly blamed by hearsay and opinion, while on the other 
hand, blacks feel victimized and elbowed out of critical job markets (Wood 2006). 
But low-skill Latinos have a significant advantage in the competition for low- 
skill jobs. They have a low reserve wage (Wilson 1996), they are viewed favorably 
by prospective employers (Yoon 1997), they are embedded in established networks 
which facilitate employment (Fernandez-Kelley 1995; Aguilera and Massey 2003)
and streamline migration (Aguilera and Massey 2003), and they have a strong 
tradition of entrepreneurship which is invoked to hire their own (Bailey 1987; 
Hansen and Cardenas 1988). As these factors combine to make Latinos more 
competitive in a job market occupied traditionally by blacks, it thickens an already 
dense cluster of disadvantages to black social mobility. This analysis examines 
whether the link between Latino immigration and black violence is due to a shift 
in low-skill job markets towards Latino labor. 


Background 
Structural Disadvantage and Urban Black Life 


The Civil Rights Acts of the 1960s and the expansion of the public sector of the
economy accelerated a bifurcation in black America. These changes fueled the mo- 
bility of the black middle class who then moved away from inner cities, effectively 
removing the vertical integration of the neighborhoods they left behind (Wilson 
1987). Whether this accelerated in the 1970s (Massey and Eggers 1990), or in the 
1980s (Jargowsky 1996) as some claim, in any case, this bifurcation decoupled
the black underclass from mainstream life and segregated them from positive role 
models who demonstrate that steady employment is a reasonable alternative to 
a life of poverty and despair (Massey and Denton 1998). The consequences for 
crime in black inner cities were inevitable (Shihadeh forthcoming; Shihadeh and 
Bisciglia forthcoming; Shihadeh and Flynn 1996; Krivo and Peterson 2000). 
The disorganization of inner cities was first articulated by the heuristic models
of the early Chicago school. During the formative years of that research tradition, 
low-skill jobs were plentiful, and so the attendant aspect of disorganization was 
not joblessness per se, but population turnover and migration. As one group succeeded
and moved outward, it was replaced by a successive wave that replicated
this process of assimilation and mobility (McKenzie 1933; Kasarda 1993). But 
following a post-WWII boom, the supply of these jobs began to dwindle. Wilson 
(1996), an inheritor of the ecological tradition, argues that during the 1970s and 
early 1980s, the major disorganizing influence in black inner cities was the loss 
of entry level jobs, primarily in manufacturing. Perhaps as much as 70 percent of 
private sector manufacturing and other blue collar jobs were lost from 1970 to 
1981 (Bluestone and Harrison 1982). To be sure, not all low-skill sectors declined; 
the service sector, for example, expanded in the new economy (Bluestone and 
Harrison 1982), particularly personal services that spawned from globalization 
and the resulting expansion of technology, information and finance industries 
(Sassen 1998). Nonetheless, when intermixed with discrimination, segregation 
and the flight of the black middle class, this substantial loss of suitable employ- 
ment transformed already vulnerable inner cities into institutional ghettos. 


Latinos and U.S. Immigration Policy 


Bearing down on this vulnerability is a population shift fueled by an unintended 
consequence of U.S. immigration policy. Immigration control from 1900 to 
the present vacillated between periods of increased and decreased enforcement 
(Massey, Durand and Malone 2003). But there are critical milestones in the story. 
Massey and colleagues note that pre-1986 immigration levels were a balance large- 
ly between economic push factors in sending countries, labor pull in the United 
States, and the rational choice calculus of potential movers. They challenge the 
view that immigrants, like any “sensible” people, all choose to settle permanently 
in the United States for a better life. On the contrary, their Mexican Migration 
Project reveals that many immigrants move to the United States temporarily in or- 
der to finance projects back home, like a house or a business (Durand and Massey 
2006). As a Presbyterian missionary from Mexico's Chiapas region said to us: 


“We [Mexicans] don’t want to stay permanently in the United 
States. We just want make a bit of money and then come home 
to our communities, where our families are.” 


~ Pablo Feliciano Cruz (Missionary, Hebron USA) 


In other words, immigration levels have been regulated in part by the temporary 
economic needs of individual migrants. 

But when the Immigration Act of 1965 choked off the legal avenues for 
Mexican migration, illegal avenues picked up the slack. Reflecting the civil rights 
mood of the era, the 1965 legislation diversified the ethnic and racial composition 
of the migrant stream to the United States. The centerpiece of the legislation was
a 20,000 persons-per-country limit that for the first time placed formal limits on
the number of Latinos entering the United States (Glazer 1980). Consequently,
the next 20 years saw a dramatic rise in undocumented migration. 

But in 1986, riding a wave of U.S. nationalism, the Reagan administration 
redefined immigration as a national security issue in order to signal that the nation 
is being protected in an era of communist insurgency in Latin America (Massey, 
Durand and Malone 2003). Consequently, the Immigration Reform and Control 
Act was enacted which dramatically expanded border control and criminalized the 
hiring of undocumented workers (Calavita 1992). Suddenly, it became too risky 
for many undocumented immigrants in the United States to go home for fear 
that they would not be allowed to return (Massey, Durand and Malone 2003). 
As a result, the United States population of Latinos more than doubled from 14.5
million in 1980 to 37.4 million in 2002 (Ramirez and De la Cruz 2002), and Latinos now
rank as the largest minority in the country.1

This expanding Latino population is well represented in low-skill job markets. 
Of course, automatically equating Latinos with low-skill jobs is a gross misrepresentation,
for as Bean, Leach and Lowell (2004) note, Latinos are represented
strongly in middle-tier job sectors as well. Nonetheless, “Latino male immigrants
accounted for almost one half of the growth in low-quality jobs” from 1990 to
2000 (Bean, Leach and Lowell 2004:511). There are several reasons for their 
success in low-skill markets. First, Latinos have high rates of entrepreneurship 
(Bailey 1987; Light and Gold 2000), which gives Latino job seekers an inside 
track because ethnic employers are more likely to hire their own (Bailey 1987; 
Hansen and Cardenas 1988). This tradition of entrepreneurship accrues indirect 
advantages as well; when large cities have substantial numbers of self-employed 
Mexican immigrants, the gains spill over to non-self-employed Mexican immi- 
grants as well (Spener and Bean 1999). Second, Latinos also work for lower wages 
(Wilson 1996), perhaps as much as $3,900 a year less than blacks (U.S. Bureau 
of the Census-CPS 2002). However, this lower reserve wage may be partly invol- 
untary because their pay declined after the passage of the Immigration Reform 
and Control Act in 1986 (Cobb-Clark, Shiells and Lowell 1995; Fry, Lowell and 
Haghighat 1995). Third, prospective employers hold favorable views of low-skill 
Latino laborers in contrast to other low-skill workers (Waldinger 1997; Johnson- 
Webb 2002; Waldinger and Lichter 2003). Fourth, Latino immigrants leverage 
the network power of their ethnic enclaves (Shihadeh and Barranco forthcoming 
a). These networks serve as efficient conduits of information about prospective 
employment (Fernandez-Kelley 1995; Aguilera and Massey 2003). Aguilera and 
Massey (2003:674) point out that “friends and relatives may assist migrants by 
providing them with useful information: where to look for jobs, what wages 
to ask for, and which sorts of jobs and worksites to avoid,” which reduces the 
uncertainty and economic costs of moving. So in essence, many Latinos are part
of a job-information system that helps translate geographic mobility into social 
mobility. In contrast, such networks are less well established among working and 
low-skill blacks (Stack 1974), who face significant barriers when invoking personal
contacts to secure jobs (Holzer 1987; Waldinger and Lichter 2003). 


Immigration, Low-skill Jobs and Crime 


Nearly 100 years ago Northeast cities experienced an influx of immigrants from 
Eastern Europe (Rubin 1941) and migrant blacks from the American South 
(Grossman 1991). Because many of the new arrivals settled in high crime areas 
(Shaw and McKay 1942), they were met with time-honored fears about the “im- 
migrant problem” (Leuchtenberg 1958) that still resonate today. But despite the 
presumed links between immigration and crime, the empirical evidence is wide 
ranging and not consistent. Some find a link (Miller 1990; O’Kane 1992; Tanton 
and Lutton 1993; Borjas, Grogger and Hanson 2006) but others do not (Martinez 
and Lee 1998; Lee, Martinez and Rosenfeld 2001; Reid et al. 2005; Martinez and 
Valenzuela 2006). It is difficult to reconcile the inconsistencies in findings given the 
wide variation in research designs. There is no one “immigration-crime” link any
more than there is one type of immigrant or one type of job or one type of crime. 
For example, some research (e.g., Reid et al. 2005) examines overall immigration, 
not Latino immigration specifically, thus not registering ethnic-specific processes. 
Some research (e.g., Lee, Martinez and Rosenfeld 2001) offers rich and valuable 
detail but for only a circumscribed sample in a few cities and, thus, may not be 
capturing the full temporal and geographic range of this phenomenon. Also, and 
most important, none of the prior immigration-crime research simultaneously ac- 
counts for changes in the composition of low-skill labor markets, which may be 
an important component of the immigration-crime link. In the present study, we 
ask the specific question of how an unintended influx of Latino migrants flooded 
low-skill markets, thus increasing urban black violence. Indeed, a recent analysis by 
Shihadeh and Barranco (forthcoming b) finds that in rural areas, low-skill Latino 
immigrants are displacing non-Latino whites, which in turn is raising white violence. 

Overall, immigration has yielded generally positive economic outcomes for 
the United States. Muller (1993), for example, finds that overall immigration was 
critical in building the U.S. economy and accounts for why the immigrant-rich 
American North developed faster than the immigrant-scarce American South. But 
the effects are more nuanced when considering specific resident populations such 
as blacks. Adelman et al. (2005) report that overall immigration had a negative 
effect on earnings of low-skill black workers, but a positive effect on middle-class 
blacks. Rosenfeld and Tienda (1999) also find this class-selective effect; the lowest 
paid black maids were displaced by an influx of immigrants, but this was offset 
by an immigrant-induced rise in wages among the remaining black maids. But 
a detailed analysis by Borjas (2003) finds a demonstrable wage decline for the 
least educated workers from 1980 to 2000 (see also Reimers 1998). This drop in 
wages occurs because the least-skilled blacks must compete in a market increas- 
ingly saturated with immigrants (Lim 2001) who effectively leverage their ethnic 
enclaves to compete for low-skill jobs (Waldinger 1997). Whether low-skill jobs
are actually a good thing is subject to debate. Using dual labor market theory, 
Crutchfield (1989) argues that primary jobs (high skill) provide an implicit con- 
tract between employer and employee that each will invest in the other over the 
long term. This investment in conformity has a social control effect on each party 
and on the community as a whole. In contrast, secondary jobs invoke few stakes 
in conformity and are, thus, potentially destabilizing on a wide scale. On the other 
hand, much of the research conforms to the core of Wilson's (1996) thesis, that the 
dramatic loss of low-skill jobs tore the organizational fabric of black inner cities 
and, thus, increased overall crime (Shihadeh and Ousey 1996, 1998; Wadsworth 
2001; Weiss and Reid 2005) and youthful crime (Allan and Steffensmeier 1989). 

This connection between economic hardship and crime began with the notion 
that economic deprivation creates motivated offenders (Bonger 1916; Cantor and 
Land 1985). But as Shihadeh and Ousey (1998) suggest, economic motivation is 
insufficient as a stand-alone principle to explain crime. Indeed, individual-level 
interpretations of strain theory have fallen short of empirical expectations and for
many, such as Sampson and Wilson (1995:45), the “image of the offender stealing
to survive flourishes only as a straw man.” Individual-motivational theories
also suffer from a reductionist fallacy, that aggregate processes are merely the sum 
of individual social-psychological mechanisms. Macro theorizing should neither 
appeal to individual-level mechanisms nor require that offenders possess the char- 
acteristics of aggregate predictors. In conceptual terms this means that offenders 
themselves need not be poor or feel hostility because of inequality. 

Dominating macro-level thinking is the social disorganization perspective, which 
places poverty as one of its key structural determinants of crime (Taylor 1996). 
Specifically, widespread economic deprivation increases crime by reducing both the 
social control capacity of the community and, by implication, the systemic capacity 
to combat crime (Kasarda and Janowitz 1974). The disorganizing capacity of eco- 
nomic deprivation operates through unemployment (Sampson 1987; Shihadeh and 
Ousey 1998), through poverty (Messner and Tardif 1986) and through inequality 
(Harer and Steffensmeier 1992; Shihadeh and Steffensmeier 1994; LaFree and Drass 
1996; Shihadeh and Shrum 2004; Hipp 2007). In its variety of forms, depriva- 
tion weakens the formal and informal capacity of communities to cultivate main- 
stream norms that delineate acceptable conduct. We see common ground between 
this heuristic framework and the structural interpretation of Relative Deprivation 
Theory that views widespread job loss as a system disjuncture that can delegitimize 
conventional norms that restrict violence (Messner 1988; Harer and Steffensmeier 
1992). Adherence to such norms, particularly those that restrict the use of violence, 
is a membership requirement of mainstream life and its institutions. Indeed, the 
conceptual centerpiece of Harer and Steffensmeier’s work on inequality and crime
is that when access to these institutions is cut, so is an important conduit by which 
mainstream norms are cultivated and maintained. 
Prominent in this ecological tradition is Wilson’s (1987) description of how the
personal experience of the underclass is shaped by the concentration of poverty. 
Poverty became concentrated after working- and middle-class blacks moved away 
from inner cities and after the loss of manufacturing and other jobs upon which poor 
blacks relied (Wilson 1996). The normative implications of this are demonstrated 
forcefully in Anderson’s (1993) landmark research in underclass ghettos. Young 
underclass men, unable to support a family or otherwise demonstrate their worth 
in society, turn to peers who offer more achievable goals, like fostering an image 
based on toughness and the ready willingness to use violence. In the stark existence 
of inner cities, this image eventually gains appeal and snowballs into a broad cultural 
influence with important symbolic value for other members of the group. Thus, 
severing disadvantaged blacks from a major institution such as the (low-skill) labor 
market has wide ranging effects, far beyond a single, motivated offender. 

We examine the link between Latino immigration and black crime in four 
stages. First, we examine the direct, overall link between Latino immigration and 
black crime. Second, we test whether Latino immigration increases black crime 
by shifting the ethnic composition of low-skill labor markets in Latinos’ favor.
Third, to refine our understanding of this macro-social mechanism, we examine 
this process for each major industrial sector (Agriculture, Manufacturing and 
Construction, etc.). Finally, we consider whether Latino immigration (1. reduces 
black income, (2. increases black unemployment, (3. removes blacks from the
labor force altogether or, (4. bumps up blacks to higher-skill jobs.


Methods 


We consider 117 major cities with at least 100,000 people, at least 5,000 blacks and 
5,000 Latinos, and for which there is racially disaggregated crime and census data 
available in 2000.2 City-level socioeconomic indicators are from the 2000 U.S. Census. 

To strengthen our inferences, we obtain homicide counts from both the FBI’s 
Supplementary Homicide Reports and the National Vital Statistics System’s 
Multiple Cause-of-Death Mortality Data. The SHR reports offender counts by race, 
estimated from race-specific arrest data. But since not all offenses lead to an arrest, 
the race of the offender is not always known. To address this, we adhere to Fox’s 
(2004) convention of assigning race to unknown cases based on the racial distribu- 
tion of known offenders within the state. Despite this algorithm adjustment, the 
missing (imputed) data is likely linked to race and so, for that reason, we strengthen 
our inferences using the vital statistics of coroner death certificates of black homicide. 
Arias and Tejada-Vera (2008) find that the race and Hispanic ethnicity reporting on 
death certificates does not suffer from serious problems of misclassification. 
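
The allocation of unknown-race offenders can be illustrated with a minimal sketch. It assumes a simple proportional rule (unknown cases in a city are distributed according to the state-level shares of known offenders), which may differ in detail from the procedure Fox (2004) actually describes. The function name and all counts are hypothetical and are not data from this study.

```python
def impute_unknown_race(city_known, city_unknown, state_known):
    """Spread a city's unknown-race offenders across racial groups in
    proportion to the state's distribution of known offenders.

    This is an illustrative proportional rule, not the published algorithm.
    """
    state_total = sum(state_known.values())
    if state_total == 0:
        raise ValueError("state has no known offenders to base shares on")
    adjusted = {}
    for race, state_count in state_known.items():
        share = state_count / state_total  # state-level share of known offenders
        adjusted[race] = city_known.get(race, 0) + city_unknown * share
    return adjusted


# Hypothetical counts, for illustration only.
state_known = {"black": 600, "white": 300, "other": 100}
city_known = {"black": 40, "white": 15, "other": 5}
adjusted = impute_unknown_race(city_known, city_unknown=20, state_known=state_known)
print(adjusted)
```

Because the allocation leans on the observed racial distribution, any race-linked pattern in which cases go unsolved carries over into the adjusted counts, which is the reason a second data source is useful.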

To dampen the year-to-year fluctuations in homicide, the counts (for both data 
sources) are averaged and pooled over three years, 1999, 2000 and 2001. Because 
these data have many zero or near-zero observations, we estimate them using 
negative binomial regression; OLS is inappropriate in such cases (Osgood 2000; 
Osgood and Chambers 2000). Finally, we use Cook’s D to test for the influence
of outliers and find the values are all well below 1 (the highest is .467). 


Latino Immigration and Low-skill Job Markets 


Our first substantive independent variable is Latino Immigration. This is the 
proportion of Latinos in a given city who were not born in the United States.3
It does not include undocumented migration. We considered adjusting for this 
but found no defensible algorithm for making inter-city adjustments. Existing 
estimates of undocumented migration are rough, national-level figures which are 
of limited value at the city level. More importantly, if excluding undocumented 
migrants underestimates the effect of Latino immigration on black crime, then 
ours is a more conservative test. 

Our second substantive independent variable reflects the race/ethnic composi- 
tion of the low-skill job market. We identify low-skill jobs in several ways. First, 
following the convention in the literature, low-skill industrial sectors are those dominated
by poorly educated individuals. We use the Career Guide to Industries of the
U.S. Bureau of Labor Statistics (2006) to crosstabulate industrial sectors with the
mean years of educational attainment. Low-skill industries are those where more
than 50 percent of individuals (ages 25 and older) working in that sector lack a
high school diploma. The resulting nine (pooled) sectors are: Agriculture, Forestry,
and Fishing; Mining; Construction; Manufacturing; Transportation; Retail Trade; 
Accommodation and food services; Other services; and Waste Management. This is 
close to Kasarda’s (1993) classification, who likewise finds that education, though 
an imperfect measure of skill, comes close to operationalizing the concept (see also 
Bound and Freeman 1990; Moss and Tilly 1991). 
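
The classification rule can be sketched in a few lines. The threshold of 50 percent comes from the text; the function name and the sector shares below are hypothetical placeholders, not figures from the Career Guide to Industries.

```python
def classify_low_skill(share_no_diploma, threshold=0.50):
    """Return the sectors in which more than `threshold` of workers
    (ages 25 and older) lack a high school diploma."""
    return sorted(
        sector
        for sector, share in share_no_diploma.items()
        if share > threshold
    )


# Hypothetical shares of workers without a high school diploma.
hypothetical_shares = {
    "Construction": 0.56,
    "Mining": 0.52,
    "Retail Trade": 0.51,
    "Finance": 0.08,
}
low_skill = classify_low_skill(hypothetical_shares)
print(low_skill)
```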

The composition of low-skill markets is computed by comparing the odds of
blacks being in low-skill jobs to the odds of Latinos being in low-skill jobs. This
is simply:

$$\frac{B_{xi}/B_{yi}}{L_{xi}/L_{yi}}$$

where $B_{xi}$ is the number of blacks in low-skill jobs in city $i$ divided by $B_{yi}$, the
number of that city’s blacks of working age not in low-skill jobs.4 This ratio is
then divided by the corresponding ratio for Latinos, $L_{xi}/L_{yi}$. The result is a ratio
of ratios, which is equivalent to the well-known odds ratio or cross product ratio
(Feinberg 1981). This converts algebraically to:

$$\frac{B_{xi} \, L_{yi}}{B_{yi} \, L_{xi}}$$

The standard convention is to take the natural log of this ratio so that it is centered
on 0 and that a given ratio and its inverse are exactly symmetrical above and below
zero (see Clogg and Shihadeh 1994 for more detail).5 A value of 0 means that in a
particular city the proportion of blacks in low-skill jobs (vs. not in low-skill jobs)
is the same as it is for Latinos. Values greater than 0 mean blacks are more likely 
than Latinos to be in low-skilled jobs, while values lower than 0 mean the reverse. 
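
A small sketch may make the measure concrete. It computes the log of the cross product ratio defined above for one city; the function and argument names are illustrative, and the counts are hypothetical rather than drawn from the census data.

```python
import math


def low_skill_composition(b_low, b_not, l_low, l_not):
    """Natural log of the odds ratio (b_low / b_not) / (l_low / l_not).

    b_low, b_not: blacks in and not in low-skill jobs in a city.
    l_low, l_not: Latinos in and not in low-skill jobs in the same city.
    """
    odds_ratio = (b_low * l_not) / (b_not * l_low)
    return math.log(odds_ratio)


# Hypothetical city: Latinos are more concentrated in low-skill jobs.
value = low_skill_composition(b_low=3000, b_not=7000, l_low=5000, l_not=5000)

# Swapping the two groups flips the sign but keeps the magnitude,
# which is the symmetry property the log transformation provides.
mirror = low_skill_composition(b_low=5000, b_not=5000, l_low=3000, l_not=7000)

print(value, mirror)
```

A negative value here corresponds to the case in which Latinos are more likely than blacks to hold low-skill jobs, and equal odds for the two groups give exactly 0.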

This measure is expressed as Low-skill Composition and, based on our expec- 
tations, should have a negative effect on blacks’ homicide counts—meaning that 
when low-skill composition declines (and Latinos are more likely than blacks to be 
in low-skill jobs), black homicide will increase. This measure is derived by combining
the nine sectors that are defined as low skill. But for added measure, we derive
a more restrictive definition of low skill, one that combines only those four (out
of nine) sectors with the very lowest educational attainment: Agriculture, Forestry,
and Fishing; Mining; Construction; and Accommodation and food services. We
term this Low-skill Composition (Extreme).

Finally, to examine inter-industry differences in low-skill jobs, we derive a 
Low-skill Composition separately for these four major low-skill industry groups: 
Agriculture, Forestry, and Fishing; Manufacturing and Construction; Service; and 
other.6 We group the sectors this way for several reasons. Agriculture, forestry and
fishing are distinct because they are grouped that way in SF4 files and because of 
the historical involvement of Latinos in agriculture. Manufacturing and construc- 
tion are combined because they yield the same results in our models, and because 
these are considered “good” jobs in that many offer reasonable pay and benefits. 
We separated out service because it, too, produced distinct results, likely for the 
reasons outlined in the literature review. 

Of course, the competition for low-skill jobs depends partly on the baseline 
capacity of the low-skill job market itself. The pressure on these jobs is greater 
when they represent a small proportion of the overall job market, and is reduced 
when they are plentiful. Therefore, we control for overall labor market capacity by 
determining the proportion of all available jobs that are low skilled (Proportion
of Jobs Low-skilled).


Other Independent Variables 


To avoid multicollinearity from the combined use of disadvantage measures at 
the macro level, we use principal components factor analysis to derive Black 
Economic Deprivation. This is an index comprising the proportion of blacks that 
are (1. in single-headed households, (2. living below the poverty line, (3. without 
a high school diploma, and (4. unemployed.7 Because high density may increase
the opportunity for interpersonal violence, we include Housing Density, the 
proportion of all housing units in clusters of five or more. As measures of social 
disorganization we include Moved which is the proportion of the population not 
in their current houses in 1995, and Vacant Houses which is the proportion of 
all housing units that are vacant. West is a dummy variable that identifies western 
cities (West = 1).8 We control for the Proportion Black in each city as well as the 
Latino Population. Given the age and crime link, we include the Median Age of 
Table 1: Descriptive Statistics of City-level Variables for Blacks in Major
U.S. Cities, 2000

                                                       Mean    Standard Deviation
Blacks less than high school#                           .23       [illegible]
Black unemployment                                      .11           .04
Black single-headed households                          .30           .06
Blacks in poverty                                       .24           .08
Black median income                                31,844.7       9,553.8
Black median age                                      29.15       [illegible]
Black out of labor force                                .36           .07
Blacks in high-skill jobs                               .29           .05
Blacks in education, health and social services       25.40          5.47
Black vital statistics deaths rate*                   64.13         61.08
Black Supplementary Homicide Report rate              32.76         23.58
Black incarceration rate                             360.31        325.53
Latino population                                    95,785       245,825
Proportion black                                        .20           .16
Proportion Latino immigrant                             .39           .16
Region (west = 1)                                 [illegible]     [illegible]
Housing density                                   [illegible]         .10
Vacant houses                                           .07           .03
Moved                                                   .50           .06
Proportion of jobs low-skilled                    [illegible]         .07
Low-skill Composition
  All low-skill                                        -.50           .26
  Extreme                                              -.85           .46
  Agriculture                                         -1.33          1.16
  Manufacturing/Construction                           -.77           .40
  Service                                              -.29           .21
  All other                                             .06           .24
N = 117

# Black Economic Deprivation measures (high school, unemployment, single-headed
households and poverty) are listed separately for descriptive purposes. In the multivariate
analysis they are combined into a single factor. See methods section for more detail.
* Homicide is expressed as a rate in the descriptive table, but counts are used in the
negative binomial regression.


Blacks. Finally, we include the Proportion of Blacks in Education, Health, and 
Social Services because of their highly successful inroads in this sector (Waldinger 
and Lichter 2003). Black participation in this higher-skill sector may partially 
offset losses in low-skill sectors. Also, the mass incarceration of blacks not only 
removes them from the labor market (thus opening the way for Latino labor), but 
the felony conviction makes them less employable regardless of the trends in the 
Latino population. Therefore, we control for the rate of black incarceration which 
is the proportion of black males 15 and older who are incarcerated. In the final 
stage of the analysis, we delve more deeply into the indirect economic mechanisms 
that mediate the link between Latino immigration and black violence. In this 
stage, we consider the Black Median Income in 1999, the Proportion of Blacks
in High-Skill Jobs, the proportion of Blacks Out of Labor Force, and the Black
Unemployment rate.9


Results


The descriptive findings are in Table 1 (see also Appendix 1 for the correlation ma- 
trix and Appendix 2 for a list of cities with their measures on low-skill composition 
and black violence). The results in Table 1 show that on average about 39 percent 
of Latinos in major cities are immigrants. The variability in that mean is somewhat 
limited—the standard deviation (.16) is well below the mean of .39. Taken together 
this implies that while most Latinos were born in the United States, immigrants are 
nonetheless a prominent feature across the major cities in our sample. Latinos are 
also more likely than blacks to be in low-skill jobs. Low-skill Composition is -.50 
which, being negative, means that Latinos are more inclined to be in low-skill jobs 
than are blacks. Stating this log-odds ratio as a simple odds of Latinos vs. blacks
(1/antilog[-.50] = 1.65) means that, compared to blacks, Latinos are 1.65 times as
likely to be in low-skill jobs. And despite the variability in this measure (.26), less 
than 3 percent of cities in our sample had a low-skill composition greater than 0. In 
other words, in almost every major U.S. city that fits our selection criteria, Latinos 
are more oriented toward low-skill jobs than are blacks. This trend is even more 
pronounced in the extreme low-skill sectors—that subset of low-skill industries with 
very low levels of education. For those sectors, Low-skill Composition (Extreme) is much
lower at -.85, which after conversion to a simple Latino-vs.-black odds gives 2.34.
In other words, when compared to blacks, Latinos are nearly two and a half times as
likely to be in the very lowest skill jobs. Thus, Latinos not only dominate low-skill 
jobs overall, they are even more firmly entrenched in the nadirs of that industry. 
Latinos are also nearly four times as likely to be in Agriculture, Forestry and Fishing
(1/antilog[-1.33] = 3.78) and more than twice as likely to be in Manufacturing/
Construction occupations (1/antilog[-.77] = 2.16). Even in the service sector they
have a slight margin over black workers (1/antilog[-.29] = 1.33). But this is not the
case in the “other” category (.06). 
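
The conversion from a log-odds ratio to a simple Latino-vs.-black odds can be reproduced with a short sketch. It takes the inverse of the antilog (the exponential) of selected Low-skill Composition means from Table 1; the function and variable names are illustrative only.

```python
import math


def latino_vs_black_odds(log_odds_ratio):
    """Invert a black-vs.-Latino log-odds ratio into Latino-vs.-black odds."""
    return 1.0 / math.exp(log_odds_ratio)


# Selected Low-skill Composition means as reported in Table 1.
table1_means = {
    "All low-skill": -0.50,
    "Extreme": -0.85,
    "Agriculture": -1.33,
    "Manufacturing/Construction": -0.77,
}

odds = {
    name: round(latino_vs_black_odds(value), 2)
    for name, value in table1_means.items()
}
for name, value in odds.items():
    print(f"{name}: {value}")
```

Since the log-odds ratio is symmetric around zero, the same result follows from taking the antilog of the value with its sign reversed.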

Additional results conform to expectations about urban black problems. For 
example, about a quarter (.23) of blacks 25 and older have not completed high 
school, and about a third of black households are headed by a single parent (.35).
Interestingly, while nearly a quarter of blacks fall below the poverty line (.24), only 
11 percent are unemployed, meaning that many are actually the working poor. 
The black murder offense rate (SHR) in these cities is high (32.76) as are murder 
victimizations (64.13) compared to the national UCR rate of 5.5 per 100,000 in 
the year 2000 (U.S. Department of Justice 2006). 

Table 2 reports the results from the negative binomial regression using SHR 
and Vital Statistics data. Findings show that Latino immigration is positively associated
with black violence, both from an offending (Model 1 = 1.30) and victimization
(Model 2 = 1.24) standpoint. In other words, the more immigrants there
are among the city’s Latino population, the greater the level of black homicide. 


For ease of interpretation, we convert selected coefficients to a proportion using
$e^{bs} - 1$, where $b$ is the coefficient (1.30 for Model 1 using SHR data) and $s$ is the
standard deviation of Latino immigration (.16, from Table 1). Since
$e^{bs} - 1 = .19$, a one standard deviation rise in Latino immigration is associated
with a .19 standard deviation rise in (SHR) black homicide. There is a similar
Model 3, for example, predicts black murder offending (SHR) while controlling
for low-skill composition. When including this control, there is no significant
link between Latino immigration and black violence. Indeed the Low-skill
Composition coefficient of -1.41 is significant at the .01 level. Converting this 
to a proportion gives -.31, meaning that when low-skill markets shift toward 
Latinos by one standard deviation, black violence rises by nearly a third. Likewise 
in Model 4, which predicts black murder victimization (Vital Statistics), the Low- 
skill Composition is -1.62, which converts to an even greater proportion of -.34. 
And we find the same substantive result when examining the four lowest skill
sectors: Models 3 and 4 yield coefficients for Low-skill
Composition of -.80 and -1.14 which, respectively, convert to proportions
of -.31 and -.41. Overall, these results confirm a positive link between Latino 
immigration and black violence, and that association is accounted for by shifts in 
the ethnic composition of low-skill markets towards Latino labor. 
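
The coefficient-to-proportion conversion used throughout this section can be sketched as follows. The function simply evaluates exp(b * s) - 1 for a log-link count model; the inputs in the example are hypothetical values chosen for illustration, not figures from Table 1 or Table 2.

```python
import math

def proportional_change(b: float, s: float) -> float:
    """Proportional change in the expected count for a one-SD rise in a predictor.

    In a log-link count model the expected count is multiplied by
    exp(b * s) when the predictor rises by s, so the proportional
    change is exp(b * s) - 1.
    """
    return math.exp(b * s) - 1.0

# Hypothetical inputs for illustration only (not taken from the tables).
b_example = -1.5   # regression coefficient
s_example = 0.25   # standard deviation of the predictor

change = proportional_change(b_example, s_example)
print(f"One-SD change multiplies the expected count by {1 + change:.3f}")
```

Because the conversion depends on the product of the coefficient and the standard deviation, two coefficients of different size can convert to similar proportions when their predictors have different spreads.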

Other aspects of urban labor markets also predict black violence. When the
overall job market is dominated by low-skill jobs, black homicide offenses decline
(-4.27, Model 1) as does victimization (-4.80, Model 2). Converting these to
proportions yields .26 and .29, respectively, underscoring the importance of low-skill jobs
for blacks, particularly with respect to violence. Such violence declines when
low-skill jobs are in relative abundance, and this link exists independently of the ethnic
composition of low-skill markets. Violence also declines when blacks are employed
in education, health and social services (-.08). This suggests that the outlook is not
entirely bleak; while underclass blacks are losing out in the low-skill sectors, the
forays of other blacks into education, health and social services pay dividends.

The findings for the control variables in Table 2 offer few surprises. In the 
presence of substantively relevant variables, the controls are not significant, with 
some exceptions. Specifically, black violence in Model 1 tends to be greater in
those cities with more Latinos (2.45e-6), that have a greater proportion of blacks
(3.40), and where black economic deprivation is high (.90). These findings are
consistent across all four models in the table. 

As a check, we replicated the analysis with an alternative measure of Latino 
immigration: Latinos who arrived in 1990-2000 as a proportion of the total city
population. In doing so, it was necessary to control for the total city population 
size because large cities, naturally, have greater numbers of immigrants. Using this 
alternative measure produces identical substantive results (not reported). That is, 
Latino immigration is predictive of black violence, regardless of whether violence 
is measured either with the SHR or the Vital Statistics coroner reports (i.e., similar 
to models 1 and 2). Moreover, after controlling for Low-skill Composition, the 
effect of Latino immigration on black violence disappeared (i.e., similar to models 
3 and 4). So in either case, we get the same substantive results.¹⁰

In Table 3 we disaggregate the Low-skill Composition measure by major industry
group. Because the findings are similar whether using offending or victimization
counts, from here on we report only the results for offending (SHR). Model 1
predicts black homicide while controlling for the ethnic composition of low-skill
jobs in the Agriculture, Forestry and Fishing sector. Findings show that when this
sector shifts toward Latinos, black violence rises (-.17). The converted parameter is
-.18, indicating that a one standard deviation shift toward Latinos in this sector is
associated with an 18 percent rise in black violence. Black violence also rises when
Manufacturing and Construction jobs shift in favor of Latinos (Model 2; -.67). This
converts to -.23, so that a one standard deviation shift toward Latinos in this sector
is associated with a 25 percent increase in black violence, a substantial association.


Table 3: Negative Binomial Estimates Predicting Black Homicide in Major U.S. Cities, 2000
Using Industry Specific Low-Skill Composition Measures
Black Homicide
Model 1 Model 2 Model 3 Model 4


Latino immigration 1.03 14 129" .86
Low-skill Composition 2
Agriculture -.17 -- -- --
Manufacturing/Construction -- -.67 -- --
Services -- -- -.14 --
All other -- -- -- .91
Latino population (a) 214A teeta 2.47" 2.23"
Proportion black 258s SAG eae 3.54
Housing density ze -.44 -.92 -.50
Vacant houses JZ 44 6.04 5.17
Moved -1.10 4.66 -.11 14
Proportion of jobs low-skilled -4.90*** -3.60*** -4.28*** -4.02***
Region (west = 1) 22 27 ao 34
Black economic deprivation =f e068 60" ee ap
Blacks in education, health and social services 10M SOR => BOB ae
Black median age 01 01 01 01
Black incarceration -.01 -.01 -.01 -.01
N 86 113 113 113


a o=<01  p=.05 fps 10
(a) Reported in millions to avoid exponential notation.

The ethnic composition of service jobs, in contrast, is not a significant pre- 
dictor of black homicide, perhaps due to the dominance of women in this sec- 
tor—who are not as violence-prone as men. The category of “other,” in Model 
4, contains Transportation, Retail Trade and Waste management. In contrast 
to the first two models, Latino involvement in this category is associated with 
less black violence (.91). Stated conversely, when blacks are more involved than
Latinos in this particular sector, the result is more black violence. While this
seems puzzling at first, it implies that not all so-called “low-skill jobs” are the 
same. With respect to jobs in that last category, Crutchfield may be correct; 
some secondary sector jobs reduce investments in conformity and can actually 
increase crime. Jobs in retail—dominated by retailers like Walmart—offer few 
benefits, low hours and poor pay, and in large numbers may actually be harmful.
Regardless, the results underscore the importance of disaggregating by industrial 
sector in order not to mask industry-specific effects. Our findings reveal that 
inter-ethnic competition in Agriculture, Manufacturing and Construction has 
major implications for black violence. 


The Indirect Effects of Immigration on Black Economic Outcomes 


So far the findings imply that Latino immigration raises black violence by shifting 
the composition of (some) low-skill markets in Latinos’ favor. We now examine 
more closely the mechanism by which Latino immigration affects black economic
prospects. Specifically, we consider four possible outcomes of Latino immigration:
(1) a reduction in black income (“income” effect), (2) a shift of blacks from
low-skill to higher skill jobs (“bump up” effect),¹¹ (3) a displacement of blacks out
of the labor force completely (“removal” effect), or (4) a displacement of blacks into
unemployment, but not out of the labor force (“unemployment” effect).

The findings of this analysis are revealed in Table 4. In Model 1 we examine 
“income” effects by regressing black Median Income on Latino Immigration, 
and find that there is no association between the two (2983.05, not significant). 
In other words, despite the lower income of Latino immigrants, this does not 
apply a downward pressure on black wages. We then re-estimate this first model 
using average income and per capita income, but the conclusions are the same. 
Likewise in Model 2, Latino immigration is not associated with an increase in 
the proportion of black workers in higher skill jobs (.002). Simply put, Latino 
immigration does not “bump up” blacks to higher skill jobs, nor did we expect 
that given how Latino immigration increases black violence. Indeed, if there 
is a bump-up effect then Low-skill Composition should have increased Black 
Median Income in Model 1. Moving to Model 3, we observe no evidence of a 
removal effect—that black workers are being displaced from the labor force al- 
together. So according to the first three models, Latino immigration has neither 
a negative influence (income or removal effects) nor a positive one (bump up 
effect) for urban blacks. 

In Model 4, where Black Unemployment is regressed on Latino Immigration, 
the result is significant (.019). Specifically, of all the outcomes considered in 
these models, Latino immigration does significantly increase the rate of black 
unemployment in major cities. And because of this significant effect, we then 
use unemployment to predict black homicide in Model 5. Results reveal that 
black unemployment significantly increases black violence (22.13). Moreover, 
upon controlling for black unemployment in this final model, the effect of 
Latino immigration on black violence is no longer significant. So of all the pos- 
sible economic outcomes considered here, Latino immigration seems to harm 
the economic prospects of blacks by first increasing their unemployment, which
then increases their violence.
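
The mediation reasoning in Models 4 and 5 (immigration raises unemployment, unemployment raises violence, and the immigration effect fades once unemployment is controlled) can be illustrated with a small simulation. This is only a sketch under loud assumptions: it uses ordinary least squares rather than the negative binomial models reported in the article, and the data are synthetic, generated so that the predictor affects the outcome only through the mediator. All names and effect sizes are hypothetical.

```python
import random

def _cov(a, b):
    """Population covariance of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    return sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b)) / n

def slope(y, x):
    """Least-squares slope of y on a single predictor x (with intercept)."""
    return _cov(x, y) / _cov(x, x)

def two_predictor_slopes(y, x1, x2):
    """Least-squares slopes of y on x1 and x2 (with intercept), in closed form."""
    s11, s22, s12 = _cov(x1, x1), _cov(x2, x2), _cov(x1, x2)
    s1y, s2y = _cov(x1, y), _cov(x2, y)
    det = s11 * s22 - s12 ** 2
    return (s22 * s1y - s12 * s2y) / det, (s11 * s2y - s12 * s1y) / det

# Synthetic data: immigration affects violence only through unemployment.
rng = random.Random(0)
n = 2000
immigration = [rng.gauss(0, 1) for _ in range(n)]
unemployment = [0.8 * x + rng.gauss(0, 1) for x in immigration]
violence = [1.5 * m + rng.gauss(0, 1) for m in unemployment]

total_effect = slope(violence, immigration)        # analogous to Models 1 and 2
path_a = slope(unemployment, immigration)          # analogous to Model 4
direct_effect, path_b = two_predictor_slopes(      # analogous to Model 5
    violence, immigration, unemployment
)

print(f"total={total_effect:.2f} direct={direct_effect:.2f} "
      f"indirect={path_a * path_b:.2f}")
```

In this linear setting the total effect equals the direct effect plus the product of the two paths, so a direct effect near zero alongside a sizable total effect is the pattern the text describes as an effect that "is no longer significant" after the control is added.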


Discussion and Conclusion


Past research has linked black crime to a variety of social problems. As pressing as 
these problems are, the research overlooks an important story in the contemporary 
history of the American labor force. Unnoticed in criminological literature is 
how U.S. immigration policy inadvertently fueled a rise in the low-skill Latino 
population, which then placed tremendous competitive pressure on the supply of 
low-skill jobs. In addressing this issue, we observe four important and interrelated
findings: (1) Although Latino immigration is positively linked to black violence in
major cities, the effect disappears once the ethnic composition (Latino vs. black)
of the low-skill market is considered. (2) This suggests that black violence rises in
those areas where blacks lose ground to Latinos in the competition for low-skill
jobs. (3) But not all low-skill sectors operate in unison; we find that black violence
rises only when jobs in Agriculture, Manufacturing and Construction are in short
supply. (4) Latino immigration raises black violence by first increasing black
unemployment. We draw several implications from these findings.

First, many urban blacks are ensnared in a confusing tangle of social problems 
including discrimination, segregation and the flight of the black middle class. 
These are serious issues, but Wilson (1996) and others distill the central problem 
down to joblessness—the sine qua non of black social dislocation. When access to 
low-skill jobs is blocked on a wide scale, the negative effects go beyond the mere 
loss of a paycheck and instead radiate outward to harm all aspects of black life. The
result is a system disjuncture that drives a wedge between underclass blacks and 
any semblance of a mainstream existence. But our approach suggests that current 
theoretical explanations of black joblessness and social dislocation may need to be 
revised. Existing heuristic frameworks (e.g., deindustrialization, global economic
restructuring perspectives, etc.) emphasize the supply side of the equation, that the
decline in the number of low-skill jobs has harmed black economic prospects. 
We agree, but overlooked is how black prospects are likewise being narrowed by 
the increase in competition for the remaining low-skill jobs due to demographic 
changes. As the number of low-skill Latinos in the United States rises, so too does 
the competitive pressure on the existing stock of suitable jobs. This change has a 
demonstrable ripple effect; when competitive forces place those jobs beyond reach, 
the consequences for black violence are serious. Our conceptual understanding of 
black social dislocation needs to be expanded to include the potentially important 
macro social link between U.S. immigration policy and black violence. 

There is a connection between U.S. immigration policy and the economic 
prospects of urban blacks. Several decades of accumulated damage from a po- 
litically symbolic border control have taken their toll. To be sure, by no means 
do we advocate restricting the flow of Latino migrants, in either direction. On
the contrary, we view Latin America as a special case, one where existing quotas 
are meaningless in light of the high volume of migrants from this region, their 
proximity to the United States and the sheer size of the border. Raising the cap
to allow more flow and counter-flow will help satisfy market demand, enhance 
security, reduce undocumented migration and free migrants to return home to 
their communities. As it stands, politically motivated initiatives such as building 
barriers and increasing border enforcement have, ironically, made it less likely for 
landed migrants to leave for fear that they cannot return. And regardless of the 
(im)practical outcomes of building higher walls, we see no utility in depriving this
country of the undeniably rich contribution made by Latinos. Further analysis 
of low-skill jobs and crime in Latino communities is needed, and we encourage 
others to use this study as a launching point. But ultimately, ours is a story of 
black structural disadvantage and how U.S. immigration policy contributes to the 
formation of the underclass. 

Second, losing good quality low-skill jobs is a problem, but losing bad quality jobs
is not. The findings disaggregated by industry reveal that violence among blacks rises 
when they lose access to jobs with reasonable wages and benefits. In contrast, black 
violence has no link to lesser quality jobs such as those in service. In further contrast,
black violence actually declines when access is reduced to the very bottom-tier jobs,
like those in waste management or retail. In other words, while jobs that offer a
living wage can reduce problems such as lethal violence, jobs at the other extreme, 
those with very low pay, low hours and no benefits, actually increase violence. In 
this respect, Crutchfield is correct: not all low-skill jobs are the same.

Third, there is a wide variation in findings in past research on immigration. 
While most of the literature points to the generally positive effects of immigration 
on the U.S. economy, some analysts report a negative effect on minorities. This 
variation in past findings is as expansive as the topic of U.S. immigration itself, 
which is a complex story involving many different ethnic and race groups over a 
very long period of time. Encapsulating immigration’s global effect on all crime 
may yield less purchase than taking a specific slice of the question—in our case 
Latino immigration and black violence—and disassembling it in detail. Doing so 
gives way to a more nuanced understanding of immigration’s effects. 

Fourth, we find that when blacks are employed in education, health and 
social services, black violence goes down. Perhaps underclass blacks can one 
day secure jobs in the lower tiers of education, health and social services as 
an alternative to a competitive low-skill market segmented into ethnic niches. 
But as yet, we see little evidence that the Latino dominance in low-skill jobs is 
bumping blacks into education, health and social services (or any other sector, 
for that matter). After all, the correlation between this variable and low skill 
composition is virtually zero (.001). Nonetheless, underclass blacks may have 
benefitted already from the gains made by their more fortunate counterparts. 
Since the raison d’être of education, health and social services is to qualitatively
improve the lives of all citizens, especially those who are less fortunate, blacks in 
this sector are in a position to provide underclass communities with assistance, 
outreach and real world examples of middle-class success. Although secondary
to our main objective, this finding deserves future consideration. 

Fifth, future research should also examine the regional differences in the link 
between Latino immigration and black violence. During the 1990s, Latino mi- 
gration transformed from a regional event in the Southwest, to a national phe- 
nomenon involving other parts of the country (Durand et al. 2000). As Latino 
immigration expanded geographically, migrants went beyond the traditional 
Latino enclaves in the Southwest, Florida, Illinois and New York toward new 
places across the Old South and elsewhere. Perhaps these new job markets are the 
ones being hotly contested. This might explain the significant effect of “recent” mi- 
grants in the alternative specification of Table 2. And while our analysis focuses on 
black violence, future research should consider Latino violence as well. Although 
extant research finds little or no link between Latino immigration and crime, the 
studies fail to differentiate between Latinos in traditional areas and those who are 
pushing deeper into the country. Without the insulating blanket of their estab- 
lished enclaves, the new Latino migrants may be settling in socially isolated and 
potentially dangerous settings (Shihadeh and Barranco forthcoming c). 

Finally, despite the trend in Latino immigration, black crime rates actually 
dropped during the 1990s even after adjusting for changes in the age distribu- 
tion of the population (Steffensmeier 1999). This may have resulted from the 
unprecedented economic boom that increased the demand for low-skill labor so 
dramatically that it rendered the minimum wage virtually meaningless. Wilson, 
in a conversation with the authors, speculated that had the boom lasted another 
decade or two, the black underclass might have all but disappeared. Perhaps, but 
the boom is long over and we are now mired in a protracted recession, one that 
may push black crime rates back up. Wilson implicated the loss of manufacturing 
jobs in the ghettoization of inner cities and the rise of black crime in the 1980s. 
Today, major demographic changes and an intense competition for low-skill jobs
could bring back those bad old days with considerable force.


Notes 


1. Although immigration contributes to the growth of the U.S. population of Latinos, 
Latino birth rates now contribute more (Johnson and Lichter 2008). 


2. Specifically, the limit of 100,000 population and 5,000 blacks and 5,000 Latinos 
reduces the city sample to 152. Five of those cities do not report SHR data (mainly 
in Florida), and 35 more are lost due to missing county-level black incarceration data. 
We also ran models without the incarceration control (and hence without the loss of 
those cities), but the substantive results are unchanged. The final sample is 117 cities 
when using Vital Statistics data, and 113 when using the Supplementary Homicide 
Reports. This necessitates the caveat that our sample is not 100 percent generalizable 
to the total U.S. population. 


3. A reviewer correctly pointed out that this measure would be influenced by the 
baseline Latino population in a city. Cities with large Latino populations would 
score low since it increases the denominator in the measure. To compensate, we
do two things: (1) we control for the baseline Latino population in each city and
(2) we model Latino immigration with an alternative measure: Latinos who arrived
between 1990 and 2000 (the recent wave) as a proportion of the
total population (not just of Latinos). Both measures produced similar substantive
findings. See results section for detail.

4. “Not in low-skill jobs” includes those in high skill occupations, those unemployed
or out of the labor force. We also estimate models where “those not in the low-skill 
jobs” excludes those who are unemployed or out of the labor force but we obtained 
the same substantive result. We explore this issue in greater detail later in the analysis. 


5. For example, the natural log of the ratio, 1/2, is -.69. The natural log of the inverse 
of that ratio, 2/1, is +.69. 


6. Transportation, Retail Trade and Waste Management.
7. The principal components loadings for all four variables are more than .80. 


8. This was chosen because Latinos originally concentrated in the Southwest and
later fanned out to other non-Western destinations. However, we also ran models with a
South / non-South dichotomy (South = 1) but the substantive results remained the same.


9. We report the results for median income; we also used average income and per
capita income, but the substantive conclusions did not change.


10. We also tried another Latino immigration measure: the number of foreign-born
Latinos divided by the total city population. Unfortunately, it was highly collinear 
with other variables in the model and produced unstable estimates. 


11. Higher skill jobs are defined simply as those jobs that are not defined as low skill per
our criteria.


Hispanic Population Growth and Rural Income Inequality


Emilio A. Parrado, University of Pennsylvania 
William A. Kandel, Economic Research Service, USDA 


We analyze the relationship between Hispanic population growth and changes in U.S. 
rural income inequality from 1990 through 2000. Applying comparative approaches 
used for urban areas we disentangle Hispanic population growth’s contribution to 
inequality by comparing and statistically modeling changes in the family income 
Gini coefficient across four rural county types: established Hispanic, rapidly growing 
Hispanic, rapidly growing non-Hispanic, and slow-growth or declining counties. 
Results support perspectives that stress growing social heterogeneity for understand- 
ing the contribution of minority population growth to inequality, including changes 
in human capital and industrial restructuring. We find remarkably similar inequality 
growth across rapidly growing Hispanic and non-Hispanic counties. This suggests 
that growing rural inequality stems largely from economic expansion and population 
growth rather than changing Hispanic composition. 





Introduction 


Rapid growth of the U.S. Hispanic population, especially through immigration, 
has generated public concern and academic debate about its contribution to in- 
come inequality (Reed 2001; Borjas et al. 1997; Lerman 1999). Two general 
perspectives inform this discussion. The first argues that relatively less educated
and unskilled Hispanic immigrants contribute to income inequality because their 
low levels of human capital restrict their earning power (Borjas 1999). The sec- 
ond perspective argues that structural forces fueling the demand for low-skilled, 
low-paying jobs are the main cause of income inequality, and the increase in low- 
skilled immigration simply reflects those forces (Piore 1979). 

Empirical trends underlying the debate are clear. From 1990 to 2000 the 
Hispanic population in the United States increased more than 50 percent and 
in 2003 officially surpassed non-Hispanic blacks as the nation’s largest minority 
group. Much of this growth resulted from immigration. More than 7 million 
Latin-American migrants entered the United States during the 1990s, almost dou- 
bling the number of foreign-born Hispanics in a single decade. Rapid Hispanic 
population growth also coincided with rising income inequality. U.S. Census 
estimates indicate that from 1989 through 1999 the Gini coefficient measuring
income inequality among U.S. families increased 7 percent, from .401 to .429 
(U.S. Census Bureau 2006). Combined with the 10 percent increase during the 
1980s, this trend of growing inequality represents a reversal of both consistent
declines in income inequality during the first four decades of the 20th century and
stable levels of low-income inequality registered from 1940 to 1970.
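
For readers less familiar with the measure, the Gini coefficient and the percentage change quoted above can be sketched in a few lines. This is an illustrative implementation of the standard rank-weighted formula for a list of incomes, not the Census Bureau's estimation procedure; the function names are hypothetical.

```python
def gini(incomes):
    """Gini coefficient of non-negative incomes (0 = perfect equality)."""
    values = sorted(incomes)
    n = len(values)
    total = sum(values)
    if n == 0 or total == 0:
        return 0.0
    # Rank-weighted formula: G = 2 * sum(i * x_i) / (n * sum(x)) - (n + 1) / n,
    # with incomes sorted in ascending order and ranks starting at 1.
    weighted = sum(rank * x for rank, x in enumerate(values, start=1))
    return 2.0 * weighted / (n * total) - (n + 1.0) / n

def percent_change(old, new):
    """Percentage change from old to new."""
    return 100.0 * (new - old) / old

# The change in the family income Gini reported in the text.
print(f"{percent_change(0.401, 0.429):.1f} percent")
```

The 7 percent figure in the text is thus a relative change in the coefficient itself, not a change of seven Gini points.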

Several studies have attempted to disentangle the connection between Hispanic 
population growth—especially through immigration—and income inequality 
(Altonji and Card 1991; Borjas et al. 1996; Bradbury 1996; Chevan and Stokes 
2000; Peri 2006). The majority of these analyses, however, have been conducted 
either at the national level or only for urban areas. Studies of mechanisms undergirding
rural inequality are relatively rare despite scholarly recognition that the
forces fueling income inequality might differ in rural and urban areas (Domazlicky 
2005; Kuznets 1955; McLaughlin 2002). Moreover, with few exceptions (Albrecht 
et al. 2005; McLaughlin 2002) the handful of studies with a specific focus on 
rural inequality has not paid explicit attention to the impact of changing minority 
composition and immigration on inequality. 

However, understanding this association is increasingly relevant. Data from
the 2000 U.S. Census revealed unanticipated and dramatic Hispanic population
growth in all U.S. regions but particularly in new rural destinations in the
Southeast and Midwest. In fact, in the 1990-2000 decade, the rate of non-metro
Hispanic population growth exceeded that of metro counties (Kandel and
Cromartie 2004), highlighting the need to expand the geographic focus of most
previous ethnic and immigration research. Although often overlooked in current 
discussions of immigration and economic trends, non-metropolitan counties are 
not trivial entities, encompassing roughly 77 percent of all U.S. territory and 17 
percent of its population. The rapid influx of minority groups in non-traditional 
destinations has considerable economic and policy ramifications for rural areas. 

Accordingly, this research examines the relationship between Hispanic popula- 
tion growth and changes in family income inequality in non-metropolitan coun- 
ties from 1990 to 2000. Our analytical approach builds on urban-based research 
linking inequality across local labor markets to differences in the relative supply of 
immigrants and minorities (Card 2005; Friedberg and Hunt 1995). This approach 
disentangles the contribution of minority population growth to inequality by 
generating counterfactual comparisons of inequality trends across labor markets 
with different population trajectories. 

While cities function readily as delineated local labor markets by allowing 
researchers to compare individual cases (e.g., Card 1990), analysis of more sparsely 
populated rural areas benefits from grouping areas according to their changing 
population compositions. Our approach is to create a county-level typology that 
distinguishes established Hispanic, rapidly growing and recently settled Hispanic, 
rapidly growing non-Hispanic, and slow-growth or declining populations. We 
convey trends captured in our typology by mapping several county type distribu- 
tions and qualitatively describing illustrative cases in each group. The quantita- 
tive analysis that follows compares inequality patterns across these four county 
types and models such changes according to labor force characteristics, industrial
change and additional sources of heterogeneity. Overall, our comparative ap- 
proach reveals remarkably similar inequality trends across Rapid Growth Hispanic 
and non-Hispanic rural counties, suggesting that increases in rural inequality are 
largely the product of economic expansion and population growth and not of 
increases in Hispanic composition per se. 


Recent Rural Hispanic Population Growth 


U.S. Census estimates reported in Table 1 show that the non-Hispanic non-metro 
population’ was relatively stagnant in the 1980s, with a growth rate well below 
metro areas. This pattern changed significantly during the 1990s, when non-metro
population growth revived to more than 8 percent and rivaled the growth in
metro areas. This increase in population growth was even more dramatic among 
Hispanics, whose non-metro numbers grew 27 percent during the 1980s (rela- 
tive to 56 percent in metro areas) and 67 percent during the 1990s, surpassing 
the 57 percent growth registered in urban areas and accounting for more than 25 
percent of all non-metropolitan population growth (Kandel and Cromartie 2004; 
Kirschner et al. 2006). 

These general trends mask considerable variation across regions. Particularly 
striking was the growth of Hispanic populations outside traditional destination ar- 
eas in the Southwest, where the majority of rural Hispanics have resided since the 
turn of the century. Media reports have illustrated dramatic examples of Hispanic 
influx in places such as Dalton, Georgia; Storm Lake, Iowa; and Siler City, North 
Carolina, and a growing body of ethnographic research documents the mixed 
reception Hispanics typically receive in small communities with little experience 
or few public programs to assist foreign-born newcomers (Gozdziak and Martin 
2005; Griffith 1995; Kandel and Parrado 2004, 2005; Zúñiga and Hernández-León
2005; Massey 2008).

Table 1 illustrates these changes. From 1990 to 2000, nonmetropolitan 
Hispanics increased 35 percent in the Southwest,” 113 percent in the Midwest, 
81 percent in the West, and a staggering 204 percent in the South. Most of the 
growth and dispersion of the Hispanic population was driven by the foreign born. 
In the South during the 1990s, for example, the native Hispanic population
grew 38 percent compared to an astounding 211 percent among the
foreign born, explaining over 82 percent of total Hispanic population growth 
in the region. The majority of the foreign born (60 percent) entered the United 
States after 1990, and 40 percent entered between 1995 and 2000. A similar 
pattern occurred in the Midwest, where the growth rates of native and foreign 
born Hispanics were 53 and 206 percent, respectively. The growth of the foreign 
population explains the majority of the growth (61 percent) of the Hispanic 
population in the Midwest with 46 percent entering the United States after 1990. 
The main implication for our purposes is that understanding the role of Hispanic 
Table 1: Hispanic and Non-Hispanic Population Growth by Metropolitan
Status and Region

                  Total                     % Growth across U.S. Regions
                  Population
                  (thousands)  % Change  Northeast  Midwest  South   West   Southwest
Non-Hispanics
  Metro
    1980           163,899
    1990           177,360        8.2       6)        1.8     14.3   16.9     16.0
    2000           193,133        8.9       2.4       6.1     14.3   19.0      9.4
  Nonmetro
    1980            48,038
    1990            48,995        2.0       40       -2.3      2.8    8.1      6.8
    2000            52,983        8.1       41        44       9.5   16.7     13.0
Hispanics
  Metro
    1980            13,111
    1990            20,452       56.0      44.0      36.0     oud    62.2     59.9
    2000            32,130       57.1      38.9      78.0     93.1  129.6     50.1
  Nonmetro
    1980             1,498
    1990             1,902       27.0      TA2       25.8     AZ     54.8     26.8
    2000             3,176       66.9      12.3     113.2    204.0   81.1     35.3





Source: U.S. Census, SF1 files, 1980-2000 
Note: Regions are census regions, except for the Southwest which borrows from the 
West and the South and consists of Arizona, California, Colorado, New Mexico and Texas. 


population growth on economic outcomes overlaps with understanding the effect 
of immigration which has been the primary contributor to the growing Hispanic 
population in new areas of destination and the leading force affecting social heterogeneity.
The rapidity and magnitude of these changes highlight the need to
evaluate their impact on inequality in rural America.
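
The decade growth rates discussed in this section follow directly from the population counts in Table 1. The sketch below recomputes them for Hispanics; the counts are taken from the table (in thousands), while the function and variable names are illustrative. Small differences from the published percentages can arise because the table reports rounded counts.

```python
def growth_rate(start, end):
    """Percentage growth from start to end."""
    return 100.0 * (end - start) / start

# Hispanic population counts (thousands) from Table 1.
hispanic_counts = {
    "Metro": {1980: 13111, 1990: 20452, 2000: 32130},
    "Nonmetro": {1980: 1498, 1990: 1902, 2000: 3176},
}

for area, counts in hispanic_counts.items():
    for start_year, end_year in [(1980, 1990), (1990, 2000)]:
        rate = growth_rate(counts[start_year], counts[end_year])
        print(f"{area} {start_year}-{end_year}: {rate:.1f} percent")
```

Running the comparison side by side makes the reversal visible: non-metro Hispanic growth trailed metro growth in the 1980s and exceeded it in the 1990s.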


Hispanic Population Growth and Social Heterogeneity


Much empirical scholarship has analyzed the forces fueling inequality. Because 
we consider the particular phenomenon of Hispanic population growth within 
a mostly non-Hispanic rural population, our analysis builds on elaborations 
that stress the role of social heterogeneity to explain cross-sectional variation 
in inequality and its evolution over time. These articulations began with
Kuznets’ (1955) work and later received expanded treatment by Nielsen and 
colleagues (Nielsen 1994; Nielsen and Alderson 1997; Nielsen and Alderson 
2001; Moller et al. forthcoming). A central tenet of this perspective is that 
inequality is “generated by social heterogeneity related to a specific stage of the 
development process” (Nielsen 1994:655). Rather than expecting monotonic
growth or decline in inequality with economic development, this perspective
emphasizes transitional aspects of growth that enlarge economic disparities. The
main expectation is that any dimension associated with economic growth that is
unevenly distributed across individuals and affects their income prospects will 
translate into inequality (Nielsen and Alderson 1997). 

The classical example of a transitional mechanism affecting inequality is sector 
dualism. According to Kuznets (1955), population shifts between a traditional 
and a modern sector of the economy produce an inverted U-shaped relationship 
between development and inequality. Specifically, shifting a society’s labor force 
from a traditional agricultural sector with low productivity and wages to a modern 
sector with high productivity and wages is expected to affect the evolution of income 
inequality over time. Low levels of inequality prevalent among agricultural societies 
are predicted to increase as employment shifts to the higher paying modern sector 
during early stages of development. Inequality is expected to peak at intermediate 
levels of development and ultimately decrease at some advanced stage when substan- 
tial portions of the population are employed in the modern sector. 
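
The inverted U-shaped pattern can be illustrated with a stylized two-sector calculation. The following is a minimal sketch, not part of the original analysis: it assumes every worker in a sector earns the same wage, and the wage levels (1 and 3) and the function name `two_sector_gini` are illustrative assumptions.

```python
def two_sector_gini(modern_share, w_traditional=1.0, w_modern=3.0):
    """Gini coefficient when every worker earns one of two wages.

    With no inequality inside either sector, the Gini reduces to
    p * (1 - p) * |w_modern - w_traditional| / mean_income,
    where p is the share of workers in the modern sector.
    The default wages are made-up values for illustration only.
    """
    p = modern_share
    mean_income = p * w_modern + (1 - p) * w_traditional
    return p * (1 - p) * abs(w_modern - w_traditional) / mean_income


# Trace inequality as the labor force shifts into the modern sector.
shares = [i / 10 for i in range(11)]
path = [two_sector_gini(p) for p in shares]
peak_share = shares[path.index(max(path))]

for p, g in zip(shares, path):
    print(f"modern share {p:.1f}: Gini {g:.3f}")
print(f"inequality peaks at a modern-sector share of {peak_share:.1f}")
```

Under these assumed wages, inequality is zero when the whole labor force is in one sector, rises as workers begin to move into the modern sector, and falls again once most workers are employed there.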

The predicted Kuznetsian pattern of declining inequality at advanced levels of 
economic development was supported in studies of the evolution of inequality 
across U.S. counties (Nielsen and Alderson 1997). However, the reversal in the 
evolution of income inequality in recent decades highlights the importance of 
other sources of heterogeneity in affecting inequality trends, including educa- 
tional composition, industrial structure and demographic change (Nielsen and 
Alderson 1997). Given our focus on the role of growing Hispanic representation, 
we concentrate on transitional mechanisms that could account for the role of 
minority population growth on income inequality across rural areas. In general, the
connection between inequality and heterogeneity suggests that rapid population
growth, especially of low-skilled minority groups, should contribute to inequality
(Chenery et al. 1974; McNicoll 1984). The mechanisms, however, are diverse and
depend on the forces attracting people to rural areas. 

Two general sources of heterogeneity are central to our analysis (Chevan and 
Stokes 2000; Katz and Murphy 1992). The first, often associated with supply- 
side explanations that link Hispanic population growth with income inequality, 
is growing heterogeneity in population composition, specifically human capital 
endowments, foreign-born status and family structure. The general expectation 
is that Hispanic population growth expands the supply of low-skilled workers, 
altering the composition of the labor force and thereby fostering inequality. This 
is particularly so for human capital endowments. Among rapidly growing rural 
counties the sudden influx of immigrant Hispanics expands the lower end of the 
educational distribution, potentially contributing to inequality. Borjas (1999), for 
example, finds that between 1980 and 1995 immigration increased the number of 
high school dropouts by 21 percent, a period during which dropouts’ wages fell 
11 percent relative to more educated workers. At the other end of the educational 
distribution, rural counties attracting highly skilled professionals could also expe- 
rience growing inequality (McLaughlin 2002). 

In addition, the Hispanics moving to new rural destinations during the past
two decades tend to be primarily of immigrant origin (Kandel and Cromartie 
2004). Foreign-born status often embodies several detrimental labor market 
characteristics, including less English fluency, less U.S. work experience, and, 
frequently, unauthorized status. The increasing heterogeneity resulting from the 
growing relative representation of foreign-born groups in rural counties is also 
expected to contribute to inequality.

The final supply-side process affecting inequality associated with minority 
population growth relates to changes in demographic composition, particularly 
age and family structure. In general, the life-cycle profile of earnings implies that 
the relative growth of the population at older ages should contribute to income 
inequality due to relatively higher earnings exhibited among older working age 
groups (Formby et al. 1989). This effect, however, might be mitigated in rapidly 
growing Hispanic counties because Hispanics, especially immigrants, are younger 
than the general population. Hispanic population growth is also likely to alter the 
family structure of receiving counties, including the proportion of female-headed
households and patterns of female labor force participation. Disadvantages affecting
female-headed households imply that their growing representation should also
contribute to income inequality (Snyder and McLaughlin 2004). At the same time, 
female labor force participation represents an important source of family income 
that compensates for individual income disparities and thus reduces inequality 
(Cancian and Reed 1998). Female employment, however, is lower in Hispanic 
families, reducing its potential to abate inequality in rapidly growing Hispanic 
counties compared to other counties. 

The second general source of heterogeneity, often associated with demand-side
explanations linking Hispanic population growth and income inequality, is growing 
heterogeneity in the sectoral composition of employment and industrial restructur- 
ing. The main expectation is that the changing industrial composition of rural coun- 
ties will contribute to inequality above and beyond changes in the socioeconomic 
characteristics of the labor force. Numerous studies have linked industrial restruc- 
turing to growing inequality (Harrison and Bluestone 1990; Morris et al. 1994; 
Nielsen and Alderson 1997; Chevan and Stokes 2000; McLaughlin 2002). These
studies stress that shifts in employment away from manufacturing and toward
services increase income inequality. The main expectation here is that manufacturing
employment reduces inequality due to its relatively high level of productivity and
unionization, which provide the means and incentives for corporations to pay high
wages even to low-skilled workers. This was in fact the case for immigrants and
minorities during the early 20th century, when manufacturing employment facilitated
access into the middle class. We expect the decline in manufacturing to reduce the 
number of good jobs available to low-skilled workers and thus to increase inequality. 

While much of this literature emphasizes the dichotomy between manufacturing 
and service sectors, applying the industrial restructuring framework to rural areas 
requires special attention to industries attracting low-skilled Hispanics, including
foreign-born workers, to non-metropolitan counties. This comprises changes not
only in manufacturing and services, but also in agriculture and construction. Given 
the emphasis on the characteristics of jobs available within these sectors, including 
constraints on wage increases and union protection, we expect that increases in 
agricultural and construction employment, as well as service industry employment, 
would contribute to inequality in rural areas, relative to manufacturing employment. 

The literature documents other socioeconomic sources of heterogeneity affecting 
inequality (Barro 2000; McLaughlin 2002; Martin 2006) including growth of the 
non-Hispanic black population, unemployment, median family income, labor force 
size and prevalence of full time employment. Key among these factors is family 
income because it directly ties with Kuznets’ argued link between economic growth 
and income inequality. Although we control for such mechanisms in our statistical 
analysis, they are not the focus of our study. Rather, we emphasize the role of chang- 
ing supply and demand conditions, because they directly address the unresolved is- 
sue of whether it is Hispanic population growth per se or broader economic changes 
that account for inequality trends in rapidly growing Hispanic destinations. 


Analytic Strategy: Counterfactual Comparisons and Rural County Inequality 


One challenge for our analysis is the difficulty of separating the unique impact of 
Hispanic population growth from broader processes of socioeconomic change in 
rural counties. One approach is to include a measure of the changing Hispanic 
composition in a county and estimate its effect on income inequality. This option is 
unlikely to capture the dramatic changes of new rural Hispanic destination areas be- 
cause the number of emerging rural Hispanic counties is not large, and effects might 
not be captured with a continuous measure of change in Hispanic representation 
across all rural U.S. counties. Moreover, such an analysis does not provide straight- 
forward comparisons across rural areas experiencing dissimilar population trends. 

An alternative approach, to identify counties with different labor market struc- 
tures and relate trends in inequality to their particular population trajectories, has 
been applied extensively in metropolitan studies of the impact of immigration 
on natives’ wages (Card 1990, 2005; Friedberg and Hunt 1995). This analyti- 
cal strategy compares the local wage structure across cities with large and small 
influxes of immigrants. This approach is particularly advantageous in our case 
because it generates clear counterfactuals that permit comparison of changes in 
inequality across rural areas that have recently received a large influx of Hispanics 
against those that did not. It also allows for a deeper and broader understanding 
of the transformations occurring in rural areas by distinguishing demographic and 
economic patterns associated with changes in minority composition. In addition, 
it can be used to geographically locate different local labor markets with particular 
population trajectories and thereby identify the regional concentration of the 
economic and social transformations affecting rural areas. 

While cities function naturally as local labor markets, facilitating individual
case comparisons, rural area comparisons are less straightforward. Our approach 
is to construct a rural county typology that distinguishes distinct population 
trajectories from 1990 to 2000. The typology combines three factors: Hispanic 
share of 1990 county population, the change in county Hispanic share from 1990 
to 2000, and total 1990-2000 county population change. Together, these factors 
produce four county types: (1) counties with established Hispanic populations,
(2) counties with rapid Hispanic population growth in regions with little previous
Hispanic presence, (3) counties that grew rapidly but do not have sizeable
Hispanic populations, and (4) demographically stagnant counties. Comparing
inequality across these county types, especially between Rapid Growth Hispanic 
and Rapid Growth Non-Hispanic, is our main objective. 


Data and Methods 


Data for this analysis come from the 1990 and 2000 U.S. Census SF3 files. The 
unit of analysis is the non-metropolitan county as defined in 2003 (see footnote 
1). The dependent variable is the change in Gini coefficient for family income 
from 1990 to 2000. We focus on family rather than individual inequality because
it more directly reflects overall population well-being, especially for women and 
children. 
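
To make the dependent variable concrete, the sketch below computes a Gini coefficient at two time points and takes their difference. It is an illustration only: it assumes a list of individual family incomes, whereas a census-based Gini would normally be derived from published income categories, and the function name `gini` and the income values are hypothetical.

```python
def gini(incomes):
    """Gini coefficient of a list of non-negative incomes (0 = perfect equality)."""
    values = sorted(incomes)
    n = len(values)
    total = sum(values)
    if n == 0 or total == 0:
        return 0.0
    # Rank-weighted form: G = 2 * sum(rank * x) / (n * sum(x)) - (n + 1) / n
    weighted = sum(rank * x for rank, x in enumerate(values, start=1))
    return 2 * weighted / (n * total) - (n + 1) / n


# Hypothetical family incomes (in thousands) for one county in two census years.
incomes_1990 = [20, 30, 40, 50]
incomes_2000 = [20, 30, 40, 90]

change_in_gini = gini(incomes_2000) - gini(incomes_1990)
print(f"1990: {gini(incomes_1990):.3f}  2000: {gini(incomes_2000):.3f}  "
      f"change: {change_in_gini:+.3f}")
```

A positive change indicates that family income became more unequally distributed over the decade in this made-up county.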

Sociologists have long debated appropriate statistical approaches for analyzing 
socioeconomic change when the dependent variable is measured at two points in 
time, and in particular how best to account for omitted variables in panel data 
designs. Two methods most commonly proposed are: the lagged-regressor variable
method, in which the dependent variable measured at time 2 ($Y_2$) is regressed on
the dependent variable measured at time 1 ($Y_1$) and additional covariates (X); and
the difference score method, in which the time 1 score is subtracted from the time
2 score ($Y_2 - Y_1$) and then regressed on X. It is important to note that the regression
of $Y_2 - Y_1$ on both $Y_1$ and X is computationally equivalent to the regressor variable
method and produces the same results (see Allison 1990; Werts and Linn 1970). 
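
This equivalence can be checked numerically. The following is a minimal sketch on simulated data; the sample size, the coefficients used to generate the data and all variable names are illustrative assumptions rather than values from the study.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
x = rng.normal(size=n)
y1 = rng.normal(size=n)
y2 = 0.6 * y1 + 0.8 * x + rng.normal(scale=0.5, size=n)  # made-up data-generating process

# Same predictors in both models: intercept, Y1 and X.
design = np.column_stack([np.ones(n), y1, x])

# Lagged-regressor model: Y2 regressed on Y1 and X.
b_lagged, *_ = np.linalg.lstsq(design, y2, rcond=None)
# Change score regressed on the same predictors: (Y2 - Y1) on Y1 and X.
b_change, *_ = np.linalg.lstsq(design, y2 - y1, rcond=None)

print("lagged-regressor coefficients:", np.round(b_lagged, 4))
print("change-score coefficients:    ", np.round(b_change, 4))
```

The intercept and the coefficient on X are identical in the two fits, and the coefficient on Y1 differs by exactly one, which is why the two specifications lead to the same conclusions about X.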

The literature further distinguishes pure-difference from semi-difference models 
depending on how the predictor variables (X) are specified. In the pure-difference 
model, the change score ($Y_2 - Y_1$) is regressed on the difference score of the
independent variables ($X_2 - X_1$). In the semi-difference model, the change score is regressed
on the level of the predictor variables at time 1 ($X_1$), which more closely resembles
the lagged-regressor variable method in which predictors are measured at time 1. 

Both for methodological and theoretical reasons Allison (1990) and Firebaugh 
and Beck (1994) support the pure-difference approach. Methodologically, the pure- 
difference model eliminates the potential bias due to omitted variables. Unmeasured 
enduring traits of individual counties are removed when differencing both the
dependent and independent variables because constant effects get cancelled out. 
Theoretically, model choice depends on the causal connection expected between 
the variables. The lagged-regressor variable model assumes a temporal ordering
from $Y_1$ to X to $Y_2$, which is not appropriate in two-wave panel designs in which X
is measured contemporaneously with Y at both time points (Allison 1990).
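
The methodological argument, that differencing cancels enduring unmeasured traits, can be illustrated with a small simulation. This is a sketch under stated assumptions: the unmeasured trait, the true effect of 0.5 and all variable names are invented for the example and do not come from the study.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 5000
true_beta = 0.5  # assumed effect of X on Y

trait = rng.normal(size=n)        # unmeasured, time-constant county trait
x1 = trait + rng.normal(size=n)   # X at time 1 is correlated with the trait
x2 = x1 + rng.normal(size=n)      # X changes between the two waves
y1 = true_beta * x1 + 2.0 * trait + rng.normal(scale=0.1, size=n)
y2 = true_beta * x2 + 2.0 * trait + rng.normal(scale=0.1, size=n)


def slope(x, y):
    """Bivariate OLS slope of y on x."""
    x_c = x - x.mean()
    return float(x_c @ (y - y.mean()) / (x_c @ x_c))


cross_section = slope(x2, y2)               # omits the trait, so it is biased
pure_difference = slope(x2 - x1, y2 - y1)   # the constant trait cancels out

print(f"cross-sectional estimate: {cross_section:.3f}")
print(f"pure-difference estimate: {pure_difference:.3f} (assumed true value {true_beta})")
```

In this setup the cross-sectional slope is pulled well away from the assumed effect because the omitted trait is correlated with X, while the pure-difference slope stays close to it.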

In addition, Firebaugh and Beck (1994) argue that the theoretical argument 
for expecting the level of the independent variables at time 1 to affect change in 
the dependent variable is unclear. They show that a semi-difference model also 
implies that a change in Y is caused by a change in X and Y but during the previ- 
ous interval. A semi-difference model assumes that the effect of a change in X lies 
dormant for a period before affecting Y. The extent and rationale for the dormancy 
period is usually unclear. Especially if the interval in a pure difference model can 
be extended beyond the expected dormancy period, regressing difference scores on 
change in the levels of the independent variables more appropriately estimates the 
causal connection between Y and X in two-wave panel designs (Allison 1990).

Accordingly, we specify a pure difference model which predicts change in income 
inequality between 1990 and 2000 according to four types of independent variables: 
rural county types (T), changing population composition (P), changing industrial
composition (I), and other sources of heterogeneity (O). Our working equation is:


$$Y_{00} - Y_{90} = \beta_0 + \beta_1 T + \beta_2 \Delta P_{00-90} + \beta_3 \Delta I_{00-90} + \beta_4 \Delta O_{00-90} + \varepsilon$$


where the dependent variable ($Y_{00} - Y_{90}$) corresponds to the arithmetic difference
between the 2000 and 1990 Gini Concentration Ratios computed for family 
income by county. We estimate our models using OLS techniques. 
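
A pure-difference specification of this kind can be sketched as an ordinary least squares fit of the change in Gini on county-type indicators and change scores. The example below runs on simulated data only: the coding of county types, the single stand-in variable for each of P, I and O, and the coefficient values are all assumptions made for illustration, not the study's variables or estimates.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 400

county_type = rng.integers(0, 4, size=n)      # 0 = reference type (hypothetical coding)
type_dummies = np.eye(4)[county_type][:, 1:]  # three indicators, reference type omitted
d_pop = rng.normal(size=n)                    # stand-in for a change in P, 1990-2000
d_ind = rng.normal(size=n)                    # stand-in for a change in I, 1990-2000
d_other = rng.normal(size=n)                  # stand-in for a change in O, 1990-2000

# Design matrix: intercept, county-type indicators, then the change scores.
X = np.column_stack([np.ones(n), type_dummies, d_pop, d_ind, d_other])

# Made-up coefficients used only to generate the simulated outcome.
assumed_coefs = np.array([0.01, 0.00, 0.02, 0.015, 0.5, -0.3, 0.1])
d_gini = X @ assumed_coefs + rng.normal(scale=0.01, size=n)  # plays the role of Y_00 - Y_90

estimates, *_ = np.linalg.lstsq(X, d_gini, rcond=None)
for name, value in zip(
    ["intercept", "type 1", "type 2", "type 3", "dP", "dI", "dO"], estimates
):
    print(f"{name:>9}: {value: .4f}")
```

Because county type is fixed over the decade, it enters as a set of indicators, while the remaining predictors enter as 1990-2000 changes.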

To assess the robustness of our results to model specification, we tested
additional models, including a lagged-regressor model where $Y_{00}$ was regressed on
$Y_{90}$ and the change in independent variables, as well as a complete semi-difference
model in which $Y_{00} - Y_{90}$ was regressed on the change between 1990 and 2000 and
the level of independent variables in 1990. While the magnitude of the coefficients
differs, results do not change our substantive findings, especially the role of county
types and changing socioeconomic conditions on rural county inequality.


Independent Variables 


Rural County Types 

Table 2 summarizes the criteria used for the construction of our county typology. 
Established Hispanic Counties were at least 10 percent Hispanic in 1990. Rapid
Growth Hispanic Counties were less than 10 percent Hispanic in 1990 and saw
their percent Hispanic increase by more than 2.5 percentage points between
1990 and 2000. Rapid Growth Non-Hispanic Counties were less than 10 percent Hispanic
in 1990, saw their percent Hispanic increase by less than 2.5 percentage points
between 1990 and 2000, but experienced overall population growth exceeding
17 percent. Lastly, Slow Growth and Decline Counties were less than 10 percent 
Hispanic in 1990, saw their percent Hispanic increase by less than 2.5 percentage 
Table 2: Criteria for Non-metro County Typology

                                     % Hispanic    Change in %   % Growth in Total
                                     Composition   Hispanic      Population
County Type                          1990          1990-2000     1990-2000
Established Hispanic counties        ≥ 10
Rapid growth Hispanic counties       < 10          ≥ 2.5
Rapid growth non-Hispanic counties   < 10          < 2.5         ≥ 17
Slow growth & decline counties*      < 10          < 2.5         < 17

*Refers to the total county population, including any Hispanic population


points between 1990 and 2000, and experienced overall population growth of 
less than 17 percent. The cut-off points for Hispanic composition in 1990,
difference in percent Hispanic (1990-2000), and population growth (1990-2000)
correspond to the variable means plus one half their standard deviations. 
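
The classification rules can be written as a short function. This is a sketch of the criteria as described here, not the authors' code: the function names are hypothetical, values falling exactly on a cut-off are assigned to the higher category as an assumption, and the use of the population standard deviation in `cutoff` is likewise an assumption.

```python
import statistics


def cutoff(values):
    """Cut-off defined as the mean plus one half of the standard deviation.

    Assumption: the population standard deviation is used; the text does not
    say which estimator was applied.
    """
    return statistics.mean(values) + 0.5 * statistics.pstdev(values)


def classify_county(pct_hispanic_1990, change_pct_hispanic, pct_pop_growth,
                    hispanic_cut=10.0, change_cut=2.5, growth_cut=17.0):
    """Assign a non-metro county to one of the four types in the typology."""
    if pct_hispanic_1990 >= hispanic_cut:
        return "Established Hispanic"
    if change_pct_hispanic >= change_cut:
        return "Rapid Growth Hispanic"
    if pct_pop_growth >= growth_cut:
        return "Rapid Growth Non-Hispanic"
    return "Slow Growth and Decline"


# Hypothetical counties: (% Hispanic 1990, change in % Hispanic, % population growth)
examples = [(45.0, 1.0, 5.0), (3.0, 6.0, 12.0), (2.0, 0.5, 25.0), (1.0, 0.2, -4.0)]
for county in examples:
    print(county, "->", classify_county(*county))
```

The checks are applied in order, so a county with an established Hispanic population is never reclassified on the basis of its growth.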


Changing Population Composition: Education, Foreign-born Status and Family Structure 

To account for the impact of heterogeneity in population composition on
inequality, our statistical model includes measures of change in educational
composition, foreign-born representation, and demographic and family change 
in rural counties. We include two variables measuring 1990-2000 change in 
the share of the population with less than a high school education and at least 
a four-year college degree. We expect these variables to be positively associ- 
ated with growing inequality, relative to expansion at the intermediate levels of 
education, since they directly measure increased heterogeneity in educational 
endowments. Moreover, given relatively low average education levels among 
Hispanics, we expect growth of the less educated population to mediate the 
impact of Hispanic population growth on inequality. In addition, we include a 
measure of counties’ 1990-2000 change in foreign-born composition. Because 
immigrants typically occupy lower labor market positions, we expect a growing 
immigrant population to increase inequality. Finally, three variables capture 
the effect of changing age and family structure on income inequality. Change 
in the percent of the population age 65 and older and of female-headed house- 
holds are expected to positively affect income disparities since they increase the 
heterogeneity of low- and high-income groups. In turn, greater female labor
force participation, an important contribution to family income, is expected to
reduce income inequality. 


Changing Industrial Composition 

We measure change in non-metro county industrial composition by the difference 
between the 1990 and 2000 proportions of the employed population working 
in eight mutually exclusive, all-encompassing sectors. We derive these sectors by 
combining and expanding the 13 industrial sector categories in the United States: 
manufacturing is divided into durable and non-durable goods manufacturing, and
non-durable goods manufacturing is further divided by extracting a critical rural 
employer of low-skilled foreign-born workers, meat processing, which is roughly 
7 percent of that sector’s employment.

We group rural industries into those with declining and growing represen- 
tation to facilitate discussion of their effects. The former include: agriculture/ 
mining, non-durable goods manufacturing, durable goods manufacturing, and 
trade/wholesale/retail trade. The latter include: construction, meat processing, 
low-skilled services/transportation, and high-skilled services/communication/fi- 
nance/insurance/real estate/and public sector employment. We expect patterns 
of growth and decline to be central factors accounting for differential change in 
inequality across county types. 
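
The measure of industrial change is a difference in employment shares, which the following sketch makes explicit. It is illustrative only: it uses three made-up sectors rather than the eight used in the analysis, hypothetical employment counts, and a hypothetical function name, and it reports the 2000 share minus the 1990 share in percentage points.

```python
def share_changes(employment_1990, employment_2000):
    """Change in each sector's share of total employment, in percentage points."""
    total_1990 = sum(employment_1990.values())
    total_2000 = sum(employment_2000.values())
    return {
        sector: 100 * (employment_2000[sector] / total_2000
                       - employment_1990[sector] / total_1990)
        for sector in employment_1990
    }


# Hypothetical employment counts for one county.
emp_1990 = {"agriculture/mining": 300, "manufacturing": 500, "construction": 200}
emp_2000 = {"agriculture/mining": 250, "manufacturing": 450, "construction": 300}

changes = share_changes(emp_1990, emp_2000)
for sector, change in changes.items():
    print(f"{sector:>20}: {change:+.1f} percentage points")
```

Because the sectors are mutually exclusive and all-encompassing, the share changes sum to zero, so growth in one sector's representation is necessarily matched by decline elsewhere.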


Other Sources of Heterogeneity: Control Variables 

Our models control for other sources of heterogeneity that previous studies have 
found to explain cross-sectional inequality, in order to obtain net estimates of the 
effect of changes in socioeconomic characteristics of the labor force and industrial 
representation on inequality. To account for larger demographic trends in minor- 
ity composition, particularly in large areas of the rural South, we control for 1990- 
2000 growth of the black population. To control for economic climate apart from 
industrial composition, we include variables measuring the arithmetic change in 
the civilian non-institutional unemployment rate, log of median family income, 
and total 1990-2000 employment rate. Finally, we control for census region to 
account for geographic variation in inequality. Appendix I reports descriptive 
statistics for all variables by county type. 


Descriptive Results: Rural County Population Trajectories and Inequality


Trends in Inequality and Labor Force Characteristics across County Types


Trends in inequality differ considerably across county types, as seen in Figure 
1, which plots Gini coefficients in 1989 and 1999 across the typology. While 
inequality fell slightly (from initially high levels) in established Hispanic counties, 
Slow Growth counties witnessed modest increases in inequality over the period. 
Rapid Growth Hispanic and Rapid Growth Non-Hispanic counties, on the other 
hand, show substantial increases in inequality, with Gini coefficients rising 2.7 and 
2.5 percent, respectively, during the period. 

The county types also vary markedly with respect to changes in supply- and 
demand-side conditions, two constructs central to the independent variables 
in our analysis. Table 3 presents descriptive statistics for 1990-2000 changes in 
educational composition, percent foreign-born, age and family structure, and 
industrial composition over time. The right hand panel in Table 3 compares trends 
for each county type in reference to Slow Growth and Decline counties. 

Figure 1. Gini Coefficient for Families (County Averages, 1989-1999)

Overall, from 1990 to 2000, Established and Rapid Growth Hispanic counties
experienced a significantly slower improvement in educational composition
relative to Slow Growth and Decline counties. In the former, the decrease in the 
percent of the population with less than a high school education was significantly 
lower than in Slow Growth and Decline counties. Clearly, Hispanic in-migration 
to new areas of destination has expanded the pool of poorly educated inhabitants. 
The opposite applies to Rapid Growth Non-Hispanic counties, where the percentage
of the population with at least a four-year college degree increased significantly
faster than among Slow Growth and Decline counties. 

Table 3 also shows that among all rural counties, the share foreign-born increased
1 percentage point from 1990 to 2000. Established and Rapid Growth Hispanic
counties exhibited much higher growth than the other county types, 2.4 and 3.4
percentage points, respectively. Notably, the foreign-born population grew by a mere
0.6 percentage points in Rapid Growth Non-Hispanic counties, highlighting the different
population basis fueling change across types of rapidly growing rural counties.

Differences also appear for age and family composition trends. As expected,
the proportion of the population over age 64 declined most in rapidly growing
Hispanic and Non-Hispanic counties relative to slow-growth counties. At
the same time, female labor force participation increased more slowly in rapidly 
growing Hispanic counties relative to slow-growth counties. These additional
dimensions of heterogeneity are likely to mediate the effect of changing minority 

Table 3b: Socioeconomic, Industrial and Demographic Change across Rural County Types


Rate of Decline or Growth 
(Relative to Slow Growth & Decline Counties) 


Established 
Hispanic 
Supply Side Conditions: Population Change 
Change in Educational Composition 
Less than high school 
College or more 


Slower Decline 


% Foreign born Faster Growth 
Change in Age and Family Structure 

% Aged 65 and older 

% Female labor force participation Slower Growth 


% Female headed households 
Demand Side Conditions: Industrial Change 
Industries with Declining Representation 
Agriculture/ Mining 
Non-durable goods manufacturing 
Durable goods manufacturing 
Wholesale and retail trade 
Industries with Growing Representation 


Slower Decline 
Slower Decline 


Faster Decline 


Rapid Growth 
Hispanic 
Slower Decline 
Faster Growth 
Faster Decline 


Slower Growth 
Slower Growth 


Faster Decline 
Faster Decline 


Rapid Growth 
Non-Hispanic 


Faster Growth 


Faster Decline 
Slower Growth 


Faster Decline 
Faster Decline 
Faster Decline 
Faster Decline 


Construction Slower Growth Faster Growth Faster Growth 
Meat processing Faster Growth Faster Growth 

Low-skill services Faster Growth 
High-skill services/ Public sector Faster Growth 


end (i.e., high-skilled services) of the industrial sector distribution contributes to 
inequality while employment growth in middle-skill industries (i.e., manufactur- 
ing) reduces inequality. 

Table 3 shows that agriculture and mining, non-durable goods manufacturing, 
durable goods manufacturing, and wholesale and retail trade, have declined in 
rural counties. However, the change differed across county types with important 
implications for income inequality. Again, relative to Slow Growth and Decline 
counties, several trends are clear. Established Hispanic counties experienced a 
slower decline in agriculture/mining and non-durable goods manufacturing but 
a faster decline in trade activities. They also experienced a slower expansion of the 
construction industry and a slightly faster expansion of meat processing. Among 
Established Hispanic counties, the main sector gaining representation was high-skilled
services, driven mainly by rapid public sector expansion.

We emphasize the comparison of Rapid Growth Hispanic and Non-Hispanic 
counties. In general, relative to Slow Growth and Decline counties, rapidly grow- 
ing counties have experienced a much faster decline in the representation of de- 
clining industries in their economies. Rapid Growth Non-Hispanic counties saw 
particularly rapid declines in agriculture/mining, non-durable and durable goods
manufacturing, and trade. Similar if less pronounced trends occurred among
Rapid Growth Hispanic counties. 

Rapidly growing counties have also seen a substantial expansion of construction
industry employment share, registering a 6.8 and 8.8 percentage point
difference during the 1990-2000 period in Rapidly Growing Hispanic and Non- 
Hispanic counties, respectively, compared to only 5.2 percentage points among 
Slow Growth and Decline counties. Rapid Growth Non-Hispanic counties have 
also experienced a rapid expansion of low-skilled services while Rapid Growth 
Hispanic counties, in turn, have seen greater expansion of meat processing indus- 
try employment share and a slower expansion for high-skilled services. 

To summarize, these descriptive results show significantly lower educational 
improvement and considerably higher expansion of the foreign-born population 
in Established and Rapidly Growing Hispanic counties. These changes took 
place against substantial differences in employment shifts between 1990 and 
2000 for industries fueling economic expansion. Established Hispanic counties 
did not reduce their agricultural base as fast as other counties. Among rapidly 
growing Hispanic and Non-Hispanic counties, the construction industry acted 
as a leading sector attracting populations in both contexts. At the same time, 
meat processing industry expansion has been particularly pronounced among 
Rapid Growth Hispanic counties while low-skilled services experienced above 
average growth in Rapid Growth Non-Hispanic counties. We expect this com- 
plex articulation of change in socioeconomic characteristics of the labor force 
and industrial composition to account for cross-group differences in changes 
in inequality. 


Geographic Dispersion and Differences in Economic Basis: Case Studies 


To better illustrate the economic and demographic profiles of the rural areas in 
the typology, and the differential impact of industrial restructuring across types, 
we describe substantively several cases for each county type. 

Established Hispanic counties predominate in traditional rural Hispanic settle- 
ment areas of the Southwest. In fact, of the top 20 counties most characteristic 
of this group reported in Table 4, 14 are in Texas and the rest in just three states: 
New Mexico, Arizona and Colorado. Starr County, Texas is a typical example. It 
represents a Mexico-U.S. border region characterized by Colonia communities 
and an extraordinarily high Hispanic representation (97.5 percent). With a rela- 
tively impoverished industrial base in extractive industries, agriculture and social 
service employment, it has ranked for decades among the poorest U.S. coun- 
ties. Historical antecedents such as party machine politics and the separation of 
more fertile Brooks County in the early 20th century contributed to the county's
entrenched economic weakness. Proximity to Mexico explains why foreign-born 
individuals make up 37 percent of the population, more than three times the 
national average. It also explains why socio-demographic indicators such as low 
median age, large average household size, and low educational attainment and 
English language proficiency differ notably from other non-metro counties. Forces 

Table 4: Top 20 Counties Representing the Four-County Typology

FIPS Code  State           County Name  FIPS Code  State           County Name

Established Hispanic Counties
48427      Texas           Starr        48377      Texas           Presidio
48323      Texas           Maverick     48505      Texas           Zapata
48247      Texas           Jim Hogg     35047      New Mexico      San Miguel
48047      Texas           Brooks       48261      Texas           Kenedy
48507      Texas           Zavala       4023       Arizona         Santa Cruz
48131      Texas           Duval        48283      Texas           La Salle
35033      New Mexico      Mora         8023       Colorado        Costilla
48489      Texas           Willacy      48389      Texas           Reeves
35019      New Mexico      Guadalupe    35039      New Mexico      Rio Arriba
48127      Texas           Dimmit       48163      Texas           Frio

Rapid Growth Hispanic Counties
31037      Nebraska        Colfax       37061      North Carolina  Duplin
31047      Nebraska        Dawson       19021      Iowa            Buena Vista
40139      Oklahoma        Texas        5149       Arkansas        Yell
13313      Georgia         Whitfield    8045       Colorado        Garfield
13101      Georgia         Echols       20111      Kansas          Lyon
12027      Florida         DeSoto       16053      Idaho           Jerome
5133       Arkansas        Sevier       27165      Minnesota       Watonwan
13139      Georgia         Hall         53007      Washington      Chelan
20075      Kansas          Hamilton     51640      Virginia        Galax
13003      Georgia         Atkinson     27105      Minnesota       Nobles

Rapid Growth Non-Hispanic Counties
8039       Colorado        Elbert       12129      Florida         Wakulla
8093       Colorado        Park         25019      Massachusetts   Nantucket
16015      Idaho           Boise        8049       Colorado        Grand
8027       Colorado        Custer       16055      Idaho           Kootenai
21215      Kentucky        Spencer      29213      Missouri        Taney
13085      Georgia         Dawson       8047       Colorado        Gilpin
8053       Colorado        Hinsdale     41017      Oregon          Deschutes
8119       Colorado        Teller       13311      Georgia         White
8091       Colorado        Ouray        29209      Missouri        Stone
49021      Utah            Iron         2170       Alaska          Matanuska-Susitna

Slow Growth Counties
32009      Nevada          Esmeralda    38007      North Dakota    Billings
38013      North Dakota    Burke        30033      Montana         Garfield
54047      West Virginia   McDowell     46063      South Dakota    Harding
32021      Nevada          Mineral      38047      North Dakota    Logan
38023      North Dakota    Divide       38001      North Dakota    Adams
38041      North Dakota    Hettinger    38075      North Dakota    Renville
38095      North Dakota    Towner       38043      North Dakota    Kidder
38083      North Dakota    Sheridan     40129      Oklahoma        Roger Mills
38019      North Dakota    Cavalier     31087      Nebraska        Hitchcock
38037      North Dakota    Grant        20065      Kansas          Graham


contributing to reduced income inequality during the 1990s include a 30 percent 
population increase and declining agricultural employment. 


In contrast, Mora County, New Mexico represents an Established Hispanic 
county with a far more stable population. Despite its overwhelming Hispanic 
representation (81.6 percent), foreign-born residents make up a remarkably low 
proportion of the total population (1.7 percent), thereby illustrating 
sociodemographic convergence over time with the broader, mostly non-Hispanic U.S. non- 
metro population. Median age and average household size also strongly resemble 
the demographic profile of non-metro counties as a whole. While the county's 
population grew 20 percent during the 1990s, it also became less Hispanic. 
Declining proportions of high school dropouts, reduced agricultural employment 
and greater high-skilled service employment helped reduce income inequality in 
the years 1990-2000. 

Slow Growth and Decline counties also concentrate geographically, most vis- 
ibly in the persistent population loss areas of the northern and central Great 
Plains, with 13 counties in North Dakota alone (Johnson and Rathge 2006). This 
group's large number (1,503 counties) highlights the challenges to demographic 
vitality that rural areas have confronted for decades. For example, Logan County, 
North Dakota saw its total population decline from 2,847 to 2,308 from 1990 
to 2000, and its tiny Hispanic population remain unchanged. Like many North 
Dakota counties, its population has declined consistently since at least the 1950s, 
ironically from improvements in agricultural productivity that reduced demand 
for farm labor. Yet as young adults and their families migrate to urban areas for 
jobs that reward their increasing educational attainments, they leave a population 
that, while not impoverished, continues to both diminish and age in place. Other 
counties have lost population from changes in the global economy. McDowell 
County, West Virginia, whose population quintupled to about 100,000 by 1950 
and which consistently set coal production records, now leads West Virginia coun- 
ties in population decline and poverty. 

The 283 Rapid Growth Hispanic counties are widely dispersed and reflect areas 
with above-average employment opportunities. In the Midwest and Southeast, for 
example, new Hispanic population growth correlates directly with the rural expan- 
sion and transformation of beef and poultry processing since 1980 (Kandel and 
Parrado 2004, 2005). In the Pacific Northwest, Rapid Growth Hispanic counties are 
closely tied with fruit and vegetable production. In mountain states with booming 
vacation and retirement communities, such as Idaho and Colorado, Hispanic pres- 
ence reflects growing service and construction employment. In certain non-metro 
counties in Georgia, North Carolina and Texas, Hispanic communities reflected la- 
bor demand from construction booms in neighboring metropolitan areas in Atlanta, 
Raleigh-Durham, Dallas-Fort Worth and Houston, respectively. 

In Echols County, Georgia, only 8 percent of adults possess a college degree and 
the poverty rate in 2006 stood at 30 percent. Nevertheless, labor force participa- 
tion rates match national averages, indicating a relatively large proportion of work- 
ing poor. In fact, family income inequality in Echols County rose rapidly during 
the 1990s, caused in part by a shift from seasonal crops such as cotton, corn and 
peanuts, to year-round crops including bell peppers, tomatoes, squash and carrots. 
Consequently, former transient migrant workers have settled permanently, swelling 
the Hispanic population since 1990 from 2 to more than 20 percent of the 
county's population. By contrast, the growth of the Hispanic population from 2 
to 15 percent in Duplin County, North Carolina during the same period stemmed 
largely from manufacturing employment in meat and food processing. In addition, 
the employment shares of two low-paying sectors, low-skilled services and 
construction, increased by 7 and 10 percentage points respectively. 

Map 2: Rapid Growth Non-Hispanic Counties 

Finally, the 282 Rapid Growth Non-Hispanic counties shown in Map 2 and list- 
ed in Table 4 illustrate the geography of recreation and retirement-related non-metro 
population growth. Counties located in mountainous areas of Idaho, Montana and 
Colorado, desert areas of Utah, or wooded areas of northern Michigan, for example, 
score high on the Natural Amenities Index developed by the USDA's Economic 
Research Service (2004) that measures climate and geophysical variation. Other such 
counties are concentrated in the Southeast and Gulf Coast. 

Grand County, Colorado boasts a dozen national parks, forests, wilderness 
areas and year-round outdoor recreation. Steady population growth from young 
family in-migration and recreation seekers has turned the county into a bedroom 
community of Denver. Not surprisingly, construction and low-skill services 
showed substantial employment growth during the 1990s. As the median age 
increased from growth in the baby boomer cohort, income inequality also increased. 
During this period median home prices almost doubled to $205,000, and median 
incomes increased 50 percent to more than $55,000, exceeding the national aver- 
age. Similarly, Deschutes County, Oregon enjoys spectacular scenery, proximity 
to national parks, numerous golf courses, and a temperate climate that offers 
rain-weary Portland and Seattle in-migrants sunny, year-round recreation. The 
county seat of Bend, which has reaped substantial benefits from stringent growth 
and environmental regulations approved in Oregon during the 1970s, frequently 
appears on popular lists of the most desirable places to live. Outside of Bend, more 
affordable real estate markets cater to rapidly growing Hispanic and non-Hispanic 
communities whose residents often work in construction and low-skilled services. 


Multivariate Results 


The following multivariate analysis links these transformations to changes in coun- 
ty inequality. Table 5 reports results from three OLS models predicting 1990-2000 
change in the county-level Gini coefficient. Following our theoretical discussion, 
all models include indicators for our four-county typology as well as controls for 
additional sources of heterogeneity. Model 1 estimates the effect of changes in 
the socioeconomic characteristics of the labor force. Model 2 estimates the role of 
industrial change without controls for labor force characteristics. Model 3 includes 
all predictors. In general, results are consistent across models. 
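
To make the dependent variable concrete, the following minimal sketch computes a Gini coefficient from a list of family incomes and then the change between two years. It is an illustration only: the article does not describe its computation here (county-level estimates are usually derived from grouped census income data), and the function name `gini` and the toy income lists are assumptions introduced for this example.

```python
def gini(incomes):
    """Gini coefficient of non-negative incomes: 0 means perfect equality."""
    values = sorted(incomes)
    n = len(values)
    total = sum(values)
    if n == 0 or total == 0:
        return 0.0
    # Rank-weighted formula: G = 2 * sum(i * x_i) / (n * sum(x)) - (n + 1) / n,
    # where x_i are incomes sorted in ascending order and i runs from 1 to n.
    weighted = sum(rank * x for rank, x in enumerate(values, start=1))
    return 2.0 * weighted / (n * total) - (n + 1.0) / n


# Hypothetical incomes (in thousands of dollars) for one county in two years.
incomes_1990 = [20, 25, 30, 35, 40]
incomes_2000 = [20, 25, 30, 35, 90]

# The outcome modeled in the text is the 1990-2000 change in the county Gini.
change_in_gini = gini(incomes_2000) - gini(incomes_1990)
```

In this toy case only the top income grows, so `change_in_gini` is positive, which is the pattern the text associates with growth at the upper tail of the distribution.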

Estimates for the effect of our four-county typology on changes in inequality 
are of central importance for understanding the unique effect of rapid Hispanic 
population growth in new areas of destination. Overall, results show that inequal- 
ity trends in Rapid Growth Hispanic counties do in fact differ from Slow Growth 
and Decline counties. Even after controlling for all our independent variables 
(Model 3), the change in Gini coefficient 1990-2000 among Rapid Growth 
Hispanic counties is .62 points higher than among Slow Growth and Decline 
counties. However, their trajectory is actually quite similar to that experienced by 
Rapid Growth Non-Hispanic counties, for which the change in Gini coefficient is 
.64 points higher than among Slow Growth and Decline counties. The similarity 
in trends is consistent with the description presented in Figure 1. The fact that the 
effects remain significant after controlling for all the model indicators reinforces 
the association between growth, heterogeneity and inequality in rural areas. 

Consistent with our expectations, growing heterogeneity in human capital en- 
dowments is a central dimension accounting for trends in rural inequality (Model 1) 
even after accounting for changes in industrial composition (Model 3). Full model 
estimates show a particularly strong effect for the proportion of the population 
with college or more education. Results indicate that a 1 percentage point increase 
(or slower decline) in the proportion of the population with less than high-school 
education increases the change in Gini coefficient by .068; for college or greater, the 
effect is .279. The significance of this difference is reinforced by the standardized 
estimates that show the effects to be .079 for the proportion of the population with 
less than high-school compared to .189 for the proportion with college or more. 
The difference in size highlights the importance of growth of the highly educated 
population to understanding trends in inequality across rural counties. 
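
The comparison between raw and standardized estimates can be sketched as follows. A standardized coefficient is conventionally the raw coefficient rescaled by the ratio of the predictor's standard deviation to the outcome's; the standard deviations themselves are not reported in this passage, so the sketch only backs out the ratio implied by the figures quoted above. The function names are hypothetical.

```python
def standardized_coefficient(b, sd_x, sd_y):
    """Conventional rescaling: beta = b * sd(x) / sd(y)."""
    return b * sd_x / sd_y


def implied_sd_ratio(b, beta):
    """Ratio sd(x) / sd(y) implied by a raw and a standardized estimate."""
    return beta / b


# Raw and standardized estimates as quoted in the text (full model).
less_than_hs = implied_sd_ratio(b=0.068, beta=0.079)
college_plus = implied_sd_ratio(b=0.279, beta=0.189)
```

Under this convention the implied ratio is above 1 for the less-than-high-school share and below 1 for the college share, meaning the college effect is the larger of the two on both the raw and the standardized scale even though its predictor varies less across counties.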

At the same time, change in the size of the foreign-born population does not 
appear to have an independent effect on inequality, suggesting that much of the 
association between immigrant status and inequality is captured by other sources 
of heterogeneity, including education. Relative increases in the population over 
64 and in the percent of female-headed households also contribute to inequality. 
Faster declines among the older population in rapidly growing Hispanic and non- 
Hispanic counties imply this effect is mitigated in growing rural areas. Growing 
female labor force participation also significantly reduces family income inequality. 
Results (Model 3) indicate that a 1 percentage point increase (or slower decline) 
in female labor force participation reduces the 
change in Gini coefficient by .138, and the standardized estimate is -.159, similar 
in size to the effect of changes in the proportion of the population with a college 
degree or greater. That female employment increased the least in Rapidly Growing 
Hispanic counties implies that part of the growing trend in family inequality in 
these counties reflects lower rates of female labor force participation. 

Model 2 estimates the effect of industrial change independent of changes in 
socioeconomic characteristics. As before, results from Model 2 and from Model 3, which 
includes all predictors, are remarkably consistent. The reference category is percent 
change in non-durable goods manufacturing employment which is expected to 
have the most equalizing effect in rural areas. Overall, our analysis supports the as- 
sociation between manufacturing employment and reduced inequality. Relative to 
change in non-durable goods manufacturing, 
all other industrial sectors significantly 
contribute to inequality, except for durable 
goods manufacturing and meat processing. 
Results also show that among industries 
with declining representation agriculture 
stands out as a key source of inequality. 
The positive effect of changing agricultural 
representation (.115) on the Gini 
coefficient indicates that inequality increased 
the most among counties with slower de- 
clines in agricultural employment, mainly 
Established Hispanic counties (see Table 
3). Changes in wholesale and retail trade 
also contributed to inequality (.086) rela- 
tive to non-durable goods manufacturing. 
Among industries with growing repre- 
sentation, changes in construction, low- 
skilled services and high-skilled services 
have fueled inequality. Construction is 
particularly important; estimates from 
Model 3 show that a 1 percentage point 
increase in construction employment in- 
creases the Gini coefficient by .119. In 
fact, the estimated standardized coeffi- 
cients (last column of Model 3) show that 
change in the construction industry gener- 
ates the largest inequality effect relative to 
non-durable goods manufacturing (.198) 
closely followed by agriculture (.188). 
Results for other industries with growing 
representation show that the growth of 
the service industry, both low- and high- 
skilled, also contributed to inequality rela- 
tive to non-durable goods manufacturing. 
The standardized estimate for the effect of 
low-skilled services shows that the size of 
its impact is second among industries with 
growing representation (.086). 

Again, the significance of these forces 
varies considerably across county types. 
Construction increased very rapidly in 
Rapid Growth Hispanic and Rapid Growth Non-Hispanic counties, although the 
growth was much more pronounced in the latter. The same applies to low-skilled 
services, which tended to increase in all counties but grew substantially more in 
Rapid Growth Non-Hispanic counties. 

Finally, controlling for other sources of heterogeneity shows that, consistent 
with findings from previous studies, change in family income is the main factor 
associated with inequality reduction, reinforcing the importance of broader pro- 
cesses of socioeconomic change for inequality. 
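
As a reading aid for the coefficients discussed in this section, the sketch below multiplies each quoted estimate by a percentage-point change to obtain its linear contribution to the change in the Gini coefficient. Only the four coefficients quoted in the text are used; the county and its changes are invented for illustration, and the industry term should be read relative to the non-durable goods manufacturing reference category. This is arithmetic on the reported estimates, not a re-estimation of the model.

```python
# Unstandardized estimates as quoted in the text (percentage-point predictors).
QUOTED_COEFFICIENTS = {
    "less_than_high_school": 0.068,
    "college_or_more": 0.279,
    "female_labor_force_participation": -0.138,
    "construction": 0.119,
}


def predicted_contributions(changes):
    """Linear contribution of each predictor: coefficient * change in points."""
    return {
        name: QUOTED_COEFFICIENTS[name] * delta
        for name, delta in changes.items()
    }


# Hypothetical county: percentage-point changes between 1990 and 2000.
hypothetical_county = {
    "college_or_more": 2.0,
    "female_labor_force_participation": 3.0,
    "construction": 1.5,
}

contributions = predicted_contributions(hypothetical_county)
net_effect = sum(contributions.values())
```

In this invented case the rise in female labor force participation offsets a sizable part, but not all, of the inequality-increasing effects of college-educated and construction growth.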


Discussion 


Rapid Hispanic population growth in all regions of the United States and particu- 
larly in new midwestern and southeastern destinations has drawn an extraordinary 
level of popular and policy attention. Some of this concern relates to the impact 
of high levels of migration on the social organization, including inequality, of 
local communities. A long tradition in the social sciences considers the impact of 
population change on a wide array of outcomes, including inequality. One view 
holds that Hispanic population growth will result in increased inequality because 
it increases and concentrates the number of low-skilled, poorly paid workers, which 
then heightens wage competition among low-skilled workers, driving wages down 
and poverty up. An alternative view argues that the human capital distribution 
of local areas is a response to market forces rather than a determinant of them. 
That is, larger institutional, structural and macroeconomic factors drive inequality 
patterns to a greater degree than the human capital characteristics of migrants. 

The key to adjudicating these two views is to separate the impact of Hispanic 
growth per se from overall processes generating population growth and development 
more generally. Borrowing from urban-based research, our study takes a 
counterfactual approach, comparing inequality and its correlates across established 
Hispanic, rapidly growing Hispanic, rapidly growing non-Hispanic, and slow- 
growing or declining rural counties. Our results show that income inequality 
did indeed increase more rapidly in new rural Hispanic destinations during the 
1990s compared to slowly growing or declining counties. However, inequality 
was no greater in rapidly growing Hispanic areas than in areas that were rapidly 
growing without Hispanic migrants. Multivariate results show that both supply 
and demand characteristics affected inequality. In particular, rapid growth was 
associated with increased representation of both tails of the educational distribu- 
tion; thus growth in the highly educated workforce was a central element of rising 
inequality in rapidly growing areas. After controlling for education, foreign-born 
population growth had no independent effect on inequality. 

Industrial change also contributed to inequality above and beyond supply- 
side changes in human capital. Consistent with research stressing industrial 
restructuring for understanding inequality, we find that all industrial sector 
growth contributed to inequality relative to manufacturing. Agricultural and 
construction industries have been especially important. That agriculture still 
attracts Hispanics to rural areas, including new destinations, partly explains the 
association between Hispanic population growth and rural inequality. Likewise, 
rapid expansion of the construction industry is an important source of inequal- 
ity in rapid-growth rural counties. 

Thus, consistent with structural interpretations, our results imply that growth 
in general engenders widening income inequality, not Hispanic growth per se. 
Most rural counties struggle to maintain their population size and economic 
dynamism. The counterfactual comparison suggests that stagnant counties do 
not experience increased inequality. However, counties that grow economically 
and demographically can expect considerable increases in inequality, irrespective 
of whether population growth is attributable to Hispanics. 

These findings also have implications for the future of rural America. Population 
change portends significant racial, ethnic and socioeconomic change for rural 
areas, raising issues of ethnic conflict, immigrant adaptation and public sector 
burdens. The emphasis on race and ethnicity rather than socioeconomic origin in 
affecting social outcomes undergirds much negative representation of these 
population trends in the United States (Huntington 2004). Our study, however, calls 
into question this interpretation by highlighting that general economic processes 
associated with growth, rather than race/ethnicity, account for substantial changes 
in rural income inequality. To the extent that our findings can be generalized to 
other social effects, they suggest that migrants’ ethnic origin might be accidental 
in explaining social ills in rural areas and that a focus on class and industrial struc- 
ture might be more useful for understanding the trajectory of rural development, 
Hispanics’ socioeconomic outcomes and the forces driving population change. 


Notes 


1. “Nonmetropolitan” areas follow the Office of Management and Budget definition 
based on population and commuting patterns. A metropolitan area consists of one or 
more core counties with an urbanized area of 50,000 or more inhabitants, together 
with surrounding counties with metropolitan characteristics such as commuting 
patterns and population density and growth. Nonmetropolitan areas consist of all 
other counties and contain only open country, small towns or small cities. The term 
“nonmetropolitan” is distinct from “rural,” which refers to a U.S. Census Bureau 
definition for places with fewer than 2,500 inhabitants. In this article, however, we 
use the term “rural” in its general context. 


2. Regions are census regions, except for the Southwest which borrows from the West 
and the South and consists of Arizona, California, Colorado, New Mexico and Texas. 


3. We also tested an extended six-group typology that distinguished, within Established 
Hispanic counties, those with constant and declining Hispanic representation and, 
within Slow Growth and Decline counties, those with stable and declining populations. 
The extended typology did not illustrate inequality trends more effectively than our 
more parsimonious typology. 
4. Estimates are available upon request. 


5. We did so by computing the ratio of meat processing to non-durable goods 
manufacturing employment using County Business Patterns data for 1990 and 2000 
and applying this ratio to U.S. Census data for the respective years. We use the same 
process with CBP data to distinguish nondurable from durable goods manufacturing 
employment for the 2000 SF3 data which, unlike the 1990 STF3 data, do not make 
this distinction. 
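
The apportioning step described in this note can be sketched as follows: a share is taken from County Business Patterns (CBP) counts and applied to the corresponding census employment total. The function name, argument names and the numbers are hypothetical, and the handling of a zero CBP denominator is an assumption, since the note does not say how such counties were treated.

```python
def apportion_by_cbp_ratio(census_total, cbp_part, cbp_total):
    """Estimate a census sub-sector count from the matching CBP share.

    census_total: census employment in the broader sector
    cbp_part:     CBP employment in the sub-sector of interest
    cbp_total:    CBP employment in the broader sector
    """
    if cbp_total == 0:
        # Assumption: with no CBP employment in the broader sector,
        # nothing is allocated to the sub-sector.
        return 0.0
    return census_total * (cbp_part / cbp_total)


# Hypothetical county: CBP reports 300 meat processing jobs out of 1,000
# non-durable goods manufacturing jobs; the census counts 1,200 workers
# in non-durable goods manufacturing.
meat_processing_estimate = apportion_by_cbp_ratio(
    census_total=1200, cbp_part=300, cbp_total=1000
)
```

The same function would serve for the second use mentioned in the note, splitting total 2000 manufacturing employment into durable and non-durable goods by the CBP share.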
Risk Groups in Exposure to Terror: 
The Case of Israel's Citizens 


Yariv Feniger, Ben-Gurion University of the Negev 
Ephraim Yuchtman-Yaar, Tel Aviv University 


This research addresses a largely ignored question in the study of terror: who are its 
likely victims? An answer was sought through analysis of comprehensive data on 
civilian victims of terror in Israel from 1993 through 2003. The chances of being 
killed in seemingly random terror attacks were found unequally distributed in Israeli 
society, but the weaker sectors were not the most vulnerable. This pattern may be 
attributed to the perpetration of most terror attacks in public places, where mem- 
bers of underprivileged groups are less likely to be. Paradoxically, ethnic segregation, 
gender and other forms of social exclusion and inequality may have helped to protect 
marginalized social groups from the risk of terror victimization. 





Introduction 


Terror is not a new feature in the modern world, but its scope has increased drasti- 
cally in recent years, especially at the international level. Today it is global, often 
targeting innocent civilians. A prominent scholar in the field has gone so far as to 
propose analyzing terror as a “routine” phenomenon, inseparable from the real- 
ity of our lives (Horowitz 1983). The escalation and far-reaching consequences 
of terror are reflected in the growing interest in the subject among researchers. 
Yet with few exceptions this research has hardly concerned itself with the social 
composition of the victims. A recent review by Turk (2004) indicates that social 
scientists researching terrorism have been mostly concerned with various aspects 
of terror organizations: their structure, political ideology, modes of operation, and 
processes of recruitment and socialization. Attention has also been paid to the 
ways in which states try to contain the spread of terror and to its coverage by the 
media. Psychologists and social workers deal with terror victims extensively. Yet 
here the focus is on the victims’ and their relatives’ coping with the physical and 
psychological damage and the experience of loss. 

In terms of sociological theory, our question can be placed in the context of so- 
cial stratification and inequality. Cumulative research in this field has consistently 
shown that modern society is characterized by considerable gaps in the structure 
of opportunities, and that, among other things, these gaps affect its members’ stan- 
dard of living, quality of life and life expectancy. As for the latter, social variables 
can apparently be considered health risk factors. Williams and Collins (1995:280- 
81) assert that ethnic inequality is “inarguably the single most important health 
issue” in the United States. More relevant to our concern is the close relationship 
between social class and unnatural death resulting from acts of murder, whose 
frequency is relatively higher among marginal and disadvantaged groups in many 
contemporary societies (e.g., Pridemore 2002; Messner and Rosenfeld 1999). 
The present research thus asks whether marginalized groups in Israeli society are 
relatively more at risk of exposure to politically motivated random terror, and 
whether the characteristics of these groups overlap “classic” variables that affect 
marginality such as gender, age, ethnicity and socioeconomic status. 

No agreed definition of terror exists in the political and academic community. 
For the purposes of the present discussion it is defined as any violent act directed 
at a civilian population to achieve political goals. This definition accords with 
the approach adopted by most students of terrorism (for a short review see 
Goodwin 2006). 


Conceptual Considerations 


From a sociological perspective the neglected question of “who are the victims 
of terror” is highly relevant, particularly in places where it has been an ongo- 
ing phenomenon, thus potentially becoming a significant threat to the lives of 
individual members of society. Our main argument is that, in its consequences, 
terror is socially structured: certain groups in society are at higher risk than oth- 
ers of exposure to terror attacks. The nature of such risk groups can be largely 
understood through two main variables, one taken from the world of terror and 
the other from the realm of society. 

The first variable pertains to the distinction among types of terror according 
to their level of selectivity. Selective terror is aimed at specific groups or 
individuals, whereas unselective terror treats any group or individual who belongs to the 
“enemy camp” as a legitimate object of attack. In practice there can be different 
levels of selectivity, but the dichotomous division serves to sharpen the distinc- 
tion between the two strategies. As recently discussed by Goodwin (2006:2031), 
unselective (“categorical”) terrorism “... is directed against anonymous individu- 
als by virtue of their belonging (or seeming to belong) to a specific ethnic group, 
nationality, social class or some other collectivity. This type of terrorism... is typically 
called indiscriminate or ‘random’ terrorism because it makes no distinctions 
among the individual identities of its targets.” 

Drawn from the sociological discourse, the second variable is embedded in 
the distinction between public and private spaces. By “public space” we refer 
to the variety of sites and places outside the home where the general public, or 
parts of it, tend to be present for various purposes. The more open the places and 
the wider the range of people they serve, such as shopping centers and public 
transportation, the more “public” the nature of the space. The public nature of 
certain spaces is relatively limited, such as exclusive clubs or protected residential 
neighborhoods that prevent entry by the general public. The classic private space, 
of course, is the home. 

Following this distinction, we suggest that the degree to which people are pres- 
ent in the public space is not a matter of chance alone. Rather, it is systematically 
affected directly or indirectly by some major social characteristics which determine 
a person's position in the social structure, such as gender, age, ethnicity and socio- 
economic status. For example, men participate in the labor force at higher rates 
than women, so their presence in the public space is likely to be proportionately 
higher. Similarly, poorer people are less likely to participate in leisure and enter- 
tainment activities because they cannot always afford to pay for them. Accordingly, 
we propose that position in the social hierarchy is a significant determinant of the 
risk of exposure to unselective terror. This proposition is tested in the context of 
Israeli society, which has been frequently exposed to this type of terror, particularly 
since the early 1990s. 


Research on the Social Attributes of Terror Victims 


A salient exception to the above-mentioned sparse research on the characteristics 
of terror victims is a series of studies conducted in Northern Ireland. For example, 
Fay, Morrissey and Smyth (1999) provide extensive data on victims’ distribution 
by certain basic demographic attributes such as age and gender, religious affiliation, 
and place of residence. A distinction is also made between victims who belong to 
the security forces, paramilitary organizations or the civilian population. However, 
the authors do not provide separate demographic analyses for civilian and other 
casualties. In an earlier study Hewitt (1988) analyzed the economic and human 
cost of terrorism in six countries: Uruguay, Northern Ireland, Spain, Germany, 
Italy and Cyprus. As to the attributes of the fatalities, a similar pattern was ob- 
tained in all six countries with respect to age (most victims were ages 20-39 years) 
and gender (more than 80% were men). These findings might be related to the 
fact that the victims consisted largely of members of the security forces. Like Fay 
et al., Hewitt does not provide separate data on the attributes of the civilian victims. 

A systematic study that addresses some of the lacunae in research on victims 
of terror was recently reported by Canetti-Nissim, Mesch and Pedahzur (2006). 
These authors investigated Israeli citizens killed in terror attacks from 1993 to 
2002. They compared suicide bombings with other types of terror attacks with 
respect to different attributes of the fatalities. The data analysis, based on a series of 
logistic regressions, indicates that residents within the 1967 borders, students (at 
school or university), women, unmarried people, passersby and the less religious 
were struck more in suicide bombings than in other types of attack. However, 
Canetti-Nissim et al. did not probe the connection between social attributes and 
the chances of being a victim of either type of terror. This potential connection is 
explored in the present study. 
The protracted Israeli-Palestinian conflict involves harsh and violent struggles that 
have claimed a large number of victims on both sides—soldiers as well as civilians. 
One conspicuous manifestation of the violence has been the systematic use of ter- 
ror by radical Palestinian organizations, such as Hamas and the Islamic Jihad, as a 
major means of attaining their goals. This policy actually intensified following the 
signing of the Oslo Accords between the Israeli government and the Palestine 
Liberation Organization in September 1993. From that month until the beginning 
of the second Palestinian uprising (Al-Aqsa) at the end of September 2000, 
281 Israelis were killed in terror attacks, most of them (about 70%) civilians. The 
number of victims grew considerably after the outbreak of the uprising, and by 
November 2003 about 1,000 victims had been added to the list, also mostly from 
the civilian population. The number of wounded was higher. 

In their efforts to undermine the morale of the civilian population, these terror 
groups adopt a strategy of unselective attacks on crowded places such as buses, 
restaurants and shopping centers, so that the attacks reverberate widely 
because of the large numbers of civilian victims they claim. Indeed many attacks 
are accompanied by shocking images on newspaper pages and television screens. 
This strategy implies that with the probable exception of Israel’s Arab-Palestinian 
minority, the entire Israeli Jewish population serves as a legitimate target, regard- 
less of who the potential victims are. As vividly depicted by Schweitzer (2006:33), 
one Palestinian woman who was arrested shortly before she was due to carry out 
a suicide attack said, “I calculated how many Jews I should kill for it to be consid- 
ered a success, and for it to be worth my death. How many? As many as possible...” 

This strategy of unselective terror can be contrasted to the strategy applied in 
other terror-afflicted areas, such as Northern Ireland. There, most attacks were aimed 
at members and associates of the rival paramilitary organizations, along with attacks 
by Catholic organizations against the security forces (Fay et al. 1999). To be sure, the 
terror attacks in Northern Ireland claimed the lives of many innocent civilians. But 
compared with the Israeli experience, the scope of such incidents was much more 
limited and often advance warning was given to prevent the killing of innocent 
people. The consequences of the two strategies are set out below. 


Data 


Data for this research cover all Israeli civilians killed by acts of terror within the 
1967 borders of Israel from the signing of the Oslo Accords in September 1993 
to the end of 2003. We chose the Oslo date as our starting point mainly because 
the ensuing period witnessed a considerable increase in the scope of terror attacks, 
as well as innovation in the modes of terror, especially the phenomenon of attacks
perpetrated by suicide bombing. We excluded from the study civilians killed in 
the occupied territories of the West Bank and Gaza Strip. The Israeli population 
in these territories is protected by the military forces, and as such is perceived by
the Palestinians as part of the mechanism of occupation. Also omitted from the 
list are foreign civilians killed in the terror attacks (about 40 people in the years 
under study) for lack of reliable information. Members of the security forces killed 
within the 1967 borders were included because most of them died in terror at- 
tacks against obviously civilian targets. In the rare cases where it was clear that the 
attack was carried out against a military target (e.g., a military post on the border 
between Israel and the Gaza Strip) the fatalities were not included in the list. 

Because reliable data on those injured in terror attacks were unattainable, the 
study deals with fatalities only. But being injured or killed in such attacks is 
usually a matter of chance, depending, for example, on where one happens to be
seated at the instant of an explosion on a bus or in a restaurant. Accordingly, we 
assume that the fatalities can be viewed as representing the entire population of 
the casualties of terror. 

The list of victims is based on several sources: nongovernmental websites that 
give a detailed account of terror attacks, official Israeli commemoration sites, and 
the daily print media. Each terror attack and each victim was examined using at 
least two sources of information in order to verify the relevant details. In most 
cases, the perpetrators were killed or captured, and a certain Palestinian organiza- 
tion took responsibility for the attack. This allowed us to label the incident as a 
terror attack. In rare cases in which the perpetrators remained unknown and no 
terror organization claimed responsibility for the attack we did not include the 
victims in the list. The list of names and identifying details of the fatalities that we 
prepared was transferred to the Central Bureau of Statistics (CBS) to be collated with
data of the 1995 Census of Population and Housing. Of the 638 names on our 
list, 621 matched the CBS data. 

We used these data to obtain information on gender, age, ethno-religious group 
(Jewish/Arab), year of immigration to Israel and socioeconomic status for each 
victim. Because the 1995 census provided detailed information on socioeconomic 
status (level of education, occupation and income) for only 20 percent of the 
Israeli population, this information was available for just a small number of the 
fatalities (about 10% of the victims). Still, for about 62 percent of the cases we 
were able to obtain socioeconomic data on the aggregate level according to the 
place of residence. This variable concerns the ranking of the statistical area in 
which the person lived. Each of these areas, which is relatively homogeneous in its 
population composition, was ranked by the socioeconomic level of the residents, 
based on detailed information collected in the 1995 census (CBS 2000). 

Regarding two variables, gender and age, we were able to compare victims in 
Israel and Northern Ireland. The results for the latter are based on Sutton’s Index 
of Deaths (Sutton 2005), which includes fatalities from the conflict in Northern 
Ireland from 1969 to 2001. For comparison with our data, we refer to fatalities
only among civilians.

Table 1: Terror Victims in Israel 1993-2003 by Type of Place of Terror Attack

                                  % among Victims
Type of Place of Terror Attack    (N = 638)
Public transportation             52.0
Dining and entertainment          22.6
Shopping areas                    14.6
Other                             10.8
Total                             100.0





Results 


Table 1 shows the distribution of victims according to the type of place in which 
the terror attack was perpetrated. We distinguished between four categories of 
places. The first category includes terror attacks on public transportation (e.g., 
buses or bus and train stations). Most of these terror attacks were executed by
suicide bombers. The second category relates to terror attacks which took place 
in dining and entertainment sites. These include restaurants, cafes, hotel dining 
rooms and night clubs. Here again, most attacks were perpetrated by suicide 
bombers. The third category consists of attacks which occurred in shopping 
areas such as open markets, shopping malls, and main streets and pedestrian- 
only streets in city centers. These attacks were executed by means of suicide 
bombers, hidden bombs or shooting. The fourth category includes other sites
of terror attacks such as private houses, side streets, workplaces and roads.
According to Table 1 about 52 percent of the victims were killed in attacks on 
Israeli public transportation. Almost 23 percent of the victims were killed in 
dining and entertainment places and about 15 percent were killed in shopping 
areas. Only some 11 percent of the casualties were killed in places which are 
not obvious public spaces. 


Gender 


Table 2 shows that about 43 percent of the Israeli fatalities in terror attacks were 
women and about 57 percent were men. This finding is compatible with previous 
studies on the attributes of terror victims which show that men are struck more 
than women (e.g., Fay et al. 1999; Hewitt 1988). However, most of those studies 
deal with terror directed at civilians as well as at members of the security forces 
or activists in rival organizations, namely target groups in which men form the 
large majority. Because our own research concerns civilian victims, it is difficult to 
explain why Israeli men are struck more than women unless we assume that men 
are more likely to be present in the public space. Women may indeed spend more 
time than men on shopping and may be more dependent on public transportation, 
but they participate in the workforce at lower rates than men (Alon 2004), work 
more at part-time jobs (Central Bureau of Statistics 2004) and at home (Felstead 
and Jewson 2000), and spend more time at home with the children. 


Table 2: Terror Victims in Israel 1993-2003 and in Northern Ireland 1969-2001 by Gender

Notes: Significance of difference between terror victims and general population in
Israel: χ² = 15.8, df = 1, p < .001.

Percentages in the general population in Israel were calculated according to data of
the CBS Statistical Yearbook for 2004. Percentages in the general population in
Northern Ireland were calculated according to data of the 2001 Population Census
available at: http://www.nisranew.nisra.gov.uk/Census/pdf/Census2001PopRep.pdf.


Still, Table 2 also reveals that the percentage of women among the victims of
terror in Israel was considerably higher than the corresponding rates in Northern
Ireland, where women constituted only about 16 percent of all the terror fatalities
in the civilian population. We suggest that the main reason for women’s higher
rate of risk in Israel is the relatively unselective nature of terror attacks in this
country, as compared with the patterns of terror in Northern Ireland. Whereas the
gender gap in exposure to terror both in Israel and Northern Ireland is due mainly
to the greater presence of men in the public space, Israeli women are at greater
risk than their counterparts in Northern Ireland because anyone present in public
places in Israel is more vulnerable due to the randomness of terror attacks.


Age 


A comparison of the distribution of the fatalities 
by age with the distribution of the ages in the 
whole Israeli population (see Table 3) shows that 
for the 17-24 age group fatalities were double or 
more, while for the youngest age group (up to 
16) fatalities were lower at a ratio of 1:3. For the 
other three age groups the fatality rate was roughly 
equal to their proportion of the entire population, 
although in the two oldest age groups the fatalities 
were somewhat overrepresented. We believe that 
the over-exposure to terror attacks of the 17-24 
age group is explained by its inclusion of young 
people who tend to frequent places of leisure and 
entertainment and rely more on public transporta- 
tion. Accordingly this age group is more present 
in public places where most of the terror attacks 
are perpetrated. 

An age-group comparison of Israel and Northern 
Ireland shows that the 25-39 age group in Northern 
Ireland featured more than the other groups in the 
list of victims, whereas in Israel the corresponding 
group was the one aged 17-24. A further difference
concerns the fatality rates in the youngest and old- 
est age groups. The victim population in Israel in- 
cluded more teenagers, children and elderly people 
than in Northern Ireland. 
Table 3: Terror Victims in Israel 1993-2003 and Northern Ireland 1969-2001 by Age Groups

Age groups (rows): 0-16; 17-24; 25-39; 40-59; 60+; Total
Columns: % among Victims in Israel (N = 621); % in Israeli General Population;
% among Civilian Victims in Northern Ireland (N = 1851); % in Northern Ireland
General Population

Notes: Significance of difference between terror victims and general population in
Israel: χ² = 320.9, df = 4, p < .001.

Data on the general population in Israel were taken from the 1995 census because for
these data we could determine age groups for purposes of the study. Percentages in
the general population in Northern Ireland were calculated according to data of the
2001 Population Census available at:
http://www.nisranew.nisra.gov.uk/Census/pdf/Census2001PopRep.pdf.


To account for the findings shown in Table 3 we again resort to the distinctions
between public vs. private space and selective vs. unselective terror. In both Israel and
Northern Ireland the number of victims among children and elderly people was relatively
small since both age groups have a more limited presence in the public space.
As for the disparities between Israel and Northern Ireland, namely the higher fatality
rates in the young and old age groups in Israel, this finding, like the differences with
respect to gender, can be explained by the unselective nature of the terror committed
by Palestinian organizations as compared with the terror perpetrated in Northern
Ireland. Many of the Israeli children and teenagers were killed on their way to or from
school in suicide bombings carried out on buses or at bus stops. Similarly, many of the
old people were killed in public places while shopping or dining.


Ethno-Religious Groups 


Table 4 substantiates the hypothesis that 
Jewish and other non-Arab civilians are 
struck at higher rates than Arab civilians 
in terror attacks. Some 95.7 percent of 
the fatalities were Jews (including “others”)
whereas their proportion in the
population is 81 percent. In comparison, 
the Arab citizens of Israel were harmed in 
terror acts at significantly lower percent- 
ages (4.3%) than their proportion in the 
general population (19.1%). This finding 
is consistent with the assumption that the 
Palestinian perpetrators direct their terror 
acts at the Jewish population. The ability to 
distinguish between the two ethnic groups 
is facilitated by the spatial separation be- 
tween Jews and Arabs in the country and 
by the limited presence of Israeli Arabs 
in the economic and social centers of the 
Jewish communities. Furthermore, the few Arabs killed in terror attacks were
apparently struck unintentionally, either because the attacks occurred in the few
cities with a mixed population, such as Jerusalem and Haifa, or because those Arabs
happened to be present at typically Jewish sites. Thus, most of the Arab victims on
our list were killed in terror attacks in which most of the victims were Jewish. An
incident that illustrates such situations occurred in August 2002 in the Upper Galilee
in northern Israel. A suicide bomber who boarded a bus en route to Safed
encouraged an Arab woman who sat next to him to leave the bus before “something
will happen.” The Arab woman told her friend and both got off the
bus at the next stop. A few minutes later the terrorist blew himself up. This attack
claimed the lives of nine people, two of whom were Arab (Roffe 2002).


Table 4: Terror Victims in Israel 1993-2003 by Ethno-religious Groups

                   % among Victims   % in General
Ethnicity          (N = 621)         Population
Jews and others    95.7              80.9
Arabs              4.3               19.1
Total              100.0             100.0

Notes: Significance of difference between terror victims and general population: χ² =
88.0, df = 1, p < .001.

Percentages in the general population were calculated according to data of the CBS
Statistical Yearbook for 2004. The category “Others” mainly includes Christians who
immigrated to Israel because they married Jews.
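
The chi-square value reported in the notes to Table 4 can be checked from the published percentages alone. The following is a minimal sketch, assuming the test is a Pearson goodness-of-fit comparison between the victim counts and the counts expected from the general-population shares. The function name `chi_square_gof` is illustrative and is not taken from the original analysis.

```python
def chi_square_gof(observed_pct, expected_pct, n):
    """Pearson chi-square for observed vs. expected percentage shares.

    observed_pct: shares among the victims, in percent
    expected_pct: shares in the general population, in percent
    n: number of victims on which the observed shares are based
    """
    stat = 0.0
    for obs, exp in zip(observed_pct, expected_pct):
        observed_count = n * obs / 100.0
        expected_count = n * exp / 100.0
        stat += (observed_count - expected_count) ** 2 / expected_count
    return stat


# Table 4: Jews and others vs. Arabs, N = 621 (4.3% Arab victims per the text).
table4_stat = chi_square_gof([95.7, 4.3], [80.9, 19.1], n=621)
print(round(table4_stat, 1))
```

Under these assumptions the script prints 88.0, in line with the value given in the table notes.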


New Immigrants 


New immigrants are a disadvantaged group in Israeli society (e.g., Semyonov 
1997). Given that almost all the victims were from the Jewish population, and 
since the new immigrants are part of that population, we compared the percentage 
of victims among the former with their proportion in the latter. Table 5 shows 
that the percentage of new immigrants among the Jewish fatalities was 20 percent. 
The percentage of the new immigrants in the Jewish population was 19 percent 
in 2003. That is, the odds of a new immigrant being killed in a terror attack were 
practically the same as those of other Israeli Jews.
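
The near-equality of the two percentages can also be expressed as a test statistic and a p-value. The sketch below assumes a Pearson goodness-of-fit test with one degree of freedom on the Table 5 shares (20.0 vs. 19.0 percent, N = 594). It is an illustration of the arithmetic, not the authors' own code, and the variable names are hypothetical.

```python
import math

n = 594                           # Jewish victims with known immigration status
victim_share = [0.20, 0.80]       # new immigrants, native-born or veteran immigrants
population_share = [0.19, 0.81]   # corresponding shares in the Jewish population

# Pearson goodness-of-fit statistic on counts derived from the shares.
chi2 = sum(
    (n * v - n * p) ** 2 / (n * p)
    for v, p in zip(victim_share, population_share)
)

# With one degree of freedom, the chi-square upper-tail probability equals
# erfc(sqrt(chi2 / 2)), so the standard library is sufficient.
p_value = math.erfc(math.sqrt(chi2 / 2))

print(round(chi2, 2), round(p_value, 2))
```

The printed values, 0.39 and 0.53, agree with the figures in the notes to Table 5 and indicate no detectable difference between new immigrants and other Israeli Jews.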


Table 5: Terror Victims in Israel 1993-2003 by Immigration Status

                                     % among Victims   % in General
Immigration Status                   (N = 594)         Population
New immigrants                       20.0              19.0
Native-born or veteran immigrants    80.0              81.0
Total                                100.0             100.0

Notes: Significance of difference between terror victims and general population: χ² = .39,
df = 1, p = .53.

Percentages in the general population were calculated according to data of the CBS
Statistical Yearbook for 2004.
Table 6: Terror Victims in Israel 1993-2003 by Socioeconomic Level of the
Place of Residence

                                % among Victims   % in General
Socioeconomic Level             (N = 384)         Population
Low (clusters 1-5)              7.3               20.0
Medium-Low (clusters 6-10)      30.2              25.9
Medium-High (clusters 11-15)    46.9              36.3
High (clusters 16-20)           15.6              17.8
Total                           100.0             100.0

Notes: Significance of difference between terror victims and general population: χ² =
46.6, df = 3, p < .001.

Data on the general population were taken from the 1995 census. See:
http://www1.cbs.gov.il/mifkad/tables/pirsom13/10.xls.


Socioeconomic Status 


Table 6 presents the distribution of the victims and total population by the socioeco- 
nomic level of the statistical area of residence according to the 1995 census. The two 
intermediate groups, especially the medium-high level (clusters 11-15), were over- 
represented in fatality rates relative to their proportion in the general population, 
and the two groups at the extremes, especially the lower, were under-represented. A 
possible explanation for the relatively low vulnerability of the weaker strata is that 
these groups are found more in the communities of the Israeli periphery, includ- 
ing the poor neighborhoods in the large cities, the development towns and the 
Arab communities. The relative exclusion of these groups from the public space in 
Israel's main urban centers was apparently an advantage in terms of the danger of 
encountering terror attacks. But because most of the terror victims come from the 
Jewish population, and this population is less represented in the lower classes than 
is the Arab population, we repeated the comparison between the clusters within the 
Jewish population only. This analysis showed that the under-representation of the 
lower class was reduced but did not disappear, so that the general pattern, where 
the middle strata were struck more than the weakest and the strongest, persisted.¹

As for the under-representation of the strongest strata, the members of this 
group are less dependent on public transport, and their lifestyle and leisure activi- 
ties tend to be more exclusive. They also tend to reside in relatively upscale and 
protected neighborhoods, whereas most terror attacks occurred in crowded public 
places. That is, upper class and lower class people were exposed to the danger of 
the terror attacks less than the middle classes because the latter are typically more 
present in the public space of Israeli society. 
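
The curvilinear pattern described above can be made concrete by dividing each group's share among the victims by its share in the general population, using the Table 6 percentages. This is an illustrative calculation rather than part of the original analysis. The victim share for the Low level is taken as 7.3, the remainder after the other three levels are subtracted from 100, and the dictionary names are hypothetical.

```python
# Shares in percent, by socioeconomic level of the place of residence (Table 6).
victim_share = {
    "Low": 100.0 - (30.2 + 46.9 + 15.6),  # remainder of the column total, about 7.3
    "Medium-Low": 30.2,
    "Medium-High": 46.9,
    "High": 15.6,
}
population_share = {
    "Low": 20.0,
    "Medium-Low": 25.9,
    "Medium-High": 36.3,
    "High": 17.8,
}

# A ratio above 1 means the group is over-represented among the victims;
# a ratio below 1 means it is under-represented.
ratios = {
    level: victim_share[level] / population_share[level]
    for level in victim_share
}

for level, ratio in ratios.items():
    print(f"{level}: {ratio:.2f}")
```

The ratios fall below 1 for the Low and High levels and above 1 for the two middle levels, with the Medium-High level the most over-represented, which is the inverted-U shape the text describes.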


Summary and Conclusions 


The empirical analysis indicates that the chances of being killed in terror attacks
are not equally distributed in Israeli society, and that certain social groups are at
greater risk than others. This is not to say that chance, in the sense of “being in
the wrong place at the wrong time,” is an irrelevant factor at the individual level. 
Nevertheless, when social groups are considered, other factors seem systematically 
to affect the risk level. Still, the relationship between location in the social struc- 
ture and the chances of being a victim of terror proved not as simple as it might 
appear. Our findings suggest that some marginal groups in this society, such as 
Arabs, women, the elderly and children are less vulnerable to terror attacks than 
the more privileged adult, Jewish men. As to the effects of social class, we found 
a curvilinear relationship: residents of poor as well as rich neighborhoods were 
hurt by terror attacks less than middle-class communities. These results are highly 
consistent with our prior conceptual considerations, namely the risk of becoming 
a victim of unselective terror is directly related to presence in the public space: the 
greater the presence the greater the risk. Because the less privileged are less often 
present in public spaces, they are less likely to become victims of terror. 

This line of reasoning may also explain the similarities and differences between 
Israel and Northern Ireland with respect to the effects of gender and age on the risk 
of being exposed to terror. The two countries were found similar in that women 
proved to be at lower risk than men, as did children and the elderly. They differed 
in the magnitude of these effects, given that the risk levels of women, as well as of 
children and the elderly, are higher in Israel than in Northern Ireland. As we have 
suggested, these gaps are related to the difference between the strategy of terror 
attacks in Israel, which are essentially unselective, and the patterns of terror in 
Northern Ireland, which tended to be less random in the choice of targets and to 
adhere more to certain codes. 

From the perspective of the sociological study of social stratification we believe 
that it is important to emphasize the seemingly paradoxical finding that precisely 
the groups more integrated into the Israeli mainstream, particularly middle-class 
Jewish men—as manifested in the presence of members of this group in the public 
space—are more exposed to terror attacks than more marginalized groups. 


Note 


1. In addition to the descriptive analysis we also estimated a logistic regression model 
using the case-control study approach, which is common in the health field. 
Multivariate analysis showed that, in general, the pattern arising from the descriptive 
statistics recurred in the logistic regression model. This analysis confirmed that the 
finding on the under-representation of the lower socioeconomic strata among the 
victims is robust, and holds even when controlling for other variables. The results of 
the multivariate analysis can be obtained from the authors upon request. 
Clocking In: The Organization of Work Time and Health in
the United States 


Sibyl Kleiner, Indiana University
Eliza K. Pavalko, Indiana University


This article assesses the health implications of emerging patterns in the organization 
of work time. Using data from the National Longitudinal Survey of Youth 1979, we 
examine general mental and physical health (SF-12 scores), psychological distress 
(CESD score), clinical levels of obesity, and the presence of medical conditions, at 
age 40. Overall, we find that health varies more across work hours than across types 
of shifts, and part-time workers report worse physical and emotional health than 
full-time workers. However, controlling for individual, family and job characteristics 
explains the poorer health observed among part-time workers. Those who are satis- 
fied with their jobs, have more education, or have an employed spouse, report better 
health, while women and those with a prior health limitation report worse health. 
After taking these factors into account, we find a curvilinear relationship between 
work hours and health, with those working between 40 and 59 hours per week report- 
ing worse mental and physical health than those working 40 hours per week. We also 
find that obesity differs from current health problems in its relationship to work time. 
Those who work part-time or fixed-hour schedules are less likely to be obese, sug- 
gesting that long-term health risks operating through obesity, such as cardiovascular 
disease and diabetes, are affected by time availability. 





On Oct. 24, 2003, the North American “Take Back Your Time” movement es- 
tablished an annual “Take Back Your Time Day,” and pushed to make work time 
an issue in the 2008 American presidential election. “Millions of Americans are 
overworked, over-scheduled and just plain stressed out... we're working more 
than medieval peasants did, and more than the citizens of any other industrial 
country.”¹ Its message is clear: workers are sacrificing their health and well-being
in the service of their employers, free time is growing increasingly scarce, and 
the greedy time demands of work institutions encroach on personal, family and 
community involvement. 

Yet the picture of changing work time in America is complex. In The Time Divide 
(2004), Jacobs and Gerson find that trends in Americans’ work hours over the past 
several decades are more bifurcated than previously assumed in scholarly and popu- 
lar literature (Hochschild 2001; Schor 1991). Since the 1970s, there has been a 10 
percent decline in the number of Americans working a “standard” 40-hour week, 
with a corresponding increase in both part-time and overtime work. On average,
these part-timers desire greater employment, and the overtime workers wish to work 
less (Jacobs and Gerson 2004). In addition, few Americans are working a standard 
9-to-5 shift due to variable hours, rotating shifts and non-daytime work (Presser 
2003). Although night work may be on the decline, the work day has increasingly 
expanded since the 1970s to cover early morning and evening hours (Hamermesh 
1999). While much of the literature documenting these trends in hours and shifts 
tends to either focus on the number of hours people work or the time of day when 
they do that work, the two trends are clearly related. Both suggest fundamental 
changes in the organization of Americans’ work time. 

Do these changes in the organization of work time have implications for the 
health and well-being of Americans? The intuitive hypothesis that working long 
hours or non-standard shifts is harmful to health has become largely accepted. 
Supplementing this popular view, a sizeable body of literature from America, 
Europe and Japan reports that the length of work hours and the timing of shifts are 
hazardous to health (European Foundation 2000; Galinsky et al. 2001; Ludovic,
van Amelsvoort, Schouten and Kok 2004; Megdal et al. 2005; Raediker et al.
2006). Yet while many studies of work time suggest detrimental effects on health, 
others observe that more rigorous studies find less evidence of a health effect 
(Bonzini, Coggon and Palmer 2007; Chen et al. 2006; Megdal et al. 2005). 

In this study we sharpen our understanding of the work time-health relation- 
ship by providing a systematic examination across several dimensions of work time 
and a variety of health outcomes, while also taking into account confounding in- 
fluences that have limited prior work. We use longitudinal data from the National 
Longitudinal Survey of Youth ’79 Cohort, focusing on health outcomes when 
respondents were approximately 40 years of age. Longitudinal information on 
work and health allows us to begin to separate the influence of health problems on 
how much or when people work from the effects of work time on health. Because 
these data are based on a nationally representative sample of adults in a defined 
birth cohort and because they include a wide range of information on work and 
family characteristics, we are able to control for health and work correlates of work 
time identified in prior literature (Barnett 2006). Finally, this health module of 
the NLSY79 includes a wide range of physical and emotional health outcomes, 
allowing us to assess these relationships across different health dimensions. 


Work Time and Health 


The number of hours spent working can be distinguished from when that work 
is done and the regularity of those hours. Each of these dimensions of work 
time is likely to affect and be affected by health in distinct ways. However, these 
dimensions of work time are also related and as a group reflect the organization 
of work time.² Simultaneous attention to all of these dimensions of work time is
important for understanding whether work time has consequences for health, and 
if so, the mechanisms through which these consequences materialize. A broader
understanding of the relationship between work time and health also requires 
looking across a wide array of health outcomes because the pathways leading to 
poor health will vary for different health dimensions. While unraveling the precise 
mechanisms linking work time and health is beyond the scope of this article, as- 
sessing the relationships between work time and different dimensions of health 
can provide important clues about possible mechanisms. For example, emotional 
stress is a plausible and often assumed mechanism linking long hours and non- 
standard shifts to health (e.g., Kudielka et al. 2007), but long hours performing 
repetitive tasks or heavy physical demands may also affect health because of the 
physical stress of this work. In addition, the amount and timing of work may affect 
health indirectly by interfering with time and energy for preparing healthy meals, 
exercise and sleep, increasing the risk of obesity and chronic disease. 


Quantity of Work Hours 


Recent work on the number of hours Americans work has highlighted the in- 
creasing polarization of work hours (Jacobs and Gerson 2004). Since the 1970s 
there has been an increase in both part-time (less than 30 hours per week) and 
overtime (more than 50 hours per week). Jacobs and Gerson also report that, on 
average, the part-timers desire greater employment, and overtime workers wish
to work less (2004; see also Reynolds 2003). Work time in the United States is 
distributed unequally, and is becoming increasingly divided into extremes. One 
implication of this polarization is that part-time work may be related to health in 
different ways than overtime, making the relationship between work hours and 
health non-linear and multifaceted. 

Studies suggest that part-time workers suffer from worse health than full-time 
workers (Ross and Mirowsky 1995). Part-time work may represent lower involve- 
ment with one’s workplace, weaker ties to co-workers, and decreased engagement 
with social structure. The part-time worker may experience greater under-regulation 
and consequently greater anomie and dissatisfaction (see Durkheim 1997), leading
to stress and negative health outcomes. An inability to find or maintain adequate 
work may also be stressful in itself, particularly if there is a mismatch between aspira- 
tions and achievements (Dressler 1988), or pressing economic hardship (Kohn 2002). 

To the extent that fewer work hours are linked to lower socio-economic status, 
they are also likely to have an indirect impact on health through SES. SES is 
considered a “fundamental cause” of disease (Link and Phelan 1995) because of 
the persistence and strength of its association with health. In contrast, working 
longer hours increases earnings and may even steepen future earnings trajectories, 
allowing workers greater access to health insurance and effective medical treat- 
ments, healthier neighborhoods and preventive health behaviors. 

However, in some circumstances, part-time work may have positive effects for 
health and well-being. For example, couples in which one partner works part-time
(compared with both partners working full-time, or one working full-time and the
other unemployed) have been associated with positive family-related outcomes
(Hill 2006). Working less than full-time might also free up time to pursue leisure
activities, get enough sleep and complete household tasks, as employees of the fa- 
mous American cereal corporation Kellogg claimed during the 1930s (Hunnicutt 
1996). The reduced stress of less than standard full-time work hours may be 
physically and psychologically beneficial. 

The meaning and desirability of part-time work, however, does differ somewhat 
for men and women. In the 1992 National Study of the Changing Workforce, 
15.6 percent of women vs. 7.3 percent of men reported they would change their 
jobs in order to be able to work part-time (Jacobs and Gerson 2004). Because the
organization of work time and particularly the meaning of part-time work remains 
gendered, it is possible that relationships between part-time work and health differ 
for men and women. 

“Extended duty,” or work that continues for long periods of time and takes away 
from regular sleep time, affects health through the interaction of fatigue, sleep loss
and circadian rhythm disruption (U.S. Congress 1991). The danger of overtime
has long been of concern to social critics such as Marx, who maintained that
harsh labor conditions in combination with increasing work time demands and
socio-economic hardship produce physical suffering, debilitation and premature
death (Marx 1887). Recent research has linked the stress and fatigue of overtime 
work with a higher rate of work-related injury (Dembe et al. 2005), and greater 
subjective time pressure with depression (Roxburgh 2004). 

In addition to physical depletion and increased occupational hazards, extending 
work time decreases time available to spend in other social arenas such as one’s family 
(Voydanoff 1994), which may increase stress and reduce time for healthy behaviors 
such as food preparation and exercise. Using data from the General Social Survey, 
Golden and Wiens-Tuers found that the negative effects of working overtime on fa- 
tigue and work-family interference were not mitigated by increased earnings (2006). 

The recent literature on work hours and health is more extensive in examining 
overtime than part-time, but does not consistently show negative effects of either. 
The finding most commonly reported is a positive link between overtime work and 
health complaints (Grosch et al. 2006; Raediker et al. 2006; Trinkoff et al. 2006), 
including physical, psychological and overall health ratings. However, some studies 
find no relationship between long hours and health (U.S. Department of Health 
and Human Services 2004), and some find that overtime work is associated with 
better health (Golden and Wiens-Tuers 2006; also see Stolzenberg 2001). 


Timing of Work (Shifts) 


The length of time one spends working (sometimes referred to as “chronometrical 
time”) can be distinguished from when those hours are worked, or chronological 
time (Hinrichs et al. 1991). The time of day in which one works, and the 
variability of work hours, are chronological dimensions of work time with possible 
effects on employee health. Both night work and rotating shift work have potential 
negative implications for health, although the proposed mechanisms differ. 

Social context is thought to mediate health and quality of life outcomes for night 
workers. Infrastructure and other environmental factors are cued to diurnal work 
patterns — active during the day and resting at night. In a study of South African 
employees, night shift workers had longer commuting times and poorer sleep as 
compared to day shift workers (Adler 1991). Night workers may also try to stay in 
step with day schedules, for example by staying awake during traditional mealtimes 
with their families (Presser 2004). In addition to exhaustion, asynchrony with social 
institutions may lead to increased stress levels. For both men and women with chil- 
dren, working at night rather than during the day is associated with higher divorce 
rates (Presser 2003[2000]), suggesting greater levels of family stress. 

Work that includes variable day and/or nighttime hours has further implica- 
tions for health. Switching from day to night forces continual adaptation to a new 
schedule, compounding the effects of working nights or extended hours. Rotating 
and variable hour shifts may also have negative effects on well-being if the job is 
associated with a lack of job control. Poor “schedule fit” may increase 
psychological distress (Gareis and Barnett 2002). In contrast, autonomy, flexibility and 
the ability to set one’s hours might counteract some of the detrimental health effects 
of working a rotating shift (Presser 2005). However, working a shift with variable 
hours may be the choice of the employer, not the employee, indicating that both 
irregularity of hours and job control are at issue. Fenwick and Tausig (2001) found 
that non-day, weekend and rotating shift work did not independently affect health, 
but that lack of scheduling control had strong negative effects on health. 

Most of the literature associates shift work with negative health outcomes, in- 
cluding psychological, physical and general health measures. Rotating shifts are as- 
sociated with greater stress and reports of more frequent health problems (Lac and 
Chamoux 2004), as well as hypertension (Oishi et al. 2005), diabetes (Suwazono 
et al. 2006) and serum acid levels in males (Uetani et al. 2006). Working nights 
has been associated with overall worse health (Khaleque 1999) and late fetal loss 
(Zhu et al. 2004). It has also been linked to breast cancer (Megdal et al. 2005), 
perhaps through increased exposure to artificial light which suppresses melatonin 
and its anti-carcinogenic activity. 

Yet there are also inconsistencies in the literature on shift work and health. 
Several studies fail to find negative health effects of shift work (Fenwick and 
Tausig 2001; Ludovic et al. 2004; O’Leary et al. 2006), or report better health 
outcomes among shift workers (Axelsson et al. 2006; Chen et al. 2006; Yadegarfar 
and McNamee 2003). Additionally, night shift workers are often combined with 
workers in rotating (day to night) or other variable hours shifts in studies of “shift 
work” (e.g., Pietroiusti et al. 2006). Whether “shift work” studies capture work 
outside of traditional daytime hours, variability of hours or the magnitude of a 
rotation is often unclear (Knutsson 2004). Finally, researchers may attribute null 
effects or positive relationships to selection (Yadegarfar and McNamee 2003). 


Methodological Inconsistencies in the Work Time-Health Literature 


Research assessing the effects of work time on health has yielded inconsistent 
findings. Several confounding influences may account for these inconsistencies. First, 
work time is closely related to other dimensions of both family and work 
environments, making it important to disentangle the influence of these factors from the 
specific effects of work time on health. Second, relationships between work time 
and health are likely to be reciprocal, with health influencing how much and when 
people work and work time affecting health. These effects may be temporary, as 
the body adapts to a new schedule (Kudielka et al. 2007), or they may persist and 
affect future labor force participation. Finally, it is likely that work time will affect 
different dimensions of health in varying ways (Knutsson 2004), thus making it 
difficult to make broad conclusions about health effects from studies that focus 
on a single or small number of health indicators. 


Covariation with Work and Family Environments 


The organization of work time is closely connected to other characteristics of both 
work and family environments, and these environments may affect health in a vari- 
ety of ways. The covariance of work time with other dimensions of both work and 
family makes it critical that studies control for important covariates such as gender, 
the presence of young children in the household, income, race, education, job 
characteristics and attitudes toward one’s job (Barnett 2006). Because specialized samples 
may overselect individuals on one or more of these confounding characteristics, it 
is also important to investigate these relationships with nationally representative 
samples. Many studies (Oishi et al. 2005; Suwazono et al. 2006) focus on one em- 
ployer or one occupation, limiting the ability to control for SES, job characteristics 
and other factors that are confounded with work time. Prior research also typically 
assesses either work hours or shifts, failing to isolate one from the other, and some 
fail to distinguish between working variable hours and working at night. 


Reciprocal Effects of Health on Work Time 


Although health may be influenced by how much and when one works, work 
time is also influenced by health. Health selects workers into and out of the labor force 
(Ross and Mirowsky 1995; Pavalko and Smith 1999; Pavalko, Gong and Long 
2007) and workers with health problems may cut hours or change their shifts to 
better manage their health needs. Workers with mild health problems may also 
be less likely to work overtime, making it difficult to distinguish the health effects 
of long work hours from health selection. Researchers who study shift work also 
suggest a “healthy shift-worker effect.” Shift workers with worse health quit their 
jobs and select into jobs with standard hours, obscuring the potential negative 
health effects of non-standard work hours (Knutsson 2004). Longitudinal data 
can be valuable for helping to sort out these reciprocal influences (Presser 2000), 
yet longitudinal assessments of the effects of work time on health are rare. The few 
longitudinal studies that do address work time and health generally have been 
limited to smaller non-representative samples or studies of single occupations (U.S. 
Department of Health and Human Services 2004; but see Stolzenberg 2001). 


Varying Etiological Pathways 


Work time may affect health through a variety of direct and indirect pathways. 
Because health outcomes reflect different etiological processes, they provide 
important clues about these different mechanisms. For example, if work time 
primarily affects health because it produces emotional stress, then we should find 
the strongest influence of work time on emotional health and physical problems 
such as hypertension and coronary heart disease. Conversely, if work time also 
affects health through indirect pathways such as reducing time for healthy meals 
and exercise, these influences should be most evident through increased risk of 
obesity. We may also find stronger selection effects of physical health on work time, 
as workers are required to make work adjustments to accommodate physical limi- 
tations. Unfortunately, most studies assessing effects of hours or shifts on health 
focus on a single health outcome or type of outcome (for example, hypertension, 
birth outcomes, mental health or overall self-rated health). These studies are valu- 
able for providing an in-depth assessment of individual health processes, but at- 
tending to a limited number of health outcomes limits our ability to assess broad 
health effects of the organization of work time. Sociologists have long argued that 
understanding social determinants of health requires attention to a wide range 
of health outcomes because these comparisons across outcomes can better guide 
further inquiry into underlying mechanisms (Aneshensel 1991; Thoits 1995). 

In this article we ask whether the changes in the organization of work time 
have implications for the health and well-being of Americans. In light of the 
inconsistencies in prior literature, we look at the relationships between multiple 
measures of health and the timing and duration of work hours, and whether 
these relationships hold even when controlling for individual, work and family 
characteristics, as well as prior health limitations. 


Data and Methods 
Sample 


Our data come from the National Longitudinal Survey of Youth 1979, a nation- 
ally representative sample of 12,686 men and women residing in the United States. 
In the first wave of the survey (1979) these individuals were 14 to 22 years of age. 


1470 © Social Forces 88(3) 


The original NLSY79 sample included two supplemental oversamples (a military 
supplement and an economically disadvantaged supplement). Portions of each 
oversample were dropped in the 1980s and 1990s; the original sample size of those 
who continued to be followed was 9,964. Seventy-two percent (n = 7,182) of that 
original sample completed the 40+ health module. The study participants were re- 
interviewed on a yearly basis through 1994 with subsequent follow-ups every two 
years. In each wave, face-to-face interviews conducted by the Center for Human 
Resource Research and the National Opinion Research Center collected detailed 
information concerning labor force participation, including employment and 
the timing of changes in employment, spouse’s employment, wages and income, 
industry, and work schedule (Bureau of Labor Statistics 2004). 


Study Design 


To analyze the effects of work time on health, we use a range of health measures 
collected when respondents were approximately 40 years old. Beginning with 
the 1998 survey, a 40-and-older health module was included in the NLSY. Each 
respondent was asked this module only once, at the survey closest to when 
he or she turned 40. We refer to the survey after respondents turned 40 as T2. 
Taking into account the temporal ordering of work time and health, all control 
and work time variables were pulled from the survey prior to when they turned 
40 (T1). This design allows us to assess the effects of work time on health when 
the men and women in the NLSY were approximately 38. 
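
The T1/T2 assignment described above can be sketched in a few lines of Python. This is an illustration only, not the procedure actually used with the NLSY files: the function name `pick_waves`, the survey calendar, and the approximation of age as survey year minus birth year are all assumptions made for the example.

```python
def pick_waves(birth_year, survey_years, module_start=1998, threshold_age=40):
    """Return (t1_year, t2_year) for one respondent, or None if no wave qualifies.

    T2 is the first survey year, from module_start onward, in which the
    respondent is at least threshold_age; T1 is the survey wave just before it.
    Age is approximated as survey year minus birth year (an assumption).
    """
    years = sorted(survey_years)
    for i, year in enumerate(years):
        if year >= module_start and year - birth_year >= threshold_age:
            if i == 0:
                return None  # no earlier wave available to serve as T1
            return years[i - 1], year
    return None


# Hypothetical biennial survey calendar, for illustration only
waves = [1994, 1996, 1998, 2000, 2002, 2004, 2006]
example = pick_waves(1960, waves)  # (1998, 2000)
```

Under these assumptions a respondent born in 1960 contributes work time and controls from the 1998 wave (age 38) and health measures from the 2000 wave (age 40), which matches the approximate ages given in the text.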

Our analysis is limited to employed respondents who were working at T1 and 
who had valid responses to the 40+ health module. A baseline N of 7,182 respon- 
dents available at T2 was reduced to 6,468 after non-employed respondents were 
dropped. We lose additional cases due to missing data, for a final N of 5,341. 
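
The sample flow can be checked with simple arithmetic. The sketch below uses only the counts reported in the text; the variable names are hypothetical and are not NLSY identifiers.

```python
# Sample sizes as reported in the text
followed = 9964        # original sample members who continued to be followed
health_module = 7182   # completed the 40+ health module (baseline N at T2)
employed_t1 = 6468     # remaining after non-employed respondents were dropped
final_n = 5341         # remaining after cases with missing data were dropped


def share(part, whole):
    """Return part/whole as a percentage, rounded to one decimal place."""
    return round(100 * part / whole, 1)


module_rate = share(health_module, followed)          # share completing the module
dropped_not_employed = health_module - employed_t1    # lost to the employment filter
dropped_missing = employed_t1 - final_n               # lost to missing data
retained_of_module = share(final_n, health_module)    # analytic share of module sample
```

The first figure reproduces the "seventy-two percent" reported for the health module; the remaining quantities show how many cases each later restriction removes.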


Measures 


Table 1 shows descriptive statistics for all variables included in the study. We 
created six dependent variables to measure health outcomes. We used compos- 
ite measures of physical and mental health from the SF-12 (Short Form 12) 
Physical and Mental Health Summary Measures (Ware, Kosinski and Keller 
1996). Psychological distress was measured from the Center for Epidemiological 
Studies Depression Scale. Height and weight measures were used to calculate 
body mass index, which was then dichotomized to indicate obesity for respondents 
with a BMI of over 30 (CDC 2007). We also coded for the presence of 
any of 15 very life-threatening medical conditions (heart problems including 
heart attack, stroke, angina, cancer or congestive heart failure) or somewhat 
life-threatening medical conditions (Gold et al. 2002). A separate measure was 
included to capture the total number (out of 17) of non-life-threatening medical 
conditions reported (Gold et al. 2002), such as joint and back problems, chronic 
upper respiratory problems, skin conditions, frequent headaches, frequent urinary 
tract or kidney infections, and thyroid conditions. 


Our independent variables indicate the quantity and timing of work hours. We 
created a continuous variable for number of hours worked per week, based on the 
number of hours worked per year divided by the number of weeks worked per year. 
This variable was then broken up into six dummy variables: <30, 30-39, 40, 41-49, 
50-59 and >59 hours per week. We used a categorical measure due to substantive 
concerns with the broad changes in work hours in North America, and because 
of the likelihood of non-linearity in the relationship to health. We also included 
two dummy variables indicating type of shift: Night (a fixed night schedule vs. a 
day, evening or irregular schedule); and Irregular (rotating or irregular work hours 
or schedule vs. a fixed day, evening, night or split shift). 

We included controls for race and gender. Controls collected at T1 include 
income, number of children, presence of young children in the household, type 
of occupation, tenure with employer, spouse's level of employment, educational 
attainment, and whether health problems limited or prevented employment. 
Because of study design the respondents were very close in age (37-39 years old) 
at T1, but to control for any variation we included age as a control at T1. 


Table 1: Descriptive Statistics 
[Columns in the original: Variable Name, Mean, Standard Deviation, Variable Description] 

Variable Name                     Variable Description 

Dependent Variables (T2) 
Distress (CESD)                   Center for Epidemiological Studies Depression Scale 
Physical Health (SF-12)           Short Form 12 Physical Component Summary Score 
Mental Health (SF-12)             Short Form 12 Mental Component Summary Score 
Life-threatening condition        Has a very or somewhat life-threatening medical condition 
Non-life-threatening condition    Has a non-life-threatening medical condition 
Obese                             Is obese (BMI 30+) 

Work Time (T1) 
<30                               Works less than 30 hours per week 
30-39                             Works 30-39 hours per week 
41-49                             Works 41-49 hours per week 
50-59                             Works 50-59 hours per week 
>59                               Works >59 hours per week 
Irregular                         Works non-fixed hours (irregular or rotating) 
Night Shift                       Works a night shift 

Controls (T1) 
Health limits work                Does (or would) health limit ability to work 
Technical/Sales/Administrative    Primary job is technical, sales, or administrative support 
Service                           Primary job is a service job 
Operator/Laborer                  Primary job as an operator or laborer 
Farming/Craft                     Primary job is farming or craft work 
Government                        Primary job sector: government 
Private/Non-Profit                Primary job sector: private or non-profit 
Job tenure                        Job tenure with primary employer (weeks) 
Job satisfaction                  Job satisfaction (1 high, 4 low) 
Spouse not employed               Married x Spouse not employed 
Spouse works <30                  Married x Spouse works less than 30 hours per week 

Table 1 (continued): Variable Description 
Married x Spouse works 30-39 hours per week 
Married x Spouse works 40 hours per week 
Married x Spouse works 41-49 hours per week 
Married x Spouse works 50-59 hours per week 
Married x Spouse works >59 hours per week 
Presence of child under six years old in the household 
Log of spouse and individual income combined 

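
The coding of the work time and obesity measures can be illustrated with a short Python sketch. It is a hedged reading of the description above, not the code used in the study. Several details are assumptions made for the example: weekly hours are rounded to the nearest whole hour before being categorized, obesity uses the CDC cutoff of a BMI of 30 or higher with metric inputs, and the schedule labels are hypothetical.

```python
def weekly_hours(hours_per_year, weeks_per_year):
    """Average hours per week: annual hours divided by weeks worked."""
    return hours_per_year / weeks_per_year


def hours_category(hours):
    """Assign one of the six work-hour categories (40 is the omitted category).

    Rounding to whole hours before categorizing is an assumption; the text
    does not say how fractional values were handled.
    """
    h = round(hours)
    if h < 30:
        return "<30"
    if h < 40:
        return "30-39"
    if h == 40:
        return "40"
    if h < 50:
        return "41-49"
    if h < 60:
        return "50-59"
    return ">59"


def is_obese(weight_kg, height_m, cutoff=30.0):
    """Dichotomize body mass index (kg / m^2) at the assumed cutoff of 30."""
    return weight_kg / height_m ** 2 >= cutoff


def shift_dummies(schedule):
    """Return the Night and Irregular dummies for a (hypothetical) schedule label."""
    return {
        "night": int(schedule == "night"),
        "irregular": int(schedule in ("rotating", "irregular")),
    }
```

For example, a respondent reporting 2,080 hours over 52 weeks falls in the omitted 40-hour category, and a rotating schedule is coded as Irregular but not as Night.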
In our analysis we address (1) how health outcomes are distributed by shift type 
and length of the work week, (2) to what extent relationships are attenuated by 
controls for other dimensions, and (3) whether any remaining relationships are 
explained when we account for existing health limitations at T1. 


Results 
Mental Health 


In the primary set of multivariate analyses, we investigate baseline associations 
between the organization of work time and measures of mental health. Panel A of 
Table 2 shows the regression coefficients for psychological distress, as measured by 
the CESD (higher CESD scores indicate more reported symptoms). In Model 1, 
we use OLS regression to estimate distress as a function of work hours and shift, 
using 40 hours per week, fixed hours and daytime work as the omitted categories. 
In Model 2, we include additional measures likely to affect health outcomes: race, 
gender, job satisfaction, job tenure, education, age, number of children, having 
a child under the age of six in the household, earned income (ln), occupational 
type, job sector, and spouse’s hours. Finally, in Model 3 we introduce an additional 
control variable, indicating whether the respondent reported, at Time 1, that his 
or her health limited the type of work she or he could do or prevented working. 


Table 2: OLS Regressions of Distress (Panel A) and Mental Health Score (Panel B) 
on Hours and Shift 

                    Model 1        Model 2                  Model 3 
                    Work Time      Work Time + Controlsᵃ    Work Time + Controlsᵇ 

Panel A: Distress (CESD) 
<30                 [illegible]    .34                      .14 
30-39               [illegible]    .29                      [illegible] 
41-49               .05            [illegible]              [illegible] 
50-59               .06            .44**                    .44** 
>59                 -.13           .18                      .09 
Irregular shift     [illegible]    [illegible]              .29 
Night shift         .43            .09                      .05 
R²                  [illegible]    .08                      [illegible] 

Panel B: Mental Health (SF-12) 
<30                 [illegible]    -.88*                    -.61 
30-39               -.86*          -.36                     -.3 
41-49               -.49           [illegible]              -.13* 
50-59               -.13           -.50                     -.49 
>59                 .46            -.03                     [illegible] 
Irregular shift     -.50           -.63                     -.51 
Night shift         -.11           -.50                     -.45 
R²                  .01            .05                      .06 

ᵃ Model 2 controlled for race, gender, job tenure, job satisfaction, education, age, number 
of children, having a child under 6 in the household, log of earned income, occupational 
type, job sector, and spouse's hours. 
ᵇ Model 3 also controlled for whether health prevented or limited work. 
N = 5,341; *p < .05 **p < .01 


Overall, we find that working more than 40 hours, but not more than 59 hours, 
is associated with higher levels of distress. As shown in Model 1, workers reporting 
less than 40 hours report significantly higher levels of distress on the CESD. Those 
who work variable hours (rotating between nights and days, or reporting “irregu- 
lar” hours) also report higher levels of distress. However, controlling for work and 
family characteristics in Model 2 accounts for the higher levels of distress among 
part-time workers. Thus, we find that lower levels of work time do not increase 
distress, but rather, the association between part-time work and distress in Model 
1 is explained by the characteristics of part-time workers. Controlling for these 
factors, we find a curvilinear effect of work hours on distress in Model 2; greater 
than 40 hours of work per week (but not >59 hours per week) is associated with 
more distress as indicated by significantly higher levels on the CESD. However, 
Model 2 does not account for the higher distress scores reported by those working 
variable hours. In Model 3, controlling for a prior health problem that limited 
work reduces this association to below statistical significance. This suggests that 
those who work irregular schedules for health reasons are more distressed. There 
is still no association between part-time work and distress in Model 3, and we 
continue to observe a non-linear association between greater-than-standard work 
hours (41-59, but not >59 hours per week) and psychological distress. 

In Table 2, Panel B, we find a similar pattern for overall mental health. The 
response variable is the mental component summary score from the SF-12; a 
higher number indicates a more positive health score. In Model 1, there is a 
negative relationship between part-time work and mental health, as compared to 
full-time work. This relationship between part-time work and worse mental health 
is mitigated by controls in Model 2, and becomes non-significant in Model 3. A 
negative association between mental health and working moderately more than 
standard hours (41-49 hours per week, but not 50-59 or >59 hours per week) 
emerges in models 2 and 3. We fail to find an association between mental health 
and shift type in any of the models. 


Physical Health 


As with the mental health outcomes, we find an overall curvilinear relationship 
between greater work hours and worse reported physical health. Table 3, Panel A shows 
the OLS estimates for the effect of work hours on the SF-12 physical component 
summary score. As with the SF-12 mental component score, a larger number 
indicates a more positive health score. Again we see a significant relationship between 
part-time work and lower score on the SF-12 (worse health) in Model 1 that is 
attenuated by controls in Model 2, and absent in Model 3. In models 2 and 3 we 
observe a significant negative relationship between working 50-59 hours per week 
(but not 41-49 or >59 hours per week) and physical health on the SF-12 (p < .05). 


Table 3: OLS and Logit Regressions of Physical Health Measures on Work Hours 
and Shift Type 

                    Model 1        Model 2                  Model 3 
                    Work Time      Work Time + Controlsᵃ    Work Time + Controlsᵇ 

Panel A (OLS): Physical Health (SF-12) 
<30                 -1.32***       -.97*                    -.29 
30-39               -.68*          -.50                     -.34 
41-49               .18            -.20                     -.25 
50-59               -.12           -.62*                    -.60* 
>59                 -.15           -.52                     -.20 
Irregular shift     -.34           -.44                     -.13 
Night shift         -.42           .08                      .19 
R²                  .01            .04                      .16 

Panel B (Logit): Life-Threatening Condition 
<30                 .44***         [illegible]              .16 
30-39               .20            .04                      .01 
41-49               [illegible]    [illegible]              .20* 
50-59               .10            [illegible]              [illegible] 
>59                 -.02           .14                      .08 
Irregular shift     .00            .08                      .02 
Night shift         .19            .20                      .18 
Pseudo R²           .00            .03                      .07 

Panel C (Logit): Non-Life-Threatening Condition 
<30                 [illegible]    .23                      .14 
30-39               [illegible]    .08                      .06 
41-49               [illegible]    [illegible]              .22* 
50-59               [illegible]    [illegible]              .26** 
>59                 .05            [illegible]              .08 
Irregular shift     .04            .08                      .04 
Night shift         .05            .12                      [illegible] 
Pseudo R²           .00            .02                      .04 

ᵃ Model 2 controlled for race, gender, job tenure, job satisfaction, education, age, number 
of children, having a child under 6 in the household, log of earned income, occupational 
type, job sector, and spouse’s hours. 
ᵇ Model 3 also controlled for whether health prevented or limited work. 
N = 5,341; *p < .05 **p < .01 ***p < .001 


Our other two indicators of physical health measure the presence of a very 
or somewhat life-threatening medical condition and the presence of a 
non-life-threatening medical condition. Panels B and C show logit coefficients for work 
hours and shift on the likelihood that someone has a very/somewhat (Panel B) 
or non-life-threatening (Panel C) medical condition. For both types of medical 
conditions, Model 1 indicates that part-time workers (<30 hours per week) are 
more likely to report these conditions, but these health effects are explained by 
controls in models 2 and 3. In all three models, working more than 40 hours 
per week, but not more than 59 hours per week, was associated with a greater 
likelihood of reporting a non-life-threatening medical condition. There were no 
significant associations between working a night shift or working variable hours 
and reporting a medical condition. 


Obesity 


The effects of work time on obesity (BMI > 30) are shown in Table 4. In each of the 
models, part-time hours have an independent, protective effect on obesity risk. The 
logit estimates for obesity on hours and shift indicate that workers reporting work- 
ing 40 hours per week or more are more likely to be obese than part-time workers 
(fewer than 30 hours per week), and this pattern stands up to controls in models 2 
and 3. Even accounting for prior health limitations (Model 3), the logit coefficient 
for obesity on working less than 30 hours per week is -.3. Working less than 30 
hours per week, rather than 40, decreases the odds of being obese by 26 percent, a 
probability decrease of 6 percent (odds ratios and predicted probabilities not shown). 
In separate estimates (shown in Table 4), we find that greater than full-time hours 
(41-49, 50-59 or >59) are significantly associated with obesity (β = .34, p < .01 for 
41-49 hours per week, β = .49, p < .001 for 50-59 hours/week, and β = .30, p < .05 
for >59 hours/week) compared to part-time hours (<30 hours per week). 

Working variable hours also affects the likelihood of being obese, with variable 
hours associated with a higher chance of obesity even when controlling for socio- 
demographic characteristics, job tenure, occupational characteristics and spouse's 
hours (Model 2), as well as whether health limits the ability to work (Model 3). 
Holding all other variables constant, workers with irregular hours (including those 
who rotate between day and night shifts) have 1.27 times higher odds of obesity 
(5% greater probability) than workers with a fixed schedule (Model 3, odds ratios 
not shown). Although night shift work is associated with a greater likelihood of 
obesity in Model 1 (p < .05), with controls added to the equation in Model 2 this 
relationship does not hold. 
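
The odds figures reported for obesity follow directly from the logit coefficients: exponentiating a coefficient gives the odds ratio. The sketch below shows that conversion for the part-time coefficient (-.30) and the irregular-shift coefficient (.24). The change in probability also depends on a starting probability, which is not reported for these estimates; the value of 0.30 used here is an assumption chosen purely for illustration, and the function names are hypothetical.

```python
import math


def odds_ratio(logit_coef):
    """Convert a logit coefficient into an odds ratio."""
    return math.exp(logit_coef)


def shifted_probability(baseline_prob, logit_coef):
    """Probability after applying a logit coefficient to a baseline probability."""
    odds = baseline_prob / (1 - baseline_prob)
    new_odds = odds * math.exp(logit_coef)
    return new_odds / (1 + new_odds)


baseline = 0.30  # assumed baseline probability of obesity (illustrative only)

part_time_or = odds_ratio(-0.30)   # about 0.74, i.e. roughly 26 percent lower odds
irregular_or = odds_ratio(0.24)    # about 1.27 times the odds

part_time_drop = baseline - shifted_probability(baseline, -0.30)
irregular_rise = shifted_probability(baseline, 0.24) - baseline
```

With the assumed baseline of 0.30, the part-time coefficient lowers the probability by about six percentage points and the irregular-shift coefficient raises it by about five, in line with the magnitudes described above. A different baseline would give somewhat different probability changes, while the odds ratios stay the same.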


The Effects of Control Variables 


Our control variables included work, family, race, class and gender, as well as prior 
health status (refer to Appendix 1). With the exception of age and number of 
children, the control variables had significant independent effects on at least some 
of the health outcomes, although these relationships differed across the health 
measures. Most consistently, being satisfied with one’s job, having more 
education, and having an employed spouse tended to be associated with better health, 
while having had a health problem that previously limited one’s work was 
associated with worse emotional and physical health and greater risk of obesity. There 
were relatively few systematic occupational or job sector differences in health. 
Consistent with previous research, women tended to report more psychological 
distress and emotional health problems as well as more physical health problems 
than did men. Differences between blacks and whites tended to vary across the 
health outcomes. Blacks reported higher levels of distress and a greater risk of 
obesity than whites, but blacks also reported better overall emotional health and 
fewer medical conditions. The health of Hispanics was more similar to whites, but 
Hispanics reported more obesity and fewer medical conditions than did whites. 


Table 4: Logit Regressions of Obesity on Work Hours and Shift Type 

                    Model 1        Model 2                  Model 3 
                    Work Time      Work Time + Controlsᵃ    Work Time + Controlsᵇ 

Obesity 
<30                 -.40***        -.27*                    [illegible] 
30-39               -.16           -.11                     -.11 
41-49               -.13           .02                      .02 
50-59               -.03           [illegible]              [illegible] 
>59                 -.14           -.02                     -.02 
Irregular shift     .19*           [illegible]              .24* 
Night shift         [illegible]    .20                      .20 
Pseudo R²           .00            .03                      .03 

<30 Hours per Week as Comparison Group 
30-39               .24            .18                      .19 
40                  [illegible]    .29*                     [illegible] 
41-49               [illegible]    [illegible]              [illegible] 
50-59               [illegible]    .46***                   .49*** 
>59                 [illegible]    [illegible]              [illegible] 

ᵃ Model 2 controlled for race, gender, job tenure, job satisfaction, education, age, number 
of children, having a child under 6 in the household, log of earned income, occupational 
type, job sector, and spouse’s hours. 
ᵇ Model 3 also controlled for whether health prevented or limited work. 
N = 5,341; *p < .05 **p < .01 ***p < .001 


Gender Differences in the Organization of Work Time and Health 


We also ran the full models for each health outcome separately for men and 
women to assess whether the relationship between work time and health differs by 
gender (results not shown). Significant coefficients were compared to results from 
an additional model including interaction terms for each of the independent and 
control variables multiplied by gender. Overall we found that the effects of work 
hours and shifts on the various health outcomes were more similar for men and 
women than they were different, with a few notable exceptions. As we expected, 
part-time work does appear to relate to mental health for women differently than 
it does for men. For example, among women, part-time workers do not have 
significantly worse mental health, as measured by both the CESD and by the 
SF-12, than those working 40 hours per week. However, we do see significantly 
worse mental health for men working part-time than for male full-time workers. 
These gender differences for part-time workers are only statistically significant for 
the CESD and for those working less than 30 hours a week. One of the other few 
statistically significant differences between coefficients for men and women was 
found for physical health among workers working an irregular shift. The overall 
physical health of men working an irregular shift, as measured by the SF-12 scale, 
is significantly worse than that of men working a regular shift. For women there are 
no significant physical health differences between those working regular and 
irregular shifts. However, 
given that each model has seven work-time coefficients and we are assessing ef- 
fects across six health outcomes, it is important to note that most of these are not 
significantly different for men and women, and we would expect that, by chance, 
one or two might be statistically significant. The overall pattern appears to be 
more similarity in the relationship between work time and health for men and 
women than difference. 
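
The caution about chance findings can be made concrete. With seven work-time coefficients compared across six health outcomes, there are 42 gender comparisons. The sketch below computes how many would be expected to reach significance at the .05 level by chance alone. It assumes the tests are independent, which is a simplification, since the health outcomes are likely correlated.

```python
n_coefficients = 7   # work-time coefficients per model
n_outcomes = 6       # health outcomes examined
alpha = 0.05         # significance level

n_tests = n_coefficients * n_outcomes

# Expected number of significant gender differences if no true difference exists
expected_by_chance = n_tests * alpha

# Probability of at least one significant difference by chance,
# assuming independent tests (a simplifying assumption)
prob_at_least_one = 1 - (1 - alpha) ** n_tests
```

Under these assumptions roughly two of the 42 comparisons would be significant by chance, and the probability of seeing at least one is close to 0.9, which supports reading the few observed gender differences cautiously.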


Discussion 


Both popular and scholarly work in recent years has documented fundamental 
changes in the organization of work time. It is often assumed that changes in when 
and how much people work, particularly working longer hours or non-standard 
shifts, have implications for health and well-being. Yet prior research on the health 
effects of work time has yielded inconsistent findings (Chen et al. 2006; Golden 
and Wiens-Tuers 2006; U.S. Department of Health and Human Services 2004). 
Just as changes in work hours are more complex than originally assumed, reflecting 
increases in both under and overwork (Jacobs and Gerson 2004), the health effects 
of work time are also complex. While work time may have a direct effect on health 
and well-being, work time is closely related to many other dimensions of work that 
also affect health, including working conditions, job satisfaction, occupational status, 
income, and in the United States, health insurance. Workers who work long hours 
are also a select group, and those who are in poorer health may be less likely to 
have the opportunity or be willing to take on longer work hours if the opportunity 
arises. Finally, work time may also affect different dimensions of health in varying 
ways. The primary goal of this research has been to sharpen our understanding of 
the effects of work time, including both work hours and work shift, on a variety 
of health dimensions, while also taking these confounding influences into account. 
Our findings suggest four general patterns in the relationship between work 
time and health. First, we generally find more health variation across different 
levels of work hours than we do across different types of work shifts. While there 
are a few exceptions (e.g., psychological distress and obesity), even before intro- 
ducing controls, the health of workers who work irregular or rotating shifts or 
those who regularly work the night shift generally does not differ from workers 
who work more standard shifts. In contrast, multiple measures of psychological 
and physical health vary across levels of work hours. The lack of health variation 
between standard and non-standard shift workers may indicate that shift type has 
relatively few effects on the health of midlife workers, but it may also reflect more 
subtle selection processes than are accounted for in our analyses. While models 
control for the extreme case of workers who previously reduced or stopped work 
because of poor health, workers who do not fall into this extreme case but whose 
health is more sensitive to non-standard shifts or whose energy is more limited 
may avoid jobs where they would have to work a non-standard or rotating shift. 
Relatively few workers in our sample report non-standard, and especially regular 
night shift work, suggesting that even in such a large sample of workers, we may 
lack adequate statistical power to fully detect health differences between standard 
and non-standard shift workers. In either case, our findings indicate that, given 
both the number of persons working very short or very long hours and the health 
differences across work hours, how much people work is more closely related to 
physical and emotional health than when people work. 

Second, there is a higher concentration of health problems among part-time 
workers, but we do not find an independent effect of working fewer than standard 
hours on any of the health outcomes. Compared to persons who work 40 hours 
per week, those working less than 30 hours (i.e., part-time workers) generally 
report significantly worse physical health, and males working part-time also 
report significantly worse emotional health. Part-time workers report higher levels 
of psychological distress, lower levels of emotional and physical health on the 
SF-12 scale, and are more likely to report both very life threatening and non-life- 
threatening medical conditions. However, virtually all of these differences between 
part-time and full-time workers are accounted for by controls for work and family 
characteristics. Job satisfaction, higher levels of education, and spousal employ- 
ment are associated with better health, while women and those with a prior health 
limitation affecting the ability to work report worse health. This suggests that it is 
not working fewer hours that is unhealthy, but rather, the limited resources that 
are tied to part-time work. 

Third, after controlling for the resources and work conditions that are closely 
linked to work hours, we find a curvilinear relationship between work time and 
health, with work beyond 40 hours per week, but not beyond 59 hours, predicting 
health penalties. While moderate levels of greater-than-standard work hours had 
a negative effect on distress and mental and physical health, we were surprised 
to find that workers at the extreme, reporting >59 hours per week, do not report 
significantly worse health than full-time (40 hours per week) workers. Recent 
research has found that working more than 40 hours per week increases access 
to flexible scheduling; for 50+ hours the effect is much greater (Golden 2008). 
Perhaps the >59 group, which was not examined separately in that study, has 
even more access to this benefit. We considered the possibility that the >59 hour 
a week group was too small to detect health differences, but we noted that 10 
percent of the sample reported working >59 hours per week. A final possibility 
is that there may be multiple selection processes differentiating those who work 
very long hours from standard full-time workers. Our measure of health selection 
only captures those who reduced their work because they had health problems, but 
health selection may operate in the other direction as well. Persons who are very 
healthy and have high levels of energy may be more likely to work exceptionally 
long hours. In other words, we controlled for health limitations, but not for health 
advantages, and it is possible that both health limitations and advantages influence 
how much people work. We hope that future research will continue to disentangle 
these intertwined selection and causation processes and be able to assess multiple 
dimensions of health selection. 

Finally, our results suggest that work time is related to obesity in somewhat 
different ways than for either physical or emotional health. Unlike indicators of 
current physical and emotional health, for obesity, part-time work is associated 
with lower risk of health problems compared to work at full-time levels and higher, 
and this lower risk stands up to controls for job and family conditions, other 
resources, and health selection. In other words, the resources that one gains from 
working full-time or more appear to do little to offset the risk of obesity. Working 
an irregular shift is also associated with increased risk of obesity. The direction and 
persistence of both hours and shift on obesity risk are consistent with concerns 
that increases in work hours, and even full-time work, as well as working 
non-standard shifts, allow less time for the preparation of healthy meals and exercise 
(Phipps et al. 2006). 

The importance of obesity, particularly at age 40, is its risk for future health 
problems such as cardiovascular disease and diabetes. When viewed in the context 
of the adult life course, the overall pattern of findings suggests that the organiza- 
tion of work time at midlife has modest implications for short-term physical 
health and emotional well-being while simultaneously setting the stage for more 
significant health problems later in the life course. This temporal pattern of cur- 
rent and future health risk is important because it suggests that the mechanisms 
through which work time affects health are multifaceted, most likely reflecting 
both stress and time demands. While full attention to these 
various mechanisms is beyond the scope of this article, we hope that the patterns 
across multiple health measures provide clues for future research. 

While this study addresses many of the limitations that have plagued research 
in this area, particularly the use of cross-sectional data, small or specialized 
samples, and few controls for components of the work and family environment 
that correspond with work time, our findings remain limited in several respects. 
Although the NLSY-79 provides a nationally representative sample and attrition 
has been remarkably low, attrition has reduced the representativeness of this 
sample. Also, while these respondents have been followed since they were teens 
or young adults, they are not asked about such a range of physical and emotional 
health before age 40. Thus, although we are able to take advantage of the lon- 
gitudinal component of these data to assess the effects of work time on health 
approximately two years later, the snapshot we provide remains limited to these 
effects at one point in men’s and women’s work life. There are many reasons to 
think that the effects of work time on health may differ for 30-year-olds or 55-year-olds, 
and also that the effect of health on work time (i.e., selection processes) may 
change as workers age. While we suspect this may be the case, these questions are 
beyond the scope of the NLSY-79 data. 

An even greater limitation of this study is our conceptualization of work time 
in terms of individuals’ hours. As noted by Jacobs and Gerson (2004), the changes 
in the amount of time individuals spend at work are overshadowed by much 
greater change in the amount of time couples spend at work. It is quite possible 
that how much or the combination of hours that couples spend at work may have 
even more striking effects on individual health than individual hours (see also 
Stolzenberg 2001). While our models control for spouse’s work, we suspect that 
attention to combinations of work hours and shifts between couples will provide 
an important piece to our understanding of the health effects of work time. 


Notes 


1. http://www.timeday.org. 


2. We use the term “work time” and “organization of work time” interchangeably. We 
refer to the dimensions of work time as “work hours” and “work shifts.” 


3. Zhu et al. (2004) use a longitudinal design to examine the associations between shift 
work and late fetal loss, but their national Danish cohort only includes women who 
were both pregnant and working at the time of the study, and they do not control 
for income or overall prior health. The study design also follows each participant for 
less than nine months. 


4. We re-ran all of our final models using Heckman’s (1979) sample selection model. A 
parsimonious selection equation was estimated (Winship and Mare 1992) predicting 
employment from sex and labor force status six years prior to T1. The Heckman 
models were similar to our original regression models, indicating that selection out 
of the labor force on the basis of sex or pre-existing condition six years prior was 
not affecting most of our results. Likelihood-ratio tests indicated that the use of 
Heckman’s sample selection was not justified, so original estimates are presented. 


5. We combined seven CESD items for an overall measure of psychological distress. (A 
principal components analysis indicated that these loaded on one factor; Cronbach's 
alpha = .83.) Two items (could not shake the blues and felt lonely) were not added 
to the 40+ health module until 2002. Because respondents that responded to this 
module in 1998 or 2000 did not answer these questions, we eliminated these variables 
from our analysis. 

6. We also ran our models using several alternate BMI cutpoints: >29, >31, >32, and 
>33, with few differences. Findings present BMI > 30. 

7. We also ran all models using four categories (<30, 30-39, 40-49 and >50) and five 
categories (<30, 30-39, 40-49, 50-59 and >59) with similar results. We present 40 
hours per week as the comparison group unless noted otherwise. 


8. Presented models include hours and shift variables simultaneously. We also ran 
separate models for hours only, and shift only, with the same pattern of results. 


1482 © Social Forces 88(3) 


9. Because the CESD is not normally distributed, we ran alternative models using 
the square root of CESD, and again using the natural log of CESD score +1, with 
comparable results. 

10. Separate models using hypertension alone were also run. We found only one 
significant association (decreased probability of hypertension in 41-49 hours per 
week vs. 40 hours per week). 


11. This is consistent with the results from a correlation matrix of all our variables. 
Schools Betrayed: Roots of Failure in Inner-City Education 
By Kathryn M. Neckerman 
University of Chicago Press. 2007. 260 pages. $29 cloth. 


Reviewer: Sandra L. Wong, Colorado College 


Relationships between educational inequality, achievement and life chances are 
well-documented. Yet, despite persuasive accounts of vast differences in funding, 
resources and teaching quality between schools, as well as disparities in test scores, 
graduation rates, and other measures of performance, educational attainment 
is still routinely attributed to individual and group limitations. Based on the 
American ideology of meritocracy, explanations for achievement commonly turn 
a blind eye to structural sources of inequality while holding students responsible 
for dampened aspirations or disengagement from schooling. 

The assumption that anyone can get ahead merely through his or her own effort 
and commitment is further reinforced in comparative analyses of different racial and 
ethnic groups both historically and currently. While some racial minorities become 
exemplars of the value of hard work, others are blamed for their own failures, limited 
abilities, and lack of effort and responsibility. For example, black American students 
who do not fare well in school are often compared to white European immigrants 
in the past or to more recent immigrants of color who are portrayed as a testament 
to the capacity of groups to overcome disadvantage and unfavorable odds. 

Comparing the experiences of southern and eastern European immigrants with 
those of black Americans between 1900 and 1960, Kathryn Neckerman presents 
a different story. In contrast to popular belief, she attributes low academic perfor- 
mance among black students to a set of practices and processes that began in the 
1930s, and eventually cemented the effects of initial disadvantage. Through both 
explicit racist policies and less overt forms of neglect, Chicago's inner-city public 
schools offered black students an education that was inadequate in nearly every 
way. Their schools hired and retained the least experienced teachers. Severely over- 
crowded, they cycled students through double and even triple shifts. The absence 
of effective remedial education kept low-performing students perpetually behind. 
When these students moved on to secondary schools, often by social promotion, 
they did not receive the same benefits of the vocational education programs that 
prepared immigrant students for skilled trades. 

Neckerman’s findings are consistent with those of other historical narratives 
that examine the rise of differentiated schooling. Processes of curricular tracking 
created different paths for black and immigrant students and standardized test 
scores became a key sorting mechanism. While theories emphasizing a relationship 
between test scores and academic potential prevailed, there seemed to be 
fewer theories linking performance to the cumulative effects of inferior education. 
By relating current patterns of achievement to the pervasive and deeply entrenched 
problems of Chicago's segregated public schools, Neckerman illuminates the processes 
through which disadvantage adds to further disadvantage. 

Most interestingly, her research shows how the structural conditions in which 
teaching and learning take place affect the perspectives and attitudes of teachers 
and students, and how institutional legacies discourage teachers and students from 
putting much stock in their roles. When barriers are overwhelming and constant, 
when there are few signs of hope, what is likely to happen and what does happen, 
is that social actors, at best, “do education” by going through the motions because 
there is little evidence that a higher level of commitment will pay off. As a result, 
classrooms and school buildings become sites of struggle as teachers and students 
direct their frustration toward one another, rather than at the inadequate resources 
that make it difficult for either to achieve success. 

Unfortunately, only limited data are available during the specific years of study. 
We must infer a connection between poor school quality, segregation and the micro-level 
classroom and school practices and interactions she describes. Although quotes 
from historical data provide some evidence of how individuals respond to schooling 
environments, one wishes for more first-hand data on how parents, students and 
teachers experienced the school system and made sense of their experiences. 

Nonetheless, Neckerman’s account clearly reveals significant barriers to high 
achievement. While recognizing segregation and poverty as primary sources of 
educational inequality, she avoids creating an impression that the education of 
children in the inner city can be improved only with massive reform. She makes 
a case for enhancing the everyday lives and future outcomes of young people by 
investing in schools as a source of positive influence. Models of investment include 
community- and school-based experiments in New York City and Chicago where 
local officials, educators and parents have attempted to address issues of poverty 
and schooling as one piece. Obviously, meeting these objectives requires sufficient 
funding, and Neckerman calls for coalition building at the state and federal levels 
to promote efforts to increase resources. I look forward to introducing students 
in my sociology of education course to her work. 





Reproducing Racism: White Space, Elite Law Schools, and Racial Inequality 
By Wendy Leo Moore 


Rowman and Littlefield Press. 2008. 216 pages. $80 cloth, $29.95 paper. 
Reviewer: Debra Schleef, University of Mary Washington 


Reproducing Racism is the most comprehensive study of the role of race and insti- 
tutional structure in law school to date. Moore extends the well-developed concept 
of the gendered organization by demonstrating how organizations can be “raced” 
as well. What makes the study of law schools so compelling, the author shows us, 
is that they are raced organizations at the intersection of two powerful American 
institutions: education and the law. Law schools are “fundamentally connected 
to the political system and to the political economy of race,” and reproduce racial 
ideologies through legal authority. 

Moore conducted two years of ethnographic research at two elite urban schools 
in the Midwest. “Midstate” is a public school with 16 percent minority enroll- 
ment, while “Presidential” is private, with 30 percent. Moore interviewed at length 
more than 50 students, as well as faculty and administrators. She attended courses 
and meetings of students of color organizations, and made observations at the schools. 

Moore insists that we look at law schools as “totalizing invocations of [white] 
space.” Using Feagin’s notion of “white frame,” Moore argues that American law 
and legal education promote a set of ideas, emotions and actions that reproduce 
racial inequality. Even though the numerical presence of students of color is not 
small, there are few visible signs of people of color and their experiences in the law 
school environment. The racism that does occur in legal cases is minimized in the 
law classroom, as well as in extracurricular activities. Moore’s critique extends to 
social events planned by white students in white venues with white music. 

The case law that forms the bulk of first year reading is presented as the impar- 
tial voice of authority. Cases that are racist to their core are extolled as espousing 
abstract neutral principles. Law schools privilege the history of abstract liberalism 
in American case law, where equality means treating each individual the same 
under the law when in fact law is often used to shore up group-based concerns 
protecting white power. Moore argues that this supposed neutral stance forces 
students to ignore the power hierarchies that exist around them, from their profes- 
sors and in the law. When professors and students do address ethical, non-neutral 
principles, they are considered “not law.” 

As a result, law students of color are forced by their surroundings to adopt a 
white frame, to create “the institutional conditions by which they may be successful 
in these spaces.” Thus they are essentially required to do more work, including 
the unrecognized emotional labor of dealing with subtle racist activities, than 
white students do. What can be done to deconstruct white space? Moore suggests 
that law schools begin by centering, rather than marginalizing, the experiences 
of people of color in the law, as well as teaching law school from a critical rather 
than objective standpoint. 

What Moore’s book shows, perhaps unintentionally, is that the centering of 
the white discourse isn’t even all that invisible. There are, sadly, more examples 
of overt racism against law students of color (what Moore refers to as “Jim Crow 
racism”) in Moore’s book than there are examples of the less visible kind. Of course, 
it is difficult to see the subtle type of bias when it is so deeply embedded in the law. 
However, because that is exactly Moore’s point, one might have wished for more 
examples of how professors teach the alleged objectivity and neutral stance. She 
provides two classroom examples in Chapter 2 and two in Chapter 3, all from 
constitutional law classes during the first year. But law professors don't always 
teach the objectivity of the law, and do discuss indeterminacy with their students. 
Students do come to see the indeterminacy of the law but are perhaps not skilled 
enough at understanding it during law school to see how it privileges whiteness (as 
well as upper class, masculine and heterosexual frames). I would also be interested 
to know whether any white students contested the racist practices of law school. 
They appear here to be monolithically disinterested in diversity. 

Finally, because Moore makes the argument that blacks and other law students 
of color should be allowed to tell their own stories, I wanted more discussion of 
the methodological tools she utilized to establish rapport and ensure that she was 
faithfully reproducing the voices of these students and avoiding “the normaliza- 
tion of white perspectives about the experiences of people of color.” Her method- 
ological appendix is a scant four pages long. However, I suspect her whiteness was 
critical to the exchange with the white professor who said of students of color, “We 
give them the moon, but they still complain.” Nevertheless, Reproducing Racism 
is a fascinating study of the complex dynamic between race, education, abstract 
legal liberalism and the American legal system. 


Citizen Employers: Business Communities and Labor in Cincinnati and San 
Francisco, 1870-1916 

By Jeffrey Haydu 

Cornell University/ILR Press. 2008. 272 pages. $39.95 cloth. 


Reviewer: Chris Rhomberg, Fordham University 


Why has American business been so distinctly and bitterly resistant to unions? 
Beyond any economic reasons, Jeffrey Haydu finds the roots of management ideology 
among late 19th-century entrepreneurs breaking loose from an earlier, cross-class 
regime of producer republicanism. In this sophisticated and well-crafted book, 
Haydu compares employer class formation and labor relations in Cincinnati and 
San Francisco to demonstrate the alternate paths of business’ collective action. In 
the course of industrialization, Cincinnati’s employers developed a familiar ideologi- 
cal mix of free market capitalism, civic reformism and virulent anti-unionism. Yet, 
during the same years small business in San Francisco took a very different road, 
choosing instead to ally with skilled labor in the economy and in politics. Haydu’s 
analysis of their divergence provides a stimulating account of the form and content 
of business class hegemony in industrial relations and in American political discourse. 

The argument is built on a three-sided foundation. First, Haydu aims to show 
a trend of increasing class stratification between employers and employees, in- 
cluding the breakdown of vertical ties between groups and the strengthening of 
horizontal, intra-class ties among businesses. Second, he focuses on the discursive 
content and boundaries of class and civic identity for employers, showing how 
these were selectively mobilized from a menu of pre-existing cultural possibili- 
ties. Finally, he examines how these schemas were transposed across institutional 
borders, from the workplace to the polity and back again. 

In Cincinnati, Haydu contends, employer concerns centered not on the rela- 
tively weak union movement but on a fading commercial economy, the corrupt 
political machine led by Republican boss George Cox, and a fear of urban mob 
disorder stemming from the Court House riots of 1884. Business class solidarity 
developed through mobilization for economic boosterism, cultural enrichment 
and civic improvement. Local merchants, manufacturers, managers and profes- 
sionals elaborated a collective identity of “business citizenship,” achieving status 
closure and distancing themselves from working class culture. This shift allowed 
employers to see themselves as non-partisan protectors of the public interest, and 
to regard union-busting as a positive civic responsibility. 

Unlike Cincinnati, employers in San Francisco faced a combination of stronger 
unions, corporate power (particularly in the form of the Southern Pacific Railroad) 
and a large immigrant Chinese population. Forced to contend with labor, small 
proprietors forged a grudging alliance with skilled white workers against both low- 
wage competition and greedy corporations, cemented by racism against the Asian 
other. This “practical corporatism” legitimized class representation in the economy 
and in politics, balancing class organization and interests with peaceful, industry- 
wide negotiations and tripartite mechanisms of civic mediation and arbitration. This 
path persisted until the years before WWI, when the conditions that sustained it 
reversed: a waning salience of anti-Chinese racism as a medium of cross-class unity, 
the collapse of the Union Labor Party and a decline of class representation in politics, 
and the turn of employers to a Cincinnati-style business citizenship model. 

The analytic framework is robust enough to permit substantive historical 
argument. For Haydu, business citizenship in Cincinnati develops partially in 
opposition to urban machine politics. But, as Martin Shefter and Steven Erie 
have shown, machine politics itself can represent a form of class compromise, 
with party bosses brokering ties among capitalists and ethnic working classes. 
I would have liked to see more on the interests and alliances within the Cox 
machine (which endured for more than two decades), as well as in the pre- 
corporatist Buckley regime in San Francisco, the better to appreciate the terrain 
from which their subsequent paths emerged. 

Similarly, while Haydu emphasizes the role of race in San Francisco, in 
Cincinnati it is simply absent. Haydu claims that, unlike San Francisco’s Chinese, 
blacks in Cincinnati were largely excluded from manufacturing and posed little 
threat to white workers or small entrepreneurs. Yet, the very success in segregat- 
ing black workers indicates a degree of white racial unity. With industrialization, 
black workers in many cities lost occupational status, and it would be interesting 
to know how Cincinnati fared as a destination for black migrants compared with 
other regional cities such as Louisville and Indianapolis. 

These criticisms, however, do not diminish the utility of Haydu’s approach or 
what he has accomplished. The book draws on a diverse range of archival data, 
including documents from industrial fairs, mechanics institutes and employer 
personnel policies as well as chambers of commerce, civic associations and other 
sources. Altogether, the scholarship is elegant and meticulous, a model of com- 
parative historical research and method. Haydu provides a valuable historical 
context for current controversies in labor-management relations, class inequality, 
collective action and American political development. At first glance, Cincinnati 
businessmen might not seem like the most compelling historical dramatis personae, 
but Haydu shows us we can learn more from them than we may think. 





Fairies, Bears, and Leathermen: Men in Community Queering the Masculine 
By Peter Hennen 
University of Chicago Press. 2008. 240 pages. $50 cloth, $20 paper. 


Reviewer: James Joseph Dean, Sonoma State University 


In his engaging study of three gay American male subcultures, Peter Hennen ana- 
lyzes the play of what he calls the effeminacy effect (the stigmatizing of homosexual 
men with femininity) and its social-historical role in shaping the gay masculinities 
of fairies, bears and leathermen over the past half century. Using multiple methods, 
from ethnographic participant observation to conducting 23 in-depth interviews as 
well as drawing on archival and historical secondary sources, Hennen draws a clear 
and thorough picture of the masculinity dynamics that animate each community's 
approach to negotiating the “effeminacy effect.” He situates his three community 
case studies of gay masculinities as responses to the social-historical context that 
constructs male same-sex desire as “failed” forms of masculinity. 

In a wide-ranging and insightful chapter on the cultural and historical construction 
of effeminacy, he defines the term as “a historically varying concept deployed primar- 
ily as a means of stabilizing a given society's concept of masculinity and controlling 
the conduct of its men, based upon a repudiation of the feminine.” Hennen, rightly, 
focuses our attention on effeminacy and its disciplinary shaping of (gay) masculini- 
ties. What I like even more than this clear contextualizing is his use of Bourdieu's 
concept of embodied masculinity. By focusing on the embodiment of masculinity in 
the three communities, he brings to the analytical fore the role of the body in gender 
and sexual identity politics and subcultural community formation. Each subcultural 
gay community is given its own individual chapter, moving the reader from the most 
(fairies) to the least (leathermen) effeminate embodiment of masculinity. 

The Radical Faeries, formed in 1979, are a group of gay men who reject urban 
gay culture and its consumerist and sexual cruising habitus. The Faeries’ response 
to the hypermasculine images of urban gay men (men who wear tight jeans and 
form-fitting shirts to reveal their muscular bodies) is to do an about face and 
instead embrace their femininity through the practices of drag. However, while 
aiming to subvert normative gay masculinity, Faeries’ drag practices do not paint 
the typical drag queen portrait; passing and “realness” are not the standards they 
apply to their drag practices. Rather, faeries in drag “look very much like men 
in dresses.” However, due to their predilection to seek “sanctuary” in the woods 
and nature, thus secluded from the world around them, Hennen views the Faerie 
community's agency for challenging gender normativity as limited and confined 
to the subculture’s members. 

The Bear community emerged in the 1980s in part from a national organiza- 
tion called Girth and Mirth (large men and their admirers). Like the Faeries, the 
Bear community embraced a new type of gay masculinity that eschewed effemi- 
nacy. Rather, the gay masculinity of Bears eroticizes a male body that is heavier, 
hairier and older than the normative gay male body. However, Bear men also 
embody a form of “complicit masculinity,” the author argues, as they want to be 
viewed as regular guys. They are gay men who are gender conventional, probably 
perceived as straight by the casual observer, but identify as gay. Nonetheless, the 
Bear community's contestation of effeminacy and male homosexuality might also 
portend “the potential to turn back on the very system that produces it,” as it 
clearly undoes associations of feminine men and homosexuality and masculine 
men and heterosexuality. Bears are masculine gay men. 

Coming out of the ashes of WWII, gay veterans traumatized by the war, along 
with the newly formed gay biker clubs, started the gay leather community in the 
1950s. The gay leather community is also the community that most strongly dis- 
avows the association of male homosexuality and effeminacy. Embracing a hyper- 
masculine sexuality, the leather male community eroticizes the male body through 
leather clothing and a variety of sexual practices, ranging from sadomasochism 
and bondage and discipline to other “kink” practices. This community remakes 
the gay male body into an aggressive instrument. Here rough and often public 
sex and a tough, intimidating dress style lead gay leathermen to almost disavow 
their gayness. “The vast majority of leathermen I encountered identify themselves 
first and foremost as men and, in some cases, men who understand themselves as 
actually more masculine than heterosexual men.” 

In sum, this study of three gay male communities helps us to empirically con- 
cretize concepts like embodied masculinity and contextualize the role effeminacy 
plays in constructing (gay) masculinities. Hennen’s book is not only a pleasure to 
read but also a well-written study which will appeal to students, the general reader 
and scholars interested in masculinities, sexualities, and queer and contemporary
social theory.




The Elusive Dream: The Power of Race in Interracial Churches
By Korie L. Edwards 
Oxford University Press. 2008. 227 pages. $29.95 cloth. 


Reviewer: Kevin D. Dougherty, Baylor University 


What does it take to sustain a church where blacks and whites worship to- 
gether? According to Korie Edwards in The Elusive Dream: The Power of Race 
in Interracial Churches, it takes a congregational culture familiar to whites. Her 
thesis is that “interracial churches work to the extent that they are, first, comfort- 
able places for whites to attend.” 

Edwards roots her explanation in racial theory and the concept of white hege- 
mony, the dominance associated with whiteness as a racial category in the United 
States. It is a dominance rooted in social institutions and cultural practices, yet 
largely invisible to those benefiting from it. She defends her thesis using national 
survey data from congregations and a case study of one interracial church in the 
Midwest. Chapters explore worship, civic engagement, leadership, members’ racial 
identity, reasons members join and the reproduction of white hegemony. A short
conclusion with a call to action ends the book.

The multi-method approach aptly extends previous ethnographic studies on the 
topic. It provides Edwards’ analysis both breadth and depth. The statistical analysis 
reveals that America’s interracial churches—those populated predominantly by 
blacks and whites—resemble white churches more than black churches in wor- 
ship style, social and civic activities, and the race of clergy. Yet, it is the case study 
where analysis is most compelling. Edwards takes us inside a congregation she 
calls Crosstown Community Church. Crosstown is an 80+ year old congregation 
that went from all white to two-thirds black over the past two decades. The racial 
transition inside Crosstown was precipitated by a near complete racial turnover 
from white to black outside its doors. 

The analysis of Crosstown often reads like a good story. There is the sanctioning
of Lydia, a black woman prone to loud expressions of praise during worship
services, and a fascinating account of the contentious hiring process for two 
pastoral positions. Edwards weaves narrative and analysis superbly to illuminate 
racial overtones and power inequalities involved in church deliberations over 
worship and leadership. Racial differences do not disappear inside Crosstown. 
Even though white members are a numeric minority and the senior pastor is 
black, white hegemony persists through a collective process of reinforcement 
by both whites and blacks who concede their own cultural preferences in order 
to accommodate whites. Her conclusion admonishes churches (and other
non-religious communities) “not to accept the convenient counterfeit of mere racial
integration but to strive toward becoming communities that celebrate racial
justice and equality.”

The framing of “integration” is one difficulty for me with the book. Edwards
uses the term to refer to racial diversity without regard to shared power or interac- 
tion. She acknowledges in a footnote that others define integration differently. I 
believe her analysis would have benefited from the distinctions of DeYoung et 
al. (2003) among assimilated, pluralist and integrated multiracial congregations. 
These distinctions capture variation in interaction, power and the salience of 
racial differences within racially mixed congregations. In integrated congregations, 
racial groups no longer see themselves as different. Edwards’ findings raise doubts 
that congregations like Crosstown can achieve this ideal. Advocating for equality 
amidst salient racial differences suggests that the best we can hope for are pluralist 
multiracial congregations. 

This leads to a second difficulty I have with the book. Edwards expects white 
hegemony to exist in any mixed-race congregation where whites are present. It 
does in Crosstown where only 30 percent of members are white. However, for six 
decades Crosstown was exclusively a white church. After a congregational culture 
is formed, it is not easily transformed. Were an African Methodist Episcopal 
church to become 30 percent white in membership, would it cease to resemble 
an AME church? Not likely, although it is an open empirical question. White 
hegemony may be a feature most pronounced in white congregations transitioning 
to be multiracial. Furthermore, I question Edwards’ contention that her findings 
apply beyond the black/white dichotomy. The black experience in the United 
States is unique. Edwards, who is black, cites this as a motivation for her study. 
Cultural differences may represent less formidable barriers for other racial-ethnic 
combinations in congregations. 

Despite these concerns, The Elusive Dream is a welcome addition to a grow- 
ing body of literature on multiracial congregations. It brings attention to the 
racial dynamics at play in congregations uniting blacks and whites. It straddles 
sociological subfields of religion, organizations, and race and ethnicity. It also 
affirms Edwards, an assistant professor of sociology at Ohio State University, as 
a prominent voice on race and religion. For those interested in the topic, make 
room on your bookshelf for The Elusive Dream.




Race, Space, and Riots in Chicago, New York and Los Angeles
By Janet L. Abu-Lughod 
Oxford University Press. 2007. 360 pages. $35 cloth. 


Reviewer: Amy Kate Bailey, Princeton University 


In the 20th century United States, the most widely-known race-based conflicts
erupted in urban settings. While similar community grievances typically triggered 
unrest across cities, the form of rebellion, level of violence and socio-political out- 
comes varied substantially. In Race, Space and Riots, Janet Abu-Lughod provides 
description and analyses of major 20th century race riots: Chicago's 1919 and 1968
riots; Harlem and Bedford-Stuyvesant’s revolts in 1935, 1943 and 1964; and the 
Watts and South Central Los Angeles uprisings in 1965 and 1992. Abu-Lughod 
analyzes the factors influencing each conflagration and its outcomes, with par- 
ticular focus on the way that race relations and interracial violence are structured 
by local political, economic, demographic and spatial configurations. She finds
that the nature and swiftness of local political leaders’ response, particularly as
evidenced by the course of police actions, was the most important determinant of
the duration and severity of unrest. For example, she credits city officials’ ability to 
directly address community grievances and rapid police response with shorter, less 
destructive outbursts in New York than in the other cities, and points to elected 
officials’ inaction and the delay in launching coordinated police containment 
efforts as contributing to the escalation of civil unrest in Los Angeles. Political 
culture also influenced the nature and intensity of violence, as evidenced by the 
sophisticated political organization of New York’s black community, serving to 
lessen the severity of the racial crisis, and the relative exclusion of blacks from 
Chicago's political machine, where the 1968 riot inflicted massive property dam- 
age on white-owned businesses. 

This work also emphasizes the critical link between civil disturbances, the level 
of residential and social segregation and the concentration of disadvantaged groups 
within the metropolis. The spatial patterning of development within each city, and 
the demographic distribution of its population, directly affected the onset of un- 
rest, the ability of police to suppress violence, and the course of action pursued by 
urban insurgents. The location of each riot—both generally as well as in terms of 
specific targeted sites, such as businesses or police stations —was predicated by the 
history of race-based power struggles over both space and community resources. 
In Chicago, for example, both events Abu-Lughod highlights were explicitly over 
space, particularly black access to housing in traditionally white neighborhoods. 
In Los Angeles, conversely, the diffusion of residential development and racial 
heterogeneity of the city led participants in the 1992 uprising not only to target
Korean-owned businesses in the black and Hispanic South Central neighborhood,
but also to attack Koreatown itself.

However, not all ideas are developed as fully as they might be, and some provocative
themes raised in the introductory portion of the book are not revisited.
For example, in her introductory chapter, Abu-Lughod briefly mentions the role of 
public transit in mediating the ability of police in both New York and Chicago to 
effectively isolate areas where violence was occurring, by re-routing buses and clos- 
ing subway stations. In Los Angeles, conversely, reliance on automobiles hampered 
spatial containment. Angelenos drove into Watts to participate in the 1965 riot, 
and in 1992, utilized the freeway system to spread violence and property damage 
throughout the metropolis. However, during the later in-depth discussion of the 
events in LA, she does not return to this idea, and discusses broad distribution of 
unrest without analyzing the contribution of local infrastructure. Abu-Lughod also 
discussed a number of additional events—the 1968 riots at the Democratic National 
Convention in Chicago, for example, and several smaller incidents in New York 
and Los Angeles. These events could have worked to provide further illustration of 
her claims, or perhaps allowed her to tell a more complete story of racial conflict in 
the United States’ leading cities. Unfortunately, these supplementary illustrations 
were treated with uneven levels of both historical detail and critical analysis, with 
the result of leaving the reader wanting more information, or questioning why 
these subsidiary events were not included in the main analytic narrative. 

This book is an outgrowth of prior work done for Abu-Lughod’s 1999 book 
New York, Chicago, Los Angeles: America’s Global Cities, and her expertise in these
rich municipal histories allows her to embed events within the broader local 
context, weaving together multiple elements that structured the ways in which 
episodes of interracial violence developed, played themselves out, and wrought 
changes. Her selection of paired examples for each city—with an early and a late 
example of local interracial violence— provides a more nuanced analysis than is 
possible with large-sample quantitative methods, but includes enough cases to 
meaningfully compare the contextual factors that affect when and how racial 
tensions erupt into violence. The result is a far-reaching narrative that locates 
historical events in their social environments, and should provide valuable reading 
for scholars of race relations, collective action and urban communities. 





Where Memory Dwells: Culture and State Violence in Chile 
By Macarena Gómez-Barris
University of California Press. 2009. 240 pages. $60 cloth, $24.95 paper.


Reviewer: Mark A. Wolfgram, Oklahoma State University-Stillwater


One can sort memory scholarship into two distinct categories, although each 
individual work may well combine aspects of both. One trend in the scholarship 
treats memory, with its associated modifiers of public, collective, political, social, 
hegemonic and so forth, as a metaphor. Various cultural artifacts that represent
the past are classified as memory. A second trend in collective memory scholarship
locates memory in the minds of individuals, but recognizes a collective
and social aspect to this very individual and personal realm. Thus, individual 
impressions of the past, informed by the collective representations of a society's 
past, are seen as central to the study. 

Macarena Gómez-Barris places the emphasis on the first trend in this work
that focuses on the legacy of General Augusto Pinochet's military regime in Chile, 
which lasted from 1973 to 1990. She is especially interested in the legacy of the 
state’s use of violence against its own citizens, and looks for where these memories, 
or representations of the past, continue to exist in the form of a memorial site, 
the former concentration camp Villa Grimaldi, the abstract art of Guillermo
Núñez, himself a survivor of torture, several documentaries, the work of Patricio
Guzmán perhaps being the best known, as well as an art exhibit that the author
helped to organize in Berkeley, California. As a second-generation Chilean liv- 
ing in the United States, and whose family was forced to flee Pinochet's regime, 
Gómez-Barris has a very personal engagement with her work. The role of first- and
second-generation scholars working in some form of exile has been quite prevalent
in the field of memory studies.

The author has consciously chosen to work with the producers of narratives of 
the past that are operating well outside the official capacity of state institutions. 
She uses the term “memory symbolic” to identify these struggles between the 
state and those operating largely independent of the state. Although the Chilean 
state is democratizing, she remains suspicious of state power, especially the neo- 
liberalism that has continued well beyond Pinochet’s rule. She clearly recognizes 
the democratic transition, while also mistrusting the willingness of public officials 
to deal with the dictatorship’s violent past. This leads her to identify, “... a central 
epistemological inquiry of my book: if management and concealment are primary 
modes of hegemonic historical transitions, where indeed can memory be found?” 
She continues, “Reconciliation processes, ‘putting the past behind us,’ are central
ways that governments get on with the business of reconstructing the nation, 
often at the expense of victims and their families... In other words, while power 
from above moves to close the issue of violence for the nation, from below the 
heterogeneous effects of violence constantly threaten to rupture and disarticulate 
the transitional national project.” 

After an opening chapter orientated toward theoretical questions, the rest of 
the book offers a close reading and interpretation of the different social texts, rep- 
resentations, memories or narratives collected. One of the difficulties of pursuing 
memory scholarship along the lines of the first trend mentioned above is that the 
author becomes, for the most part, the sole interpreter of these dense social texts. 
Gómez-Barris recognizes this obstacle at the opening of her work, noting, “Herein
lies the challenge of examining cultural memory, where artists, their works, and
audience engagement reshape and animate contemporary society.” The meaning-making
process of culture requires that scholars examine how individuals, embedded
in social groups and social contexts, encounter and engage different cultural
artifacts and then engage others with their interpretations. 

The strength of Gómez-Barris’ work is her first-hand victim perspective on
these narratives about Chile’s past. She gives an intimate portrait of her engagement
with these works and, especially in the case of Guillermo Núñez, with the
artist as well. She is self-reflective on her identity as a second-generation Chilean
woman living in exile. But this means that the broader public’s engagement with 
the same narratives is largely missing. How might the perpetrators of violence, 
their supporters and bystanders react to the same narratives? Commenting on one 
of the documentaries, she writes that one scene, “...forces the viewer to confront 
the mother’s trauma, empathize with it, and ask oneself about the human cost 
of extreme violence...” But a work of art cannot force the viewer to experience 
anything—even revulsion at extreme levels of violence. The perpetrators of the 
violence and their supporters, more often than not, remain unmoved and will 
justify their violent acts and continue to mock their victims. Bystanders will often 
refuse to be drawn in or feel implicated in what confronts them. Victims, perpe- 
trators and bystanders occupy positions in the state, society and exile, and their 
narratives and interpretations of the past will probably continue to divide Chile 
for generations to come. As the author has shown, the scars of violence run deep. 





Marked: Race, Crime, and Finding Work in an Era of Mass Incarceration 
By Devah Pager 
The University of Chicago Press. 2007. 256 pages. $25 cloth, $15 paper. 


Reviewer: Thomas P. LeBel, University of Wisconsin-Milwaukee 


A major consequence of mass incarceration in the United States is that close to 
700,000 prisoners will be released from state and federal prisons each year for 
the foreseeable future. In Marked, Devah Pager provides a well-researched and 
well-written book discussing the impact of criminal record and race on the employment
prospects of formerly incarcerated persons. This book contributes to a
growing body of literature examining prisoner reentry, while also contributing to 
the literature about the impact of stigma and discrimination. 

Following an introduction which carefully lays out the logic and plan for 
the book, Pager explains the reasons for the drastic rise of incarceration in the 
United States as well as the problems of prisoner reentry into society. She fol- 
lows this chapter with a discussion of how incarceration contributes to negative 
employment outcomes. 

An important contribution of this book is its attention to issues of measure- 
ment for studying the labor market consequences of incarceration. In Chapter 3 
the author discusses the strengths and weaknesses of various approaches that have
been used to measure this phenomenon. The author then provides a detailed description
of the study’s experimental audit methodology, chosen for its usefulness
in “measuring discrimination in real-world settings.” In this study, matched pairs
of individuals (called testers) posed as job applicants in real job searches. The testers
were given fictitious resumes with the only differences being their race (black
vs. white) and assignment of a criminal record (a felony drug conviction and 18
months served in prison). A total of 350 employers in the greater Milwaukee area
were audited. The author provides a lengthy discussion in Appendix 4A of the
audit design and the many steps taken to preserve the integrity of the study.

The early stage of the employment process is examined, with the primary outcome
of interest being the proportion of applications that elicited a call back from the
employer or a job offer on the spot. In chapters 4 through 6 the author
discusses the extent to which persons were discriminated against in the hiring 
process because of their criminal record and/or their race. Pager found that for- 
merly incarcerated persons are half (for whites) to a third (for blacks) as likely to 
receive a call back from employers as compared to applicants without criminal 
records. Among those without criminal records, blacks were less than half as likely 
to receive call backs as equally qualified whites (14 vs. 34 percent). In what might
be the most surprising and important finding in this study, a white applicant
with a criminal record was just as likely to receive a call back as a black applicant 
without a criminal record (17 vs. 14 percent). Moreover, the extremely low call 
back rate for black formerly incarcerated persons (5 percent) indicates that for 
these applicants, “with two strikes, you’re out.” Pager contextualizes these findings
by analyzing the impact that personal contact with employers and the location of 
the employer (city vs. suburb) have on the likelihood of call backs. 

In Chapter 7 Pager provides valuable information about employers’ consid- 
erations and concerns in evaluating applicants. Interviews with the same sample 
found that employers are very concerned about the potential liability of hiring 
persons with criminal records and are also sensitive to the type of conviction. 
Meanwhile, minority-owned businesses are much more likely to express a willing- 
ness to hire persons with criminal records, while employers who have hired an 
applicant with a criminal record in the past year are more likely to report favorable 
attitudes towards hiring these applicants. 

Throughout the book, Pager argues that the results of this study are conserva- 
tive and perhaps represent a best case scenario for the employment prospects 
of persons with criminal records because of the favorable characteristics of the 
testers and the sample of employers audited. In the final chapter Pager discusses 
several prisoner reentry-related policy recommendations including “avoiding the 
mark” by sending fewer drug offenders to prison, providing greater support for 
the transition of released prisoners into employment, and “easing the mark” by 
adding mechanisms to regulate who has access to criminal records and removing 
criminal records from the public domain after a set period of time has elapsed.

After reading this book one is struck by the fact that in the 21st century one’s
race continues to limit the opportunity to succeed in America. Pager argues that 
“rather than being merely a problem of the past, direct racial bias continues to
shape employment outcomes in ways that contribute to persisting racial inequal- 
ity.” Moreover, she asserts that “race must not disappear from our discussions of 
public policy or our conversations about the problems of prisoner reentry.” Pager’s 
book deserves to be read widely by anyone interested in prisoner reentry as well 
as those interested in stigma and discrimination based on one’s race or ethnicity. 




The World and US Social Forums 
Judith Blau and Marina Karides, editors 
Lexington Books. 2009. 252 pages. $29.95 paper. 


Reviewers: Gary Coyne, Preeta Saxena and Ellen Reese, University of 
California-Riverside 


The World and US Social Forums, edited by Judith Blau and Marina Karides, ex- 
plores both the political significance of the World Social Forum and the United 
States Social Forum, and the role of scholar activists within them. Since the first 
WSF meeting in 2001 in Porto Alegre, Brazil, the WSF has quickly become the 
largest international meeting of social justice activists, inspiring the spread of 
local and national social forums across many countries. Bringing together activ- 
ists across space and movements, the social forum process has helped activists to 
exchange ideas, share strategies, coordinate actions and expand and strengthen 
their social networks. Most of the volume’s contributors are scholar-activists based 
in the United States, but some are from Canada, Sweden and South Africa. A few 
chapters were written by key activists in the social forum process, namely Chico 
Whitaker, who helped to found the WSF in Brazil, as well as Michael Guerrero 
and Jerome W. Scott, who played key roles in organizing the USSF. Like any
multi-author volume, there is some unevenness in the contributions. Rather than 
addressing each chapter individually, we will review the book’s three major sec- 
tions and then the overall volume. 

Part I focuses on the USSF, which first occurred in Atlanta, Georgia in 2007
and will happen again in 2010 in Detroit, Michigan. Some chapters focus on the
history and emergence of the USSF, while others explore how it fosters new kinds
of politics and praxis, especially with regard to conflict and difference. A highlight
of this section is Michal Osterweil’s chapter, which emphasizes the success of
the USSF, primarily its diverse participation, and the deeper implications
of this diversity. The chapter claims that the conflict between an indigenous speaker
and the caucus organizer (when the organizer cut off the microphone from the
speaker) at the closing plenary exemplified how participants overcame differences
in their “systems of values.” Osterweil argues that, although at face value this was
seen as a negative instance, it was in actuality representative of conditions that are
conducive to political action among radically diverse groups. Rather than viewing
diversity as a “problem to be solved,” difference and conflict were treated as
a necessary and fundamental part of improving political practices. The argument 
was both well-formulated and strongly supported by evidence. 

Part II of the book engages the debates about the function, challenges and 
future of the WSF. Stellan Vinthagen’s examination of the relationship between
global civil society and the WSF provides a number of interesting conceptual tools. 
As for the relationship between the two, if civil society is a space of communicative 
rationality, then the WSF certainly is the most inclusive and participatory such 
space yet created at the global level. In this sense, then, the WSF can be understood,
by analogy, as a real life open-source Wikipedia: accessible and diverse, but at the 
same time also in possession of a protocol that is moving toward a more authentic 
incarnation of its ideal under direction of a self-reflexive set of protocols. It is the 
continued ability of the WSF to learn from its past that gives it the potential to 
become a truly global civil society. 

Part III of the book examines the various roles played by scholar activists at 
social forums and how these meetings foster the development of new kinds of 
knowledge. Patrick Bond argues that the 21st century anti-capitalist manifesto
should reflect the excellent work already being done by social movements, whose
“victories and failures” need to be studied carefully. Lyndi Hewitt discusses her
shifting role within the forum as both participant and researcher of feminist 
workshops. Steven Sherman’s chapter reflects on how social forums are productive 
sites for leftist intellectuals to exchange ideas, and for activists to find academics 
to meet their research needs. Mark Frezzo’s and Eunice N. Sahle’s chapters focus
on how the WSF has promoted human rights and challenged dominant models 
of economic development and neoliberal globalization. 

This book will mostly appeal to scholar activists and other social justice activists 
who are curious about the social forum process, but not already familiar with it. 
Many of the authors assume a certain amount of background political knowledge 
that the true layperson would likely not have. Some chapters could be useful to 
those teaching undergraduate or graduate classes on social movements, although 
undergraduates might need some clarification or additional background material 
when reading the text. Many chapters provide an excellent overview of important 
contemporary issues and movements. The chapters in Part III are a “must read” 
for scholar-activists at all levels; they have important implications for how all 
scholar-activists, not just those at social forums, should relate to social movements.




Same Sex, Different Politics:
Success and Failure in the Struggles Over Gay Rights 


By Gary Mucciaroni 
The University of Chicago Press. 2008. 392 pages. $60 cloth, $24 paper. 


Reviewer: Steven A. Boutcher, University of California, Irvine 


Why has the American gay rights movement achieved success on some public policy 
issues but not others? In Same Sex, Different Politics, Gary Mucciaroni provides a 
nuanced framework that incorporates both public opinion and the structure of po- 
litical institutions as key mediating mechanisms in his analysis of six different LGBT 
policy issues: marriage and partnership benefits, adoption, sodomy repeal, military 
service, hate crime laws and civil rights (employment discrimination). Mucciaroni 
argues that each policy issue contains a different set of politics and is “marked by 
a distinct struggle and by political and institutional differences that are critical for 
determining success.” For example, the institutional forces surrounding the struggle 
to repeal “don’t ask, don’t tell” are very different than the politics that surround
the passage of hate crime laws. This comparative insight, while seemingly simple,
provides a framework that proves to be crucial for understanding why gay rights
advocates are successful on certain issues but not others. 

Two components are at the center of Mucciaroni’s analysis: the extent to which 
the general public finds an issue threatening and the structure of political institu- 
tions surrounding each policy issue. Treating each component separately doesn’t 
adequately explain why some issues are more successful. For example, sodomy 
repeal (legalization of homosexual conduct), marriage equality and adoption are 
the gay rights issues most threatening to the general public, but they have reached 
very different outcomes. All state sodomy laws have been repealed, adoption is 
allowed in an increasing number of local and state jurisdictions, and marriage 
equality has yet to have widespread success. Thus, public threat alone does not 
provide the full answer, but is mediated by political institutions. 

Mucciaroni argues that three institutional variables matter for LGBT policy 
success: decentralized policymaking, third party stakeholders and judicial in- 
volvement. Various combinations of each of these variables have led to different 
outcomes across policy issues. Whereas the civil rights movement has histori- 
cally been successful at the federal level, the LGBT movement has faced a very 
different, and largely antagonistic, federal government. The closure of national 
political opportunities resulted in a decentralized strategy targeting state and local 
municipalities—a strategy that has proved relatively successful because federalism 
provides multiple points of access for activists to target (Meyer 2007). For example, 
although adoption rights is one of the most threatening policy agendas pursued by 
the movement, a decentralized strategy has resulted in 10 states passing statewide 
laws and another 15 states where some local jurisdictions allow adoption rights. 
Currently, an estimated 63 percent of the gay and lesbian population live in 
jurisdictions that allow second-parent adoptions. 

Decentralized policymaking, however, is not entirely sufficient to explain suc- 
cess. The presence of third party stakeholders is also important. For example, the 
success of the gay rights movement in repealing state sodomy laws was assisted 
by the American Law Institute, a group of legal professionals that pushed for the 
adoption of the Model Penal Code in state legislatures during the 1950s and 1960s. 
These professionals were not interested in promoting gay rights but were focused on 
modernizing state criminal codes and getting rid of laws that were seen as outdated 
and not enforceable (see Pierceson 2005). Similarly, key stakeholders in the adoption 
issue included social workers and adoption agencies open to gay and lesbian par- 
ents. Conversely, Mucciaroni argues that marriage has been relatively unsuccessful 
because of the relative lack of stakeholders, although this point can be debated (legal 
professionals have been very active in advocating for marriage equality). 

Finally, issues where the courts have taken a lead appear to be more success- 
ful than when the courts take a back seat to state legislatures. In the case of the 
military, courts have overwhelmingly deferred to the executive and legislative 
branches—effectively upholding the policy of “don’t ask, don’t tell.” Conversely, 
the courts’ central role on the adoption issue has been a key factor for securing 
adoption rights. However, Mucciaroni is quick to point out that the role of the 
courts is mixed in other issues, such as sodomy repeal where state legislatures also 
played a key role in repealing sodomy laws. 

Although Mucciaroni provides an erudite analysis of gay rights policy success 
and failure across disparate policy issues, I found the omission of the LGBT move- 
ment's strategy especially puzzling. In a book focused on gay rights advocacy, there 
was very limited attention paid to the gay rights movement. Activists strategize 
on which venues to target and what issues to pursue, but we don’t know how 
these decisions are made. Similarly, activists work hard at building alliances with 
stakeholders, but the reader is left wondering how these alliances came about in 
the first place. Placing the movement as a central actor in the analysis would have 
strengthened the book. Despite this limitation, however, this book is a gem, and 
scholars working at the intersection of social movements and public policy will 
want to read it. This book is a must-read for scholars of public policy, gay 
rights and social movements. 

Transnational Tortillas: Race, Gender, and Shop-Floor Politics in Mexico and the 
United States 


By Carolina Bank Muñoz 
Cornell University Press. 2008. 216 pages. $49.95 cloth, $18.95 paper. 


Reviewer: Paul L. Greider, St. Cloud State University 


In this informative book, Carolina Bank Muñoz reveals the polity between 
governments and a transnational corporation which ultimately affects the common 
worker. In Mexico, tortillas are not only a staple food but a tradition that is a 
critical segment of its economy. Political agreements between the United States 
and Mexico have had a great impact on the making of tortillas. The resulting 
macro policies that come about by the intimacy of governments and corporations 
are central to the micro-level processes of the shop floor, and ultimately the social 
problems created for workers employed by the Tortimundo Company, which uses 
these policies, along with “racialized and gendered labor markets,” and immigrant 
status to control labor. 

Overall, employers in Mexico and the United States are reaping the benefits 
of U.S. immigration policy. Mexico is influenced by U.S. policy such as NAFTA, 
which, in turn, influences much of the trade and business policies within Mexico. 
Other points in her argument involve the interlinking of the U.S. and Mexican 
border and the countries’ labor markets, along with the issues of race, gender and 
class, and their use as controllers of labor. Mexico is dependent on supranational 
institutions like GATT, the World Bank and the IMF, which ultimately has re- 
sulted in a loss of workers’ rights there. Thus, the border has become an arbitrary 
line which marks the difference between those with higher wages versus those with 
lower ones; those with infrastructure versus those without; and those with legal, 
social, political and economic capital versus those with little of these. 

Enter two tortilla-making factories owned by the Tortimundo Company: one 
is in Hacienda, CA, the other in Hacienda, Mexico or Baja California. Each plant 
has a unique managerial style to facilitate production. Workers at Hacienda CA 
are subjected to a regimented existence on the shop floor where they are under 
constant surveillance, while workers at Hacienda BC are not. The end result is still 
the same; workers are controlled and manipulated to the benefit of management. 
However, the tempering method used to subjugate workers depends on which side 
of the border they are located. 

The shop-floor dynamics at these two factories are very different. 
Managers at Hacienda CA rule with a strict supervisory style based on a U.S. 
business model, where worker efficiency is the primary goal. Absenteeism and tardiness 
are not tolerated. Health and safety rules must follow company protocol and are 
strictly enforced—more to the benefit of the company than to the worker. Managers 
use a variety of tactics to divide and control workers, with immigration status as the 
key, ultimately facilitated by U.S. immigration policies. Here, documented workers 
are subjugated by hegemonic control, while undocumented ones are subjugated 
by despotic control. Workers are pitted against each other based on their immigra- 
tion status, a way management divides the workers. In this “immigration regime” 
undocumented workers are faced with the inflexibility of managers who hold the 
fear of their immigration status over them. However, sexual harassment is taken very 
seriously in the U.S. plant because of concerns with U.S. laws. 

On the other hand, at the Hacienda BC plant located in Mexico, women are 
exploited on the basis of their gender. Managers select from a docile female labor 
market to inflate their control over workers on the shop floor in what Bank Muñoz 
calls a “gender regime.” On this shop floor women are not only sexually harassed 
by managers, they feel that they have to compete with other workers in order to keep 
their jobs. This includes flirting with managers and being subjected to an array 
of sexual innuendos in exchange for job stability and promotion. Race is a relevant 
issue as well, and is used as a tactic by managers to divide the workers. 

The workers ultimately try to fight back, with mixed results. Bank Muñoz 
shows her pedigree as a labor activist here and indicates that globalization has been 
hard on workers regardless of which side of the border they are on. The company 
uses many tactics to control labor, which makes union organizing difficult at best. 
But this doesn’t stop workers from resisting and fighting back in some way. 

Overall, the story about transnational tortillas is a well written and detailed 
portrayal of the interconnections between macro policies of governments and their 
effect on the micro processes of the shop floor. It should be of interest to those 
concerned with globalization and labor issues, the problems associated with im- 
migrant status, as well as the sexuality and racialization of the workplace. 


Metropolitan Migrants: The Migration of Urban Mexicans to the United States 
By Ruben Hernandez-Leon 


University of California Press. 2008. 258 pages. $55 cloth, $21.95 paper. 


Reviewer: Nadia Yamel Flores, Texas A&M 


Hernandez-Leon’s Metropolitan Migrants is the first book-length monograph 
examining the history, bi-national components and social processes of urban-origin 
migration from Mexico to the United States. Hernandez-Leon uses a bi-national, 
multi-site and multi-method approach which includes both qualitative and quan- 
titative methodologies; unveiling the reality of what Hernandez-Leon calls the 
Monterrey-Houston connection, he documents a 10-year study of international 
migration exemplified by the sojourners from La Fama, a working-class 
neighborhood in Monterrey and its surroundings, to Houston. 

The book begins with the bi-national account of the historical connection 
between the two cities and provides a well documented account of Mexico’s 
economic restructuring process from Import Substitution Industrialization to Export 
Oriented Industrialization. He argues these processes, under the auspices of 
globalization, led to the cruel displacement of skilled manufacturing workers who had 
long enjoyed economic stability in the form of decent wages and benefits, forcing 
many to migrate to the United States. According to the author, the consequences 
of such displacement can be used to push the boundaries of international migra- 
tion theories which claim Mexicans migrate to the United States as a result of a 
risk-managing strategy; rather, migrants from La Fama were looking for ways to 
cope with the forces of economic restructuring. 

The book is full of novel findings related to urban-origin migration from 
Mexico, as it documents how during the second half of the 20th century U.S. 
company recruiters found and actively recruited highly skilled workers in Monterrey 
to work for the oil and aviation industries in the United States. Furthermore, the 
study shows that human capital transferability has been possible for many of the 
highly skilled workers of La Fama due to both active recruiting and social network 
referrals within the embedded network of regiomontano migrants (as Hernandez- 
Leon calls the people of Monterrey), although many migrants more commonly 
experienced a scenario of downward mobility when they were hired into the 
asbestos removal industry. 

Hernandez-Leon also uncovers the network mechanisms shared by urban- 
migrant networks, which he claims are made of multiplex ties reinforced through 
endogamic marriage practices and constituted by kinship ties as well as neighbor 
and co-worker ties. Labor stability and occupational homogeneity in La Fama 
created the conditions for its residents to build strong neighbor and co-worker 
ties rooted in the urban barrio; such ties provided the necessary social capital 
to prospective migrants to relocate to the United States. Unlike those network 
ties belonging to their rural migrant counterparts, the regiomontano ties did not 
depend on enforceable trust and social surveillance mechanisms, so participants 
were not obligated to help others as a result of social pressure. 

Hernandez-Leon also uncovers what he calls the “Monterrey-Houston circuit” 
as the constant movement of people, money, information and goods across 
the international border between Mexico and the United States. The study reveals that 
the ability of international migrants to engage in transnational movement and 
transnational activities depends on border enforcement and the legal status of the 
migrants, as tighter border enforcement and lack of legal status prevent actors 
from participating in transnational activities. Hernandez-Leon challenges the role 
of nation-states and their borders on restricting the cross-border activities and 
social processes that result from the international migratory experience. 

Another revelation is what Hernandez-Leon refers to as the “Migration Industry,” 
exemplified by Transportes Garcia. Through an ethnographic account, Hernandez- 
Leon shows migrant entrepreneurs engaging in the transnational movement of 
people, goods and remittances across international borders; these services offer a 
solution to the increasing “undocumented permanent resident” population (as 
Jorge Durand would put it) which has resulted from stricter border enforcement and 
the lack of access to legal visas. 

Hernandez-Leon cautions that the Monterrey-Houston connection is unique, 
given the long history of bi-national cooperation between Monterrey corporate 
owners and U.S. enterprises, Monterrey’s closeness to the border, and the 
long-established commercial, social, labor and transportation connections between the 
two cities. After reading the manuscript, I agree with the author that the lessons 
learned in this manuscript can without doubt still be used to inform our general 
knowledge about the migration of urban Mexicans to the United States. Given my 
own research experience observing urban-origin flows from Guanajuato, Mexico 
to the United States, I also found consistencies with most of the conclusions and 
findings listed in this monograph with the exception of the ability of Guanajuatenses 
to hold a border-crossing card, which is a unique benefit to those who live near the 
U.S.-Mexico border. This is to say that economic restructuring has affected urban- 
dwellers all across Mexico, and that tighter border restrictions have created the 
need for a “Migration Industry” to serve those who have become “undocumented 
permanent residents” all across the United States and their loved ones in Mexico. 


Adam Smith in Beijing: Lineages of the Twenty-First Century 
By Giovanni Arrighi 
Verso. 2007. 418 pages. $35 cloth, $25.95 paper. 


Reviewer: Cheris Shun-ching Chan, University of Hong Kong 


Arrighi’s third major title, Adam Smith in Beijing, is a refreshing sequel to the 
author’s two earlier works, The Long Twentieth Century (1994) and Chaos and 
Governance in the Modern World System (1999). In line with his systemic, histori- 
cal approach to the development of modern world capitalism, this book explains 
the shift of the global political economy’s epicenter from North America to East 
Asia. Arrighi frames his argumentation and analysis in light of Smith’s The Wealth 
of Nations. The major argument—that the ongoing descent of the United States 
in the world economy and the ascent of China realizes Smith’s vision of a world- 
market society—is novel and provocative. 

The volume is evenly divided into four parts, and each part contains three 
chapters. Part I lays out the theoretical underpinnings of the project, with a re- 
interpretation of Smith's theory of markets and its implications for an economic 
development path divergent from the Euro-American model. Part II tracks the 
global turbulence in the first half of the 20th century that, together with 9/11, set 
the stage for the Bush Administration’s Project for a New American Century. The 
analysis of the crisis of U.S. hegemony carries on from Part II to Part III. Part 
III focuses on the failure of the New American Century Project, and the shift in 
U.S. domination from a legitimate protector to a racket protector. Arrighi argues 
that the failure of the U.S. project and the decline of its hegemonic position 
consolidated China's emergence as the power center. Part IV describes the unique 
features of the East Asian economic model, and how China’s development path is 
likely a realization of Smith’s ideal type of “natural course” economic development. 

Along with Arrighi’s former works, this book is full of historical details to sup- 
port his claims and arguments. The volume makes an impressive contribution, and 
has many strengths, particularly these two. First, the re-interpretation of Smith’s 
conception and theory of economic development directly confronts neo-liberal 
economists’ appropriation of Smith’s theory for their agendas. In contrast to the 
conventional belief that Adam Smith, as the grandfather of classical economics, 
advocated minimal government intervention to let the “invisible hand” do its job, 
Arrighi contends that Smith actually prescribed a strong and active role for the 
state in the economy. As the ultimate goal of market efficiency is to serve the good 
of society, the state must actively intervene to correct any socially or politically un- 
desirable outcomes from market operations. Thus, the “liberal creed” advocated by 
the Washington Consensus was completely alien to Smith. Second, the argument 
that the Chinese economy’s path, prior to the European Industrial Revolution and 
in the twilight of U.S. domination, is closest to Smith’s “natural” path to opulence, 
radically challenges the advocacy of a “China (militaristic) threat.” Arguing that 
Chinese domination of the world economy (from the Song dynasty in the 12th 
century to the mid-Qing dynasty in the 18th century) was always based on national 
economy-making rather than war-making, Arrighi characterizes the East Asian 
market economy as non-militaristic in nature in contrast to modern Western 
capitalism, which is closely aligned with militarism. He argues that the divergent 
mode of economic development to be led by China in the 21st century is built 
in this historical mold. Furthermore, China’s unyielding refusal of shock therapy 
and the strong role of its state in the making of a market economy from Deng 
Xiaoping’s to Hu Jintao’s regime has, in the author's view, signaled that China is 
not following the neoliberal path. 

It is difficult to do justice to a book of more than 400 pages in a few hundred 
words. The most exciting and refreshing work, to me, is found in chapters 2, 3, 6, 
7, 11 and 12. These few chapters piece together the crux of the story. The draw- 
back of the book is that some chapters appear to be only remotely connected to 
the main argument, and Chapter 1 drags on too long about others’ views. With 
the title Adam Smith in Beijing, one might expect to read mainly about China. 
This expectation is unlikely to be fulfilled, as on the whole, the volume has more 
about the decline of U.S. domination than the rise of China. Readers have to be 
patient and wait until the last two chapters to read about China. When describing 
Beijing’s climb to replace Washington’s central position, the author has relied too 
heavily on the U.S. press (particularly the New York Times) as the primary source 
of data, which can have its own ideological bias. Finally, while Arrighi’s historical 
and analytical insights detailing how the Chinese economic model fits Smith's 
theory are impressive, readers may find the picture of China’s present economic 
development a bit too beautified. To tone down a seemingly perfect, idealistic 
Chinese world market economy, the author dedicates the final section of the last 
chapter to the social contradictions and inequalities brought about by China's 
economic reforms, though this section is rather disjointed from the success he 
lays out in the chapter as a whole. 

These imperfections, nevertheless, do not negate the highly valuable contribu- 
tion of this work. Readers will enjoy the historical richness with incisive, insightful 
and sharp analyses. It is an important text for scholars and students of political 
economy, and for economic sociologists who want to get a refreshing dose of 
Adam Smith’s ideas. 
Volunteers: A Social Profile 
By Mark A. Musick and John Wilson 
Indiana University Press. 2008. 663 pages. $39.95 cloth. 


Reviewer: Susan M. Chambré, City University of New York, Baruch College 


When David Sills published his seminal book, The Volunteers, in 1959, he had little 
empirical research to review. His study of the National Foundation for Infantile 
Paralysis, now known as the March of Dimes, analyzed a major social phenom- 
enon, a nationwide mobilization to “fight” polio by volunteering and donating 
money. By the time his book was published, this effort had already achieved its 
main goal since the National Foundation had largely but not completely funded 
the development and clinical trials for the Salk polio vaccine. Sills was only able 
to draw upon a handful of works that studied volunteerism, voluntary associations 
and the nature and role of civil society and civic engagement. But his examina- 
tion, still interesting and now of historical value, set the stage for other studies of 
volunteerism and civic engagement. 

Nearly three decades ago when I began to study volunteering, most of the 
literature, aside from Sills’s work, Lipset’s Union Democracy and de Tocqueville, was highly 
speculative and was mainly based on the experiences and observations of volunteer 
directors. The literature on volunteering was substantial, but little of it had the kind 
of methodological rigor and theoretical insights of more recent work, most notably 
work by the two authors of this volume and a number of their collaborators. 

Since then, we have come to a far greater understanding of volunteers, vol- 
unteerism and a host of broader issues of the organizations and institutional 
structures that influence volunteer work. Volunteering itself has become far more 
institutionalized and harder to define with the growth of public programs pro- 
moting volunteerism, greater use of volunteers in both the public and corporate 
sectors (defined as ‘interns’) and payment (not wages) to volunteers in the form 
of stipends, scholarships and payment for expenses. Most recently, scholars 
studying civil society organizations have been faced with a perplexing paradox: there 
has been a sharp decline in organizational memberships—a phenomenon widely 
referred to as ‘bowling alone’ (described by Robert Putnam in Bowling Alone)—but, 
at the same time, a larger proportion of Americans have been engaged in volunteer 
work albeit donating fewer hours over the course of a year than was true in the past. 

Over the past two decades, the empirical and theoretical insights about vol- 
unteering have grown enormously. This is a highly interdisciplinary area of study 
with important work done by economists, psychologists and sociologists. Musick 
and Wilson have done scholars and students an enormous service by doing a 
careful review of the sociological literature. To say that this is an encyclopedic volume 
is not an overstatement. 

The book offers detailed coverage of the empirical literature on volunteer- 
ing. The six major parts of the book—the Introduction, Subjective Dispositions, 
Individual Resources, the Social Context of Volunteering, the Organization of 
Volunteer Work, and the Consequences of Volunteering—each offer a clear and 
well-organized discussion of various dimensions of the broader subject of volun- 
teerism. Each individual chapter is a stand-alone review of the literature on issues 
like motivations, the role of values, norms and attitudes, and the importance of 
gender, race and the life course. 

But this is not simply a literature review. The book includes a series of 
discussions based on the authors’ analysis of empirical data from several different data 
sets including Independent Sector and Americans’ Changing Lives. There are 
also a number of incredibly insightful observations included in the volume which 
have the potential of leading to a major reconsideration of current best practices. 
For example, using the ACL data, the authors found that “the more hours people 
volunteered before retirement the more likely they were to cut back on the number 
of hours volunteered after retirement.” This finding challenges current policies 
which are designed to tap the extra time people gain when they retire, an effort 
that is currently garnering significant amounts of public and foundation resources 
in anticipation of the retirement of baby boomers. 

The book’s major strength is, however, also a deficit. In doing the careful 
stand-alone chapters that cover major issues hard to discuss in isolation from other 
topics, the book incorporates some redundancy and leaves the reader with a good 
but somewhat overwhelming sense of the range of the issues and the complexity 
of the topic rather than a clear theory of the nature of volunteerism and the social 
and cultural forces that shape this phenomenon. But, for the novice reader as well 
as the scholar more immersed in the field, this is the one book to have because 
of its coverage and sharp insights. The authors have created, in a sense, both an 
encyclopedia and a guide for future research. 

Divine Hierarchies: Class in American Religion and Religious Studies 
By Sean McCloud 
University of North Carolina Press. 2007. 224 pages. $65 cloth, $22.50 paper. 


Reviewer: Ralph Pyle, Michigan State University 


In this book author Sean McCloud enhances our understanding of the connection 
between religion and social class by acknowledging the importance of cultural 
resources and situational factors in accounting for the different forms of 
religious expression associated with affluent and non-affluent groups. Incorporating 
Bourdieu’s concept of habitus, McCloud emphasizes that cultural resources are 
associated with class codes that predispose individuals from similar social locations 
to join particular religious groups. 

After outlining his theoretical approach, McCloud examines American schol- 
arship on religion and social class during different time periods. During the 
age of eugenics (the early 20th century), writers often presented biological 
accounts which emphasized that “inferior sorts of people” tended toward sectarian 
religious expression. Although his sampling of anthropological and sociologi- 
cal writings from the time may not perfectly represent the corpus of scholarly 
work on the topic, McCloud does present evidence that a number of authors 
embraced eugenic assumptions. McCloud then examines scholarship from the 
late 1920s to the early 1940s and emphasizes that the emergent deprivation 
theories of religious behavior shared a common bond with the earlier biological 
discussions of religious preferences. However, the author stumbles when trying 
to link Niebuhr’s Social Sources of Denominationalism account with biologically 
deterministic writings of earlier decades. Although Niebuhr may have used 
the phrase “intellectual naiveté” to describe economically disfranchised classes 
drawn to emotional religious expressions, I am not aware that he attributed 
intellectual deficiency to those groups. 

McCloud notes that deprivation theories peaked in studies of religion in 
the 1960s through the 1970s. His review of the scholarship during this period 
includes many anthropological accounts of Native American or non-Western 
religions. The author again stresses the degree to which deprivation accounts 
echo earlier biological or evolutionary accounts of religious behavior. While 
McCloud has succeeded in identifying scholarly treatments of religion and social 
class that may echo earlier eugenic assumptions, it is not clear that those studies 
represent the larger body of research on religion and economic inequality, and 
the selection of studies seems slanted to support the author's claims. Certainly 
the landmark sociological treatments (e.g., the works of Weber, Troeltsch, 
Niebuhr and Pope) highlight the importance of social, not biological, factors 
in accounting for people's religious behavior. It is more likely that many depri- 
vation scholars evince a middle-class bias or a mainline Protestant bias when 
describing sectarian religion, and in that sense they may be portraying sectarian 
adherents in a patronizing fashion. 

Toward the end of the book McCloud discusses four major theologies of class 
in American religion and then presents an interesting ethnographic account of 
services at two Holiness Pentecostal congregations in the Ohio River Valley. One 
of the churches is a small independent Pentecostal church whose members engage 
in spontaneous expressions of ecstatic worship. The other is a larger Church of 
God congregation that is experiencing an influx of middle-class members and 
seems to be accommodating to mainstream cultural styles. McCloud argues that 
deprivation theories fail to fully explain the differences between these churches. 
Incorporating insights from his theoretical model, McCloud focuses on differences 
in praise styles, clothing and music at the two congregations. He stresses that the 
different styles of worship at these churches are not just a reflection of the members’ 
social class positions, but are reflective of differences in pastoral style, institutional 
affiliation and growth trajectories. It is interesting to note that the author's analysis 
of congregational accommodation here seems to affirm the basic insights of the 
sect-to-denomination model of religious change that he earlier criticizes. 

McCloud rightly suggests that deprivation accounts of religious behavior tend 
to ignore other factors that help to explain sectarian affiliation, such as recruitment 
techniques, group commitment mechanisms and other external contextual factors. 
In addition, he has successfully moved beyond traditional accounts of religion and 
social class by incorporating elements of Bourdieu’s discussion of social class in 
his model. Although the author's blanket condemnation of deprivation theories is 
somewhat of a distraction, there is useful material here, and this book should serve as 
a good resource for anyone interested in the topic of social class in American religion. 


Bonds of Civility: Aesthetic Networks and the Political Origins of Japanese Culture 
By Eiko Ikegami 
Cambridge University Press. 2005. 460 pages. $80 cloth, $36.99 paper. 


Reviewer: Priscilla Parkhurst Ferguson, Columbia University 


What place does beauty have in social life? In her stunning analysis of Japanese 
culture Eiko Ikegami tackles this seemingly intractable question, and she does so, 
as the subtitle makes clear, to reach to the larger society. Aesthetic practices in 
pre-modern Japan constituted an extra-political force that turned out to be highly 
political. In manifesting a distinctive culture of everyday life, the sensibility that 
evolved over the two and a half centuries of the Tokugawa shogunate (1603-1867) 
shaped the Meiji political regime (1868-1912) and prepared Japan to take its 
place in the modern world of nation-states. 

Political sociologists will see this work as a case study of an alternative model 
of modernization. Ikegami challenges the assumption that Europe and North 
America constitute the paradigm of the development of democratic society. The 
Tokugawa shogunate traces a very different path for the late (by European stan- 
dards) emergence of Japan as a modern society. Strictly regulated hierarchical 
relations left no opening for anything resembling a Habermasian public sphere, no 
social space for the archetypal coffee houses, and zero tolerance for the freewheel- 
ing discussions supposed to have set the stage for democracy. In Japan, the egali- 
tarian ties deemed requisite for a democratic polity took shape, not in voluntary 
organizations geared to social action, as de Tocqueville argued for America, but 
in groups committed to the practice of beauty in everyday life. 

Instead of vertical ties challenged by an oppositional sphere, “proto-modern” (a 
recurrent term) associations that grew up around aesthetic practices—poetry, fashion,
the tea ceremony and flower arranging, among others—linked participants
horizontally. Simultaneously within and outside the hierarchy of rigidly classified 
and controlled socio-political relationships, these sites allowed individuals to meet 
on an equal footing. They fostered “bonds of civility” of a very different order from
the Western model of civil society. Network theorists in particular will appreciate 
the intricacy of the connections. 

Historical sociologists will surely prize the painstaking reconstruction of a cultural 
tradition. Ikegami shows how some of these aesthetic practices evolved from forms that 
first appeared in mediaeval times but came to serve very different purposes in the later 
period. Especially impressive is the range of evidence—diaries, published and unpub- 
lished poetry, civility manuals, commercial records, travelers’ accounts, novels, paint- 
ings, woodcuts, scrolls, even contemporary advertisements. (Cambridge University 
Press is to be commended for the abundance and quality of the illustrations.) 

Yet, Bonds of Civility has perhaps the greatest riches for the cultural sociologist. 
Not only does Ikegami confront the questions of beauty that most sociologists 
scrupulously avoid, she demonstrates the relevance of the aesthetic, not only to the 
civilizing process in Tokugawa Japan, but also to sociological analyses of culture 
more generally. The bonds of civility laid before us in splendid detail are first and 
foremost bonds of beauty. 

The art forms themselves, interactive in the main, compel sociability. Linked 
poetry depends on serial participation, each writer working off the preceding poetic 
creation and setting up the next. This collective, public poetry, which integrates the 
writer into a community, contrasts vividly with the Western lyric with its emphasis 
on the individual creator and a final product. Similarly, rather than solipsistic gusta- 
tory pleasure, the tea ceremony enacts an elaborate ritual of social integration. The 
tacit communication of highly controlled body movements stands in stark contrast 
to the Western mode of explicit articulation of aesthetic principles and beliefs. At 
the same time, language plays a vital role in shaping as well as expressing cultural 
understandings. Ikegami is ever attentive to its nuances and their significance. Like 
Elias, she places great importance on the particulars of expression, carefully teasing 
out the multiple implications of a given term (as in the discussion of “public”).

Once connected to the expanding market economy these aesthetic practices
spread beyond the personal networks characteristic of the earlier periods and 
brought them to the attention of the state. Ikegami analyses three cultural
productions that harnessed the market in the later 18th and 19th centuries: commercial
publishing, civility manuals and, my favorite, Kabuki theater and fashion with 
their bid to renegotiate both class and gender identities. The argument of domina- 
tion and resistance builds on close analyses of the very stuff of fashion, the designs 
for kimono, imported and domestic silks and cottons, urban sumptuary regula- 
tions so concerned with minutiae as to forbid sumo wrestlers from wearing silk 
loincloths! Any one of these can stand on its own as exemplary of the interaction of 
the commercial and the aesthetic in pre-modern Japan; together they are a tour de 
force, all the more persuasive for the comparative context that, here as throughout 
the book, broadens the discussion. 

Its theoretical sweep, historical focus, imaginative use of evidence and penetrating 
analyses make Bonds of Civility a treasure trove for sociologists of every persuasion. 

While the general dynamics governing collective memory processes are well developed
theoretically, our tool kit for systematically assessing how collective memory changes 
over time remains limited. Here, we focus on a particular tragic event—the killing of 
five participants in an anti-KKK march in Greensboro, North Carolina, on Nov. 3, 
1979—to assess continuity and change across accounts presented as part of a 1980
federally-sponsored investigation and a 2005 truth and reconciliation commission. 
Specifically, we employ a block modeling methodology to describe the structure of 
narrative themes shared across multiple accounts in each time period. We then analyze 
the sources of temporal variation in these narratives, and show how shifts in the range 
of actors participating in the 1980 and 2005 initiatives, as well as the institutional 
contexts within which accounts were offered, shaped how themes were strategically 
deployed to increase the resonance of specific positions. We find that, in the Greens- 
boro case, the contours of these demographic and institutional dimensions resulted 
in a decrease in the polarization of competing narratives and an increased emphasis 
on key themes that contested elite “institutional” accounts.





“Memory is a living thing—it too is in transit.” 


— Eudora Welty (1984:104) 


“Memory is a process, not a thing; and it works differently at different points in time.”


— Olick and Robbins (2002:122)


The concept of time is embedded in the very fabric of work on collective memory. 
Accounting for how events are remembered and represented by collectivities re- 
quires that we take seriously a bedrock assumption of historical process—the fact 
that past events shape present actions and outcomes. Schudson (1992) explains 
that such effects can occur both through contingency—where the occurrence of
an event shapes, in part, subsequent events—and continuity, in which the insti- 
tutionalization of memories of prior events serves as a cultural force that imbues 
present choices with meaning. But equally important is the inverse claim—that 
events have the capacity to (re)shape our interpretation of the past (Bearman et 
al. 1999; Danto 2007; Griffin 2004). In this sense, the malleability of memory 
renders history vulnerable. “Because ideology is powerful, the needs and desires of 
the present urgent, and the pull of the self and its attachments strong,” Schudson 
(1992:205) argues, “the past is subject to reconstruction and rewriting to accord 
with present views.” 

As this quote suggests, the “presentism” of memory can be accounted for within 
an instrumental framework, in which the logic of a recollection emerges through 
its alignment with its proponents’ current interests (Coser 1992). But the pull of 
self-interest is also tempered, and in part constructed, by cultural forces, in which 
symbolic logic and normative sway constrain the range of “acceptable” renderings 
of the past (Olick and Levy 1997; Schwartz 1982; Schwartz et al. 1986). These 
cultural forces, in effect, provide a basis for the durability of memory by placing 
certain revisionist accounts out of bounds, limiting history's malleability. The roles 
of both interest and culture, in addition, must be weighed relative to the institutional
contexts within which agents produce and disseminate representations
of past events, as memories that take on a meaningfully collective character are 
rarely defined equally or neutrally by all members of a collective (Wagner-Pacifici 
1994; Wells 1996). 

In order to assess how collective memory changes over time, we adapt an 
established network methodology to rigorously analyze such temporal change 
processes. Specifically, we apply block models—which have been used to describe 
the structure of trade flows, reputational status, musical genres and social protest 
events (Bearman and Everett 1993; Cunningham et al. 2003; Nordlund 2007; 
Shrum and Wuthnow 1988)—to a novel context: collections of individual narratives
tied to the 1979 killing of five activists at an anti-Ku Klux Klan march in
Greensboro, North Carolina. We draw upon two sets of narratives, collected in 
different time periods as part of separate community-based initiatives to construct 
a collective memory of the event. This approach allows us to: (1) describe the
“macro-structure” of institutionalized memory that emerges through multiple
narrative accounts presented within a single time period; and (2) assess the degree
of durability vs. malleability of such memories, by analyzing how these narrative 
macro-structures vary over time. Capturing temporal change also provides a basis 
to examine whether such variation can be explained by the emergence of subse- 
quent events or by dimensions of the institutional contexts within which collective 
agents create a “commonly accepted” view of history (Gibson 2004; Savelsberg 
and King 2007). Our primary interest is to understand how collective memory of 
a discrete event is affected by these temporal and institutional processes. 


Collective Memory


A key premise underlying any examination of how temporal processes shape the 
social, or collective, character of memory is the classic insight that the past is not 
fixed, but instead is conditioned by, and conditions, present and future events 
(Degler 1976; Griffin 2004; Schudson 1992). For social scientists, the analytic 
analogue of this insight is what Bearman et al. (1999) refer to as the “problem 
of casing,” or the difficulties inherent in fixing the beginning and end of event 
sequences. These difficulties, of course, stem from the fact that history is not 
“dead,” that the meaning of past events can be transformed by new “endings”—later
events that reactivate historical occurrences and alter their roles within an overall 
sequence of events (Danto 2007). 

This temporal malleability of memory highlights its social character. In his 
foundational conception, Maurice Halbwachs (1992), reflecting the influence of 
his teacher Durkheim, asserted that memory was necessarily a collective endeavor, 
an act made possible by individuals’ participation in groups. This view has been 
tremendously influential in subsequent work, much of which focuses on the rela- 
tional character of remembering and/or how memories are materially represented 
(see Nora 1996; Olick 1999a; Schudson 1992; Zerubavel 1996). 

A critique of this tradition has emerged in reaction to what some view as 
Halbwachs’ “anti-individualist” tendency to strip away the agentic dimensions of 
the process. Much of this reading stems from Halbwachs’ assertions that memory 
is recalled by individuals “externally” through publicly-accessible symbols and nar- 
ratives, and reconstructed only through the social framework of one’s group as- 
sociations (Olick 1999a). Remarking on subsequent applications of this tradition, 
Confino (1997) notes the danger in viewing collective identity as unitary, such that
“a” collective memory exists within some identifiable population. Doing so elides the
contestation that defines the creation and reproduction of collective memory, in par- 
ticular the ways in which powerful actors work to construct and control a dominant 
discourse (Kansteiner 2002; Wagner-Pacifici and Schwartz 1991). Such concerns 
have led some analysts away from labeling such processes as “collective,” and toward 
alternative labels such as “cultural” or “social” memory (Olick and Robbins 1998). 

But Olick (1999a) also notes an individualist strain running through Halbwachs’
work, centered on his consistent assertion that remembering—though structured 
within social contexts—is an act occurring within individual minds. This tension 
between individual and collective processes has endured in recent scholarship on 
collective memory, which has tended to separate into two camps. Schuman and 
Corning (2000) have represented this split as the distinction between “cultural- 
level” vs. “individual-level” analyses. The former emphasizes symbols that come to 
serve as representations of the past—and, crucially, how such symbols are deployed 
by powerful actors. The latter focuses on memories held by individuals within 
some defined population, often relying on survey or interview data to examine 
how individual memories cumulate across group members. 
This key distinction has been extended and expressed in various ways. Olick
(1999a) notes that one way to conceptualize a truly collective memory is to 
recognize the role of supra-individual groupings and processes—including language,
technology and power-laden institutions—in how events are recalled
and represented. In contrast, he argues that an emphasis on how individual 
memories emerge, endure and aggregate might be more accurately referred to 
as the study of “collected memory.” However, the fact that commemorations 
and other collective representations are necessarily both constituted and (re) 
interpreted by individuals has moved other scholars away from the strict binary 
conception of individual vs. collective levels of analysis, and toward more fluid 
formulations of “elite” and “popular” (Schwartz and Schuman 2005), or “official” 
and “vernacular” (Bodner 1992), memories. 

Despite these divisions, few analysts in either camp would refute the claim that 
collective perceptions are mediated by the interaction of producers and consumers 
of representational texts or that individual beliefs and intentions play some role in 
creating commemorative representations. It is this intersection of the popular/individual
and elite/collective that interests us. In the Greensboro case, community
initiatives organized in 1980 and 2005 were self-consciously intended to represent
the collective expression of memory around the November 3rd event, and thus as
vehicles to ultimately attain “mnemonic consensus” (Griffin and Bollen 2009;
Zerubavel 1996). The “texts” employed to do so were derived from collections of
testimonies gathered in both years. 

The role of time—reflected in processes of continuity and change—is central to 
strong and weak strains of “presentist” collective memory research, focusing on 
how groups work to align images of the past with present interests (Hobsbawm 
and Ranger 1983; Olick and Robbins 1998; Schudson 1992; Schwartz 1996). A 
well-developed literature on cohort effects emphasizes how events experienced by 
a cohort during formative years exert a strong influence over memory, providing
one mechanism for the formation and maintenance of a stable, persistent
generational identity (Griffin 2004; Schuman and Rodgers 2004; Schuman and 
Scott 1989). Olick (1999b) notes the path dependency of collective constructions 
through which commemorative efforts are shaped in reference to the accumulated 
history of past commemorations. 

This dialogic relationship between a given commemorative effort and its prior 
iterations highlights but one way in which the temporal durability of memory 
is conditioned by broader contextual elements. Such factors can directly exert 
influence through their impact on the institutional makeup of subsequent com- 
memorations. Savelsberg and King (2005:583) argue strongly the importance of 
examining this institutional context, in particular the political and social norms 
defining the “rules of commemoration,” the rhetorical and procedural frameworks 
within which statements are made, legitimized and recorded, and the political 
and civic structures shaping the public sphere in particular places and times. 
Wilson (2003), in his discussion of the South African Truth and Reconciliation
Commission, similarly focuses on institutional context, emphasizing the roles of 
the public organization and bureaucratic method of the TRC itself. 

Assessing the impact of institutional and temporal contexts on the durability or 
malleability of collective memories in Greensboro requires that we first describe 
the extent to which those memories persisted or changed over time. Typically, 
these sorts of temporal comparisons have centered on rich textual or discursive 
analyses of memorials and other material forms, ranging from qualitative explo- 
rations of worldview and meaning, to more formal attempts at content analysis 
through comparative word and/or concept counts. To capture distributions of in- 
dividual beliefs, Schuman and his colleagues have relied mainly on survey sample 
designs, while others have employed open-ended interview accounts. In contrast, 
our approach arrays narrative accounts as networks. Unlike survey results, which 
typically capture variation across individuals through their responses to discrete 
questions, a network-based approach provides a means to capture narrative ele- 
ments contextually—i.e., as holding a place within a constellation of all possible 
elements (Bearman et al. 1999; Bearman and Stovel 2000; Franzosi 2004; Smith 
2007). Capturing sets of elements as they are patterned across multiple narrative 
accounts of the same event moves beyond a binary accounting of the presence/ 
absence of particular themes, to understand each feature in relation to all other 
themes salient to narratives about the Greensboro event. 


The “Greensboro Massacre” 


On Nov. 3, 1979, in the black Morningside Homes neighborhood, the Communist
Workers Party held a widely-publicized demonstration against the Ku Klux Klan.
The CWP was a racially-diverse organization, composed largely of university-aged
activists committed to empowering low-wage workers. Active largely in textile
mills across North Carolina’s Piedmont region, the CWP had developed a strong
anti-KKK stance, in part due to the perception that the klan posed a threat to racial
cooperation in trade union efforts. Following a direct clash with klan members
in July 1979, the CWP’s aggressive anti-KKK activities intensified, culminating in
their November 3rd march, billed as a “Death to the Klan” event.

That day, as CWP members and their supporters gathered at Morningside 
Homes, eight carloads of klansmen arrived to disrupt the march. As they slowly 
passed, CWP members banged on the cars with sticks, and the groups antagonized 
each other with verbal slurs. The violence escalated when KKK and Nazi members 
retrieved guns from their cars and opened fire on the demonstrators, leaving five 
CWP associates dead and eight others, along with a single klan member and a 
news photographer, wounded. 

While both sides came to the event armed and exhibited aggressive behavior, 
klan members fired the initial shots. All but one of the demonstrators slain, 
along with the majority of those wounded, were unarmed, and all injuries 
were inflicted by klan and Nazi weapons. The conflict was complicated by the
absence of Greensboro police on the scene and the presence of children and 
other bystanders along the parade route. Such factors led many to refer to the 
event simply as the “Greensboro Massacre,” though this characterization was 
contested by others who viewed the incident as a “shootout” between two armed 
outsider extremist groups (Wheaton 1987). 

Within months of the tragic violence, the North Carolina Advisory Committee 
to the U.S. Commission on Civil Rights sponsored a community forum designed 
to present publicly a representative range of accounts exploring its causes and con- 
sequences. In two subsequent criminal trials, all-white juries found the Nazi and 
klan defendants not guilty by reason of self-defense, despite evidence that included
video footage of the shootings (Ifill 2007). While members of the CWP contin- 
gent refused to testify during the first trial and in some cases actively disrupted the 
proceedings, state and federal prosecutors’ failure to win convictions contributed 
to a widespread sense that the victims had been betrayed by the police department, 
the city and the legal system. In a 1985 civil trial, the city was found responsible 
for the shootings as a result of new information discovered regarding the police
department's actions. Officers had full knowledge of the intention and location of
the November 3 “Death to the Klan” rally, as they had issued the parade permit. 
They also had an informant within the klan who assisted with the planning of the 
KKK and Nazi counter-demonstration, leading many to suspect that police failed 
to use their intimate knowledge of the klan to prevent the confrontation with the 
CWP from turning deadly (Williams 2007). 

In 2003, the Greensboro Truth and Reconciliation Commission was estab- 
lished by local residents to more completely understand what had taken place. 
Historically, TRCs have been initiated as a means to pursue official truth-seeking, 
by publicly documenting and investigating events through testimony from vic- 
tims and perpetrators as well as analysis of primary and secondary data sources. 
Their goal is to create a rigorously-constructed “truth,” thereby “reduc[ing] the
number of lies that can be circulated unchallenged in public discourse” (Ignatieff 
1996:113) and ideally providing a foundation for healing and reconciliation. 
While the post-apartheid South African TRC has earned the most attention, at 
least two dozen TRCs have been established globally in the past 20 years (Hayner 
2001). The Greensboro Commission was the first such effort in the United States.

In 2005 and 2006, the GTRC convened three two-day public hearings, fea- 
turing testimony by former and current CWP members, Nazi and klan members, 
shooting victims, elected officials, police officers, attorneys and Morningside 
Homes residents. Additionally, the commission employed evidence from the 
three trials, private records made available by the police department and FBI, 
newspaper and magazine articles, academic literature, and more than 200 personal
interviews and testimonies in its analysis (GTRC 2006). The final report
was released in May 2006. 


Data


Has the structure of narrative themes surrounding November 3rd changed over
time? What are the source(s) of observed temporal continuity and change? To 
systematically answer these questions, we apply a network-based methodology to 
sets of narratives collected during two distinct time periods. To the extent that the 
1980 and 2005 narratives are not equivalent, we argue that the passage of time 
holds some significance. But “time” can play at least three potentially compet- 
ing analytic roles. In one sense, it can serve as a medium for the emergence of 
subsequent events that shift the thematic structure of November 3rd narratives.
Second, these emergent events, beyond serving as raw material for the production 
of new themes, can define new contexts that provide the causal push required for 
individuals and constituencies to reassess their original narratives, creating fresh 
accounts of past events. Finally, temporal shifts in the collective thematic struc- 
ture can be produced by differences in the institutional contexts surrounding the 
1980 vs. 2005 initiatives, including the perceived legitimacy of these community 
initiatives, the broader climate surrounding American race relations during both 
periods (Savelsberg and King 2007), the format within which individual narra- 
tives were delivered, validated and questioned (Wagner-Pacifici 1994), and the 
makeup of populations testifying in 1980 and 2005. 

To capture the changing structure of narratives regarding November 3rd, we
employ individual accounts collected in two time periods. The first set comes
from abridged transcripts of an NCAC meeting convened on Feb. 26, 1980. The
commission had been created as an independent bipartisan federal agency by the 
Civil Rights Act of 1957, charged with the study and investigation of potential 
violations of equal protection clauses, including those which occur due to racial 
discrimination. In each state, the commission constituted an advisory commit- 
tee, composed of “responsible” private citizens, to advise on “matters of mutual 
concern” (NCAC 1980:i). In 1980, North Carolina's committee was a biracial
group of 16 professional and civic leaders, including a number of doctors, lawyers 
and clergy, as well as prominent civil rights leader Floyd McKissick (NCAC 1980). 
Its formal tie to the federal government, along with the established links between 
many of its members and political and civic elites, provided its proceedings with 
considerable credibility in institutional circles. 

The open meeting was designed to “ascertain the status of black/white relations
and the extent to which the November 3rd shootings have affected relations
between the races” (NCAC 1980:1). To that end, 25 individuals, intended to represent
“a cross-section of Greensboro’s citizenry,” gave detailed statements of their
perspectives on these issues. Not surprisingly, given NCAC’s institutional support
base, the meeting included testimony from city officials, including the mayor, city 
manager, personnel and public safety directors, and the police chief. A number of 
business, religious and civic leaders—from the ACLU, NAACP, Human Relations 
Commission and Bennett College (a historically black women’s school)—also took
part. Non-institutional actors were present, though the commission tended to
invite a single representative from groups such as the KKK, American Nazi Party 
and CWP, minimizing the overall presence of these distinct (and often heteroge- 
neous) constituencies. 

The second set of narratives comes from testimony given during the GTRC’s
public hearings in 2005. Each of the GTRC’s three hearings addressed a specific
topic related to November 3rd, with invited individuals presenting prepared statements
and responding to questions from commissioners. Our data incorporate
transcripts from two of the three hearings; the first took place on July 15-16, 2005, 
focusing on the question “What brought us to November 3, 1979?,” while the 
second was held Aug. 26-27, 2005, organized around the theme “What happened 
on, and after, November 3, 1979?” Full transcriptions of the GTRC’s public hearings
are included on the commission’s website.

The GTRC was formed in 2003 by the Greensboro Truth and Community 
Reconciliation Project. The GTRCP had strong ties to two local grassroots organiza- 
tions—the Beloved Community Center and the Greensboro Justice Fund—whose 
members overlapped with the former membership of the CWP, making city officials
and others wary that the GTRC was little more than a front for left-wing activists
who had long criticized Greensboro for various racial and class-based inequities. 
From its inception, however, the GTRC worked to be inclusive, inviting a broad 
range of community groups to nominate members of the selection committee that 
chose the seven commissioners who would be the public face of the GTRC’s hear- 
ings and investigation. The commissioners were diverse in terms of race, gender, and 
educational background, and included a university professor, a retired corporate 
attorney, a nonprofit group consultant, officers of two interfaith peace organizations, 
a nursing assistant and a YMCA board member (GTRC 2004). 

In sharp contrast with the high level of institutional support for the 1980 
community meeting, Greensboro’s political and civic elites consistently attacked 
the GTRC. Mayor Keith Holliday was an outspoken critic of the process, arguing 
that “harm can come from an inaccurate truth leading to inaccurate account- 
ability, non-forgiveness and especially non-reconciliation” and characterizing the 
commission's efforts as amounting to a “guilt trip put on our 235,000 citizens 
who are not responsible for Nov. 3, 1979” (Green 2005). This view was echoed
by other city leaders, and formalized in the city council’s vote to officially oppose 
the GTRC process (Green 2005). However, the GTRC gained credibility among 
many citizens—in Greensboro and elsewhere—as the first U.S. incarnation of a 
TRC. GTRC staff made significant efforts to link with leaders from more than 
two dozen previous TRCs around the world, including South Africa’s Desmond
Tutu, and in general the GTRC gained legitimacy from the high visibility and 
considerable impact of these previous global initiatives. 

The GTRC’s makeup and process had an impact on the range of participants
whose voices contributed to the commission's collective product. Consistent with
Olick’s (1999b) conception of the ways in which commemoration efforts exhibit
a dialogic relationship with their forebears, commissioners and GTRC staff made
a concerted effort to correct what they viewed as the marginalization of many
non-institutional voices in the 1980 meeting and other previous discourse on
November 3rd. GTRC hearings were intentionally designed to incorporate multiple
perspectives from Greensboro residents (especially those who resided in
Morningside Homes), the CWP and the KKK/Nazis (though many members of
this latter constituency avoided the hearings). Greensboro officials and other elites
largely refused to participate, further reducing the representation of the city, police
and business interests. Table 1 illustrates the consequent distinctions between the
constituency makeup of the NCAC and GTRC processes.


Table 1: Constituency Representation in 1980 Community Meeting and 2005 GTRC Hearings

Constituency        Number in 1980   % of 1980 Total   Number in 2005   % of 2005 Total
Activists                  7               28.0               9               29.0
Business                   2                8.0               0                0
City                       5               20.0               2                6.5
CWP                        1                4.0               6               19.4
Education                  1                4.0               4               12.9
KKK                        2                8.0               2                6.5
Legal                      0                0                 4               12.9
Media                      2                8.0               1                3.2
Morningside                1                4.0               1                3.2
Religion                   4               16.0               1                3.2
Textile                    0                0                 1                3.2
Unknown/Resident           0                0                 0                0
Total                     25              100                31              100


Methods 


Our analysis has two components: (1) to model the narrative accounts themselves to
describe how they have changed over time, and (2) to explain the thematic structure
of narratives in both time periods, as well as why this structure has shifted in par- 
ticular ways over time. The former component requires that we develop a strategy to 
identify and “code” narrative themes, to observe systematically how commonalities 
and differences are arrayed across individuals and over time. Content analysis pro- 
vides one method, by identifying key themes and tallying their distribution across 
narrative accounts as a means to identify and examine emergent patterns. However, 
such analyses offer no clear way to examine interactions across sets of themes—i.e., 
how any particular theme relates to constellations of all other possible themes—and 
thus likely remain insensitive to complex patterns as well as risk overemphasizing 
similarities across texts (Carley 1993; Reissman 1993; Smith 2007). 
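
The limitation of simple tallies can be illustrated with a small sketch. The two sets of accounts and the theme labels below are hypothetical, invented only so that each theme appears equally often in both sets while the pairings of themes differ; nothing here is drawn from the Greensboro transcripts.

```python
from collections import Counter
from itertools import combinations


def theme_tallies(accounts):
    """Count how many accounts invoke each theme (a simple content-analysis tally)."""
    return Counter(theme for account in accounts for theme in account)


def pair_tallies(accounts):
    """Count how many accounts invoke each unordered pair of themes together."""
    return Counter(
        pair
        for account in accounts
        for pair in combinations(sorted(account), 2)
    )


# Two hypothetical sets of accounts; each account is the set of themes it invokes.
set_one = [{"A", "B"}, {"A", "B"}, {"C"}, {"C"}]
set_two = [{"A", "C"}, {"A", "C"}, {"B"}, {"B"}]

# The per-theme tallies agree, but the co-occurrence patterns do not.
same_tallies = theme_tallies(set_one) == theme_tallies(set_two)
same_pairs = pair_tallies(set_one) == pair_tallies(set_two)
print(same_tallies, same_pairs)
```

In this toy case a tally-based comparison would treat the two sets as identical, even though themes A and B travel together in one set and A and C in the other.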

Over the past decade, a range of studies has employed an alternative approach, 
utilizing network methodology to examine narrative structure. Typically, such 
analyses have relied on the logic of ego-centered network models (which emphasize
the patterning of social ties across nodes), representing stories as sets of
themes or elements connected by narrative flows (Bearman and Stovel 2000; 
Franzosi 2004; Smith 2007). The resulting presentation allows for the “grammar” 
of narratives to be analyzed through an examination of the centrality, density, etc. 
of constitutive elements. However, as our narrative accounts tend to center on
whatever particular themes were of interest to each individual, rather than their
attempts to construct a full story explaining outcomes or present conditions, we
have chosen to focus on the presence or absence of key themes, modeling the
emergent thematic structure by identifying clusters of themes exhibiting similar
relationships to the overall population of themes. Additionally, as we are interested
in the structure of themes across individual accounts, we chose to review and code
each transcript in our possession rather than sample a subset of the data.

One approach to modeling thematic structure involves representing sets of 
themes as a block model. This technique relies on arraying data in matrix form—in 
our case, each row i represents an individual hearing speaker, and each column j a
theme invoked within his/her testimony. From this rectangular ij (person-theme) 
incidence matrix, one can generate both ii (person-person) and jj (theme-theme) 
adjacency matrices through inner-product matrix multiplication (Breiger 1974; 
Wasserman and Faust 1995). These square matrices capture the degree to which 
any pair of themes is connected through mutual invocation by participants, or 
alternately the degree to which any pair of individuals is connected by common 
themes. Here, we work with the jj adjacency matrix (see Figure 1a). 
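
The matrix step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the three speakers, three themes, and the function name `co_occurrence` are hypothetical and are not drawn from the hearing data.

```python
def co_occurrence(matrix):
    """Inner product of a 0/1 matrix with itself, taken over its rows (M^T M)."""
    n_cols = len(matrix[0])
    return [
        [sum(row[a] * row[b] for row in matrix) for b in range(n_cols)]
        for a in range(n_cols)
    ]

# Hypothetical incidence matrix: rows are speakers (i), columns are themes (j).
incidence = [
    [1, 1, 0],  # speaker 0 invokes themes 0 and 1
    [1, 0, 1],  # speaker 1 invokes themes 0 and 2
    [1, 1, 0],  # speaker 2 invokes themes 0 and 1
]

# jj (theme-theme): cell [a][b] counts the speakers who invoke both themes.
theme_theme = co_occurrence(incidence)

# ii (person-person): the same product applied to the transposed matrix,
# so cell [a][b] counts the themes shared by speakers a and b.
transposed = [list(col) for col in zip(*incidence)]
person_person = co_occurrence(transposed)

print(theme_theme)
print(person_person)
```

The diagonal of each square matrix simply counts how often a theme is invoked (or how many themes a speaker invokes); the off-diagonal cells carry the co-occurrence information used in the block model.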

This structure is ultimately represented by a block model, i.e., a permuted and
partitioned jj matrix. Using the algorithm CONCOR (short for “CONvergence of 
iterated CORrelations”), the initial population of themes is, in effect, partitioned 
into subgroups or blocks, characterized by relative internal homogeneity. Each 
step of the iterated process halves blocks, with the analyst deciding on substantive 
grounds the number of steps that yields an optimal model. The result is an image 
matrix (see Figure 1b), comprised of high-density blocks of often-linked themes 
and, in some cases, a “null block” of themes not connected to any others. These 
diagonal blocks are represented as circles in figures 2 and 3. Figure 1b also shows 
several relative high-density zones, which denote links across blocks. In effect, the 
resulting block models show clusters of themes that “travel together,” meaning they 
have strong relationships to other themes in their respective block as well as similar 
links to themes from other blocks.* Using data from the 1980 narratives, Figure 1 
illustrates the block modeling process, moving from the original jj adjacency matrix 
(Figure 1a), to its permuted version partitioned into five blocks (Figure 1b).
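
To make the logic of a single CONCOR split concrete, the following is a simplified sketch, not the software implementation used for the analysis. It assumes a small hypothetical theme-theme matrix, correlates columns only, treats zero-variance columns as uncorrelated, and performs one split; a full analysis would repeat the split within each resulting block. All function names are illustrative.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)

def correlate_columns(matrix):
    """Matrix of correlations between every pair of columns."""
    cols = list(zip(*matrix))
    return [[pearson(a, b) for b in cols] for a in cols]

def concor_split(adjacency, max_iter=100, tol=1e-9):
    """One CONCOR split: iterate correlations, then divide themes by sign."""
    corr = correlate_columns(adjacency)
    for _ in range(max_iter):
        nxt = correlate_columns(corr)
        delta = max(abs(a - b) for ra, rb in zip(corr, nxt) for a, b in zip(ra, rb))
        corr = nxt
        if delta < tol:
            break
    # After convergence, entries approach +1 or -1; the sign pattern of the
    # first row separates the themes into two blocks.
    first = [j for j, v in enumerate(corr[0]) if v > 0]
    second = [j for j, v in enumerate(corr[0]) if v <= 0]
    return first, second

# Hypothetical theme-theme matrix: themes 0-1 and themes 2-3 tend to co-occur.
adjacency = [
    [4, 3, 1, 0],
    [3, 4, 0, 1],
    [1, 0, 4, 3],
    [0, 1, 3, 4],
]
print(concor_split(adjacency))
```

Applying `concor_split` again to the rows and columns of each returned block would halve the blocks a second time, which corresponds to the analyst's choice of how many steps to run.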

This block modeling procedure is fundamentally descriptive, meaning that
it does not reveal patterns of causal association. Rather, it provides a way to
summarize and view the thematic macro-structure that emerges from multiple
narrative accounts. Block models, then, serve as an alternative to more conventional
organizing methodologies, such as content analysis. We argue that this network
approach produces a contextualized mapping of narrative themes, drawing attention
to complex interactive patterns across sets of themes. It can also serve as a
foundation for subsequent qualitative analysis addressing other key forms of context,
associated with discursive variations on the presentation of a particular theme and
the manner in which themes are highlighted or otherwise solicited by institutional
officials.

Table 2: Population of Themes
Blame: City
Blame: KKK
City avoiding 11/3
Complicity
Context for conflict
Could have happened anywhere
CWP doing good/unselfish work
CWP incompetence
CWP provocation
CWP violent
Fear
Informant
Injustice
Massacre
Media bias
Morningside as battlefield
No conspiracy
No CWP provocation
No racial motivation
Outsiders
Police followed procedures
Police inattention/neglect
Police incompetence
Police lacked information
Police repression/hostility
Police unjust
Shootout
Space violated
Trials unfair
Under siege
Uneven fight
Victims were targeted

To construct a population of themes, we employed an inductive approach, compiling
substantively relevant themes as they arose within each transcript. We identified a
total of 32 themes tied to November 3rd, listed in Table 2. As part of our explanation
of the resulting thematic structure, we subdivide participants into constituencies.
The constituency variable was constructed using the NCAC and GTRC background
statements about participants; we assigned individuals to one of 11 constituency
categories: CWP, KKK/Nazi, city (including police), religion, media, legal, business,
education, textile, activist or Morningside resident. We also further collapse
constituency divisions into “institutional” and “non-institutional” categories,
reflecting participants’ orientation to city officials.


Analysis 


Figures 2 and 3 present block models graphically representing the thematic 
structure of November 3rd narratives in both the 1980 community meeting
and the 2005 GTRC public hearings. Blocks in each model represent clusters 
of themes that meet our standard for structural equivalence, meaning that each 
theme within a block has similar relationships to themes from other blocks. 
The presence of a line connecting two distinct blocks indicates that the density 
of ties across themes within those blocks exceeds the overall density of ties in 
the network. Substantively, these inter-block ties represent narrative bridges. 
Any block represented by an arrow directed to itself (sometimes referred to as 
a “1-block”) has internal coherence, meaning that the cluster of themes within 
the block tends to be invoked in combination. Figure 2 also includes a “null
block,” or set of themes not invoked by any participants during the 1980 NCAC
meeting, mostly because they involve events occurring after the 1980 hearings.

Figure 2. Thematic Structure (1980)
Notes: Themes in boldface are highly central, with a centrality score at least one
standard deviation above the mean (see Table 3).

Block 1: Massacre, Blame: KKK, Uneven fight, Police inattention/neglect, CWP doing
good/unselfish work, Context for conflict

Block 2: Police incompetence, Complicity, Blame: city

Block 3: Shootout, Fear, Police followed procedures, CWP provocation, Police
repression/hostility

Block 4: Police unjust, Outsiders, Media bias, No racial motivation

Block 5 (null block): All other themes
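
The rule for drawing lines and self-directed arrows (a block-to-block density that exceeds the overall network density) can be sketched as follows. This is a minimal illustration that assumes ties have been binarized (any positive co-occurrence counts as a tie) and excludes the diagonal; the matrix, block assignment, and function names are hypothetical.

```python
def density(adjacency, rows, cols):
    """Share of possible ties present between two sets of themes (diagonal excluded)."""
    cells = [(i, j) for i in rows for j in cols if i != j]
    if not cells:
        return 0.0
    return sum(1 for i, j in cells if adjacency[i][j] > 0) / len(cells)

def image_matrix(adjacency, blocks):
    """1 where block-to-block density exceeds overall density, else 0."""
    everyone = list(range(len(adjacency)))
    overall = density(adjacency, everyone, everyone)
    return [
        [1 if density(adjacency, a, b) > overall else 0 for b in blocks]
        for a in blocks
    ]

# Hypothetical binary theme-theme matrix for five themes; theme 4 is never linked.
adjacency = [
    [0, 1, 1, 0, 0],
    [1, 0, 1, 0, 0],
    [1, 1, 0, 1, 0],
    [0, 0, 1, 0, 0],
    [0, 0, 0, 0, 0],
]
blocks = [[0, 1], [2, 3], [4]]  # assumed output of an earlier partitioning step

image = image_matrix(adjacency, blocks)
print(image)
```

In the resulting image matrix, a 1 on the diagonal marks a block with internal coherence (a “1-block”), a 1 off the diagonal marks a bridge between two blocks, and a row of zeros corresponds to a null block.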

Table 3 presents a measure of each theme’s influence in the overall network 
of themes. Specifically, we report Freeman degree centrality measures, which 
capture the number of ties linking any given thematic “node” to other themes 
(Freeman 1979), standardized to allow for comparisons both within and across 
our 1980 and 2005 data. 
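
A hedged sketch of this measure follows. The normalization used here (each theme's degree divided by the sum of all degrees, so that scores sum to one) is an assumption, chosen because it would be consistent with a mean of roughly 1/32 across 32 themes; the use of the population standard deviation for the cutoff is likewise assumed. The star-shaped example network and function names are hypothetical.

```python
import statistics

def standardized_degree(adjacency):
    """Degree of each theme (row sum, diagonal excluded) as a share of total degree."""
    degrees = [
        sum(v for j, v in enumerate(row) if j != i)
        for i, row in enumerate(adjacency)
    ]
    total = sum(degrees)
    return [d / total if total else 0.0 for d in degrees]

def highly_central(scores):
    """Indices of themes scoring at least one standard deviation above the mean."""
    cutoff = statistics.mean(scores) + statistics.pstdev(scores)
    return [i for i, s in enumerate(scores) if s >= cutoff]

# Hypothetical network: theme 0 is tied to each of the four other themes.
adjacency = [
    [0, 1, 1, 1, 1],
    [1, 0, 0, 0, 0],
    [1, 0, 0, 0, 0],
    [1, 0, 0, 0, 0],
    [1, 0, 0, 0, 0],
]

scores = standardized_degree(adjacency)
print(scores)
print(highly_central(scores))
```

Because the scores are shares of total degree, they can be compared across networks of different sizes and densities, which is the purpose of standardizing before comparing the two time periods.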


The Structure of November 3rd Narratives


Figure 2 shows the structure of early accounts of November 3rd, as represented
by testimony from the 1980 community meeting. Blocks 1 and 3 clearly lay out
two competing perspectives on the event: a characterization of the conflict as a
“massacre,” an “uneven fight” caused by the KKK’s opposition to the CWP’s “good
work” vs. a “shootout” fundamentally caused by the CWP’s “provocation” of the
klan. Note that, in the latter account, the CWP’s stance provides sufficient
explanation for the violence, while the former account (Block 1) emphasizes a broader
and more complex “context for conflict,” tied to the history of racial institutional
barriers in North Carolina and recent associated battles between the CWP and
klan. During the NCAC hearing, Ed Whitfield, representing the grassroots
organization “Concerned Citizens Against the Klan,” highlighted such broad contexts,
arguing that the conflict was rooted in the “acute lack of political and economic
power among black people here in this community.” (NCAC 1980:9) Likewise,
NAACP official John Ervin’s testimony criticized policies that left black North
Carolinians shortchanged in terms of “redevelopment” funds and access to fair
labor practices, while educator Sara Herbin noted how racial inequities played
out in North Carolina schools (NCAC 1980).

Figure 3. Thematic Structure (2005)
Notes: Themes in boldface are highly central, with a centrality score at least one
standard deviation above the mean (see Table 3).

Block 1: Massacre, Complicity, Trials unfair, No CWP provocation, Police
incompetence, Police unjust, CWP doing good/unselfish work

Block 2: CWP incompetence

Block 3: Uneven fight, Blame: KKK, Fear

Block 4: City avoiding 11/3, Police repression/hostility, Injustice, Blame: city,
Police inattention/neglect, Informant, Context for conflict, Under siege, Victims
were targeted, Media bias

Block 5: CWP violent, CWP provocation, Shootout, Could have happened anywhere, No
conspiracy, Police lacked information, Police followed procedures

Block 6: Space violated, Outsiders, No racial motivation

This emphasis on context extends to the role of the police in the “massacre”
narrative, in which police were seen as “neglectful” and “inattentive” to the risk
of violence posed by the November 3rd march. Local NAACP President George Simpkins
noted, for example, that police had long maintained discriminatory hiring practices
that exacerbated their poor treatment of blacks, and CWP leader Nelson Johnson
more pointedly blamed police for purposefully avoiding the march staging area
as the klan caravan arrived (NCAC 1980). This strain in Block 1 also connects
to ideas related to the “complicity” of both police and the city, represented by the
link between the first and second block in the figure.

Table 3: Standardized Centrality Scores for Nov 3 Theme Categories

Themes                           1980 Public Hearings   2005 Public Hearings
Blame: City                      .054                   .057*
Blame: KKK                       .049                   .047
City avoiding 11/3               .000                   .048
Complicity                       .045                   .058*
Context for conflict             .124                   .090*
Could have happened anywhere     .000                   .009
CWP doing good/unselfish work    .036                   .033
CWP incompetence                 .000                   .004
CWP provocation                  [illegible]            .045
CWP violent                      .000                   .027
Fear                             .031                   .021
Informant                        .000                   .068*
Injustice                        .000                   .033
Massacre                         .049                   .012
Media bias                       .099*                  .025
Morningside as battlefield       .000                   .000
No conspiracy                    .000                   .035
No CWP provocation               .000                   .035
No racial motivation             .049                   .013
Outsiders                        .099*                  .015
Police followed procedures       .045                   .018
Police inattention/neglect       .090*                  .045
Police incompetence              .018                   .017
Police lacked information        .000                   .010
Police repression/hostility      .013                   .020
Police unjust                    .013                   .031
Shootout                         .031                   .008
Space violated                   .000                   .036
Trials unfair                    .000                   .078*
Under siege                      .000                   .012
Uneven fight                     .036                   .027
Victims were targeted            .000                   .021
Mean Centrality                  .031                   .031

*Denotes themes that have centrality scores at or above one standard deviation from the
mean for that group of data.

The common ground between the otherwise opposed “shootout” and “massacre” 
narratives is represented by mutual connections to themes central to Block 4, 
especially the idea that the conflict represented a clash of “outsiders” (rather than 
groups who were truly part of the Greensboro community). Speakers ranging 
from klan leader Virgil Griffin—who decried “outside Communist agitators” who 
fomented the mistaken impression that “blacks and whites hate each other” —to 
Mayor E.S. Melvin and newspaper editor William Snider all forcefully argued that 
the conflict involved “unwelcome outsiders.” (NCAC 1980) Like blocks 1 and 3,
Block 4 is a “1-block,” indicating that the “outsider” theme emerges in conjunc- 
tion with the pervasive sense that the strong association between the violent events 
and the Greensboro community was a product of “media bias.” Snider paired his 
criticism of “radically oriented organizations who came here from other communi- 
ties” with a strong claim that “the national media’s fragmented, and sometimes 
biased, coverage contributed to the confusion.” (NCAC 1980:15) Mayor Melvin
similarly felt that the media’s failure to acknowledge that the November 3rd clash
was not an accurate barometer of race relations in Greensboro generally “may have 
sold newspapers but did little to further the prospects of truth and peace.” (NCAC 
1980:5) These broadly shared emphases are reflected in Table 3, which shows that 
both “outsiders” and “media bias” are among the most highly central themes associated
with November 3rd.

Figure 3 shows that certain aspects of this structure changed by 2005. In the 
GTRC public hearings, the idea of November 3rd as “shootout” has become considerably
less central as a theme, with Table 3 showing its standardized centrality score (C)
falling from .031 in the 1980 data to .008 in the 2005 hearing data. Perhaps more
tellingly, Figure 3 shows that this theme—while still retaining coherence in Block 5 
(which, importantly, is a 1-block) alongside associated themes demonizing the CWP 
as “provocative” and “violent,” and rationalizing police ineffectiveness as “lacking 
information” while “following procedures” —is now isolated from all other narratives. 

The “massacre” narrative, in contrast, while becoming less central as an in- 
dividual theme (C = .049 in 1980; .012 in 2005), retains its salience relative to 
broader critiques of police, city officials, and the KKK contained in blocks 1, 3 
and 4. By 2005, the conception of November 3rd as a “massacre” was closely linked
to the idea of “complicity,” whose centrality was enhanced by the disclosure of 
police informants and “unfair” trials held in the mid-1980s. Each of these themes 
is included together in Block 1, and both the “complicity” and “trials unfair”
themes are highly central (C = .058 and .078, respectively). Local minister Cardes 
Brown’s testimony clearly illustrates this clustering. After reflecting on what he 
referred to as “the massacre of November 3” by asking how such a tragedy could 
have happened in Greensboro, he concluded that “it had to have been allowed 
to happen” and proceeded to outline a number of ways in which police and the 
judicial system contributed to that dynamic (GTRC 2005). 

This block of themes is also linked to a more global critique of the events surrounding
November 3, centered on the city’s culpability for the tragedy, given its sanction
of “police repression and hostility,” the placement of informants, and its subsequent
“avoidance” of attendant issues. Former CWP member Sally Bermanzohn (2005)
clearly expressed these linkages during the GTRC’s second public hearing:

    “My most profound conclusion is that the Greensboro
    Massacre happened because of police behavior. There are
    lots of unanswered questions about other government
    agencies and officials, on the federal, state and local level.
    And I do not frankly think the police could have acted in
    isolation of other government officials.”



These themes are grouped together within Block 4, and both this narrative and 
the more familiar “massacre” narratives are, in 2005, also linked to emergent themes 
that represent the Morningside perspective. These narratives are prominently represented
in blocks 3 and 6, and incorporate ideas that Morningside was—as a number
of former residents expressed in private interviews with commission members—a
“battlefield” with residents’ “space violated” as they were “under siege.”

These emergent themes were cited by the Morningside constituency, and 87 
percent of overall invocations were by non-institutional actors. It is important 
to note the significant disjuncture that marks the perspective of Morningside 
vs. other constituencies. Distinct from the traditionally dominant clash between 
“massacre” and “shootout” narratives centered on adjudicating the roles of the 
CWP, KKK and police, an emphasis on non-institutional actors highlights the 
salience of the idea that the November 3rd violence—regardless of the constituency
at fault—represented a profound violation. As former Morningside resident Candy 
Clapp, in a private interview that preceded her public testimony, put it: “We were
under siege. We were like, captive in our own homes... We didn’t understand
what it was all about. Where did it come from? I mean, why would you come to
our neighborhood and do this?” (GTRC 2005) The inclusion of the Morningside
constituency in the GTRC process not only facilitated the expression of this dis- 
tinct perspective, but also served to enrich the longstanding “massacre” narrative 
by providing a vehicle for linking it to themes emphasized by other constituencies. 


The Contours of Temporal Change in Narrative Structure 


To explain observed change in these narratives over time, we assess the degree to 
which particular narrative shifts constitute direct or indirect effects of the emer- 
gence of post-1980 events within the 2005 narratives or were due to the changing 
makeup of voices included in the 1980 and 2005 community forums. To adjudi- 
cate between these sources of change, we focus both on specific constituencies (i.e., 
how do the themes invoked by particular constituencies vary across time periods?) 
and central themes (i.e., how do thematic citation patterns shift over time?).

Table 4 lists the themes offered by three key constituencies in 2005, in descending
order of citation frequency. The table also notes (in bold) themes that were
newly emergent in 2005, as well as those that were invoked in 1980 but absent
from the 2005 accounts. From this table, we see that themes cited within CWP
members’ narratives became notably richer over time; 15 of the 21 themes invoked
by this constituency were emergent in 2005, and no themes invoked in 1980
disappeared from CWP accounts offered to the GTRC. All emergent themes cited
by CWP constituents (i.e., those listed above the dividing line on the table) appear
to be either directly or indirectly tied to events that came to light after 1980.
For example, references to “informants” and “unfair trials” would not have held
any meaning in 1980, and it would have been difficult to claim that the city was
“avoiding November 3” so soon after the event itself.

Table 4: Themes in Descending Order of Citation Frequency During
2005 Public Hearings

CWP   City   KKK
Blame: City   Police lacked information   CWP provocation
Context for conflict   Police followed procedures   No conspiracy
Complicity   No conspiracy   Informant
Informant   Shootout   Could have happened anywhere
Trials unfair   Space violated   Context for conflict
Police inattention   CWP violent   CWP violent
Injustice   Context for conflict   Blame: KKK
Police unjust   CWP provocation   Shootout
Blame: KKK   Outsiders   Police unjust
CWP doing good/unselfish work   Media bias   Outsiders
Media bias   No racial motivation
Victims were targeted
No CWP provocation
Police repression/hostility
City avoiding 11/3
Massacre
Uneven fight
Under siege
Police incompetence
Space violated
Fear

Notes: The table summarizes several characteristics of the narrative themes offered by
each constituency. Themes are listed in descending order of citation frequency. Bolded
themes are those that emerged in 2005. Italicized themes were present in 1980 but not in
2005. Themes listed above each dividing line were cited by multiple constituents in 2005.

Other themes, in turn, were enabled by the presence of these emergent events. 
The discovery of state informants, for example, aligns with themes of “complicity” 
and “police repression,” along with a sense that “victims were targeted.” Similarly, 
the controversial acquittals in both criminal trials fueled a pervading sense of
“injustice.” Claims of “no CWP provocation” constituted another type of reaction
to subsequent events, as they were defensively invoked in response to the demonization
of the CWP that had occurred in various public forums tied to November
3rd, including the 1980 event. But a crucial point is that none of these emergent
themes were inconsistent with the original set of narratives offered by CWP mem- 
bers. Indeed, these new themes seem to have resonated with CWP constituents 
precisely because they served either to defend their own role in the conflict or to 
supplement and strengthen, rather than challenge, their initial accounts. 
Both the city and the KKK narratives were less complex, incorporating fewer
themes and exhibiting a higher level of consistency over time. Both cases are similar
to the CWP in that temporal changes in narrative structure appear to be governed 
by strategic deployment of emergent events to reinforce, rather than transform, 
accounts advanced in 1980. Thus, the informant scandal and controversial trials 
that occurred later in the 1980s, rather than making city officials more receptive 
to charges of “injustice” or “conspiracy,” led them to argue that police “lacked 
information.” Similarly, emergent themes within KKK accounts at times draw on 
new information (e.g., “informant”) to reinforce the unambiguous perspective 
that the CWP was to blame for the incident. This focus has become more explicit 
over time, with 1980 claims of “police unjust” disappearing, replaced by themes 
that highlight the “violent” nature of the CWP and defensively reject the idea 
that the shootings involved any sort of official conspiracy. With this framing, the 
KKK effectively allied with institutional efforts to defend against a perception of 
conspiracy by 2005, reinforcing the perceived link created when the city’s settle- 
ment of the 1985 civil suit was made on behalf of its police force as well as the 
KKK/Nazi defendants (GTRC 2006). 

Examining particular themes that have become significantly more or less central
over time reveals the interplay of the strategic incorporation of emergent
events across constituencies and the changing representation of institutional vs.
non-institutional actors in the GTRC process. For themes related to November 3
that have significantly increased in overall centrality, we see that such shifts are not 
a product of all constituencies invoking these themes at proportionately increased 
rates, but rather by particular constituencies disproportionately citing the themes 
and others avoiding them altogether. For example, “complicity” was nearly twice 
as likely to be cited in 2005 as in 1980, but this increase was almost entirely due 
to increased invocations by CWP members. This distinction was magnified by the 
significantly larger role played by the CWP constituency in the GTRC’s process, 
compared to the 1980 community meeting. 

Similarly, the “trials unfair” theme—which had no meaning in 1980, but be- 
came highly central in 2005—was unevenly invoked across constituencies; the 
theme was highlighted by all six CWP members testifying during the GTRC’s
public hearings, but only by a smattering of those in other constituencies (i.e., 
22% of activists, 25% of educators, 25% of lawyers and a single religious leader). 
Thus, the centrality of that particular theme (at C = .078, it was the second 
most central theme in 2005) was due mostly to its resonance among a single 
disproportionately-represented constituency in the GTRC process. 

A related process was at work for themes that significantly decreased in cen- 
trality over time. The idea of “CWP provocation” was, in 1980, largely advanced 
by city officials and KKK members (80% of citations emerged from these two 
constituencies). By 2005, the two KKK members participating in the GTRC
hearings continued to invoke this theme, but because of the relative absence of 
city officials from the process (along with the defensive invocation of the competing
“no CWP provocation” theme by multiple CWP members and their allies),
it had become significantly more marginal. The dynamics of such shifts reinforce 
our belief that, while subsequent events have the capacity to transform existing 
narratives, those events tend to resonate unevenly across constituencies, as their 
members instrumentally select events to construct reworked frames that reinforce, 
rather than challenge, the central principles of their original narrative.’ Given this 
constraint on the malleability of narratives (see also Schudson 1992), it appears 
that the primary cause of global shifts in narrative structure over time is the chang- 
ing representation of key constituencies. 


Discussion 


In this analysis, we have applied an established network methodology to a novel 
context. Exploiting the fact that narratives are characterized by—and gain distinc- 
tion through—actors invoking and combining multiple themes, we use block 
models to describe the structure of narratives related to the contention occurring 
on Nov. 3, 1979, in Greensboro, North Carolina. This descriptive structure then 
enabled us to undertake an analysis focused on the sources of continuity and 
change in November 3rd narratives.

What do the trends tell us about the durability of collective memory? Perhaps 
most obviously, the passage of time affects the structure of collective memory as 
subsequent occurrences and information reactivate, and potentially transform, 
the core event being remembered (Bearman et al. 1999). In Greensboro, a set of 
events occurred after 1980 that altered one’s ability to make particular claims. Most 
salient to our case were two criminal trials that both resulted in the acquittal of 
the klan/nazi defendants. Paired with the controversial disclosures that both police 
and the FBI had an informant placed among the white supremacists present in 
Greensboro, themes centered on active support or defense of police action became 
considerably less viable. Thus, claims that police had positive relations with the 
overall Greensboro community disappeared from 2005 accounts, and conversely 
themes painting police as “repressive” or “unjust” became increasingly central.° The 
fallout from the trials also enabled the emergence of the “trials unfair” and general
“injustice” themes, and enhanced the centrality of “complicity” and “blame: city”
themes. Such shifts occurred unevenly across constituencies, as particular emergent 
themes resonated with individuals who predominantly deployed them to strategi- 
cally bolster, rather than challenge, the structure of their earlier narratives. 

This ability of constituencies to operate strategically was, in turn, shaped by 
the institutional context of the GTRC’s hearings. Consistent with previous work
emphasizing the contested nature of many, if not all, collective representations 
(Confino 1997; Kansteiner 2002), the GTRC process clearly illustrates how out- 
comes are shaped by the ways in which particular individuals and groups either 
participate in, or are excluded from, the process. In effect, institutions that operate 
as agents in the advancement of a collective memory serve to refract the expression
of that memory, often by providing a venue for certain voices to be heard.
In 1980, institutional actors—especially city officials—were heavily represented
in community dialogues about November 3, while CWP members and residents of
Morningside Homes were almost entirely absent.’ This bounding of recognized actors
almost certainly skewed the cumulative rendering of the November 3rd event,
providing a means to create a unified “official” impression that, for example, police
were “following procedures” and that there was general respect across races in the
community. Perhaps paradoxically, this social closure process likely contributes to
the polarization of attitudes, as non-institutional actors add impetus to present a
critique of the near-hegemonic views articulated by officials.

In contrast, later narratives that emerged during the more inclusive GTRC 
process were considerably more complex, with a range of overlapping positions 
complicating the stricter “shootout vs. massacre” conceptions that marked the 1980 
meeting. Given the space provided within the GTRC process for individuals to
strategically deploy post-1980 events to reinforce longstanding narrative accounts, 
we find that in the case of Greensboro, shifts in the macro-level narrative structure 
were not primarily due to transformations within constituencies, but rather to an 
increased or decreased prominence given to particular perspectives stemming from
changes in the overall constituency makeup. This finding points to the primacy 
of the politics of inclusion and exclusion in any process that seeks to construct or 
reproduce a collective representation of a discrete event or social process. 

Such contextual factors interacted with institutional features of the hearing pro- 
cesses themselves. While a full examination of these institutional features is beyond 
the scope of this article, it is important to note that commission officials’ actions had 
the capacity to facilitate or constrain speakers’ efforts to highlight specific themes 
tied to the interests of their respective constituencies. To the extent that such efforts 
are stratified, certain constituencies have greater or lesser opportunity to highlight 
themes associated with their own interests and identity commitments (Wagner- 
Pacifici 1994). Such processes were evident in Greensboro, as the directive and 
sometimes combative role played by the GTRC’s commissioners contrasted with 
the highly open structure of the NCAC hearings, which provided no opportunity 
to challenge testimonial accounts. Such dynamics were reinforced by significant 
shifts in the overall climate surrounding race relations in America—in particular the 
increased acceptance of attitudes sympathetic to militant or otherwise controversial 
aspects of the Civil Rights Movement, and the consequent marginalization of the 
overt racism of groups like the KKK. While the narrative shifts summarized in 
figures 2 and 3 directly represent cumulated changes in individual accounts, the 
contours of those narratives are impacted by these broader contextual factors. 

As the container within which a wide range of actors could make claims, the 
GTRC, through its efforts to create a foundation for future dialogue, functioned 
also to create a venue for elevating and adjudicating claims. Wagner-Pacifici 
(1994) noted that the enduring influence of such claims likely has less to do with
the dynamics of hearing testimonies than with the ways in which such testimony 
is subsequently crafted into “master narratives” within formal reports. As such, 
further research is needed to examine how the narrative structure presented here 
compares to the formal histories contained in the 1980 NCAC Citizens’ Review 
Committee report and the GTRC’s 2006 Final Report. 

Most generally, the impact of the GTRC on the trajectory of race relations 
in Greensboro remains an open question. But it is clear that the effort is part 
of an expanding, and increasingly self-aware, movement in the United States 
that seeks redress for the painful legacies of racial violence. As this movement 
progresses —constructing memorials for lynching victims, prosecuting civil rights 
“cold cases,” organizing historical inquiries into contentious events, and so on—it 
will become increasingly important to understand how its constituent bodies 
mobilize the required resources, exploit favorable political climates and articulate 
their claims to broader publics. But to the extent that we value uncovering the 
mechanisms that shape outcomes, it will also be crucial to examine how such efforts 
are able to (re)construct and articulate resonant collective memories. 


Notes 


1. The Greensboro commission operated alongside a number of like-minded truth 
and reconciliation efforts, including the annual reenactment of a 1946 lynching at 
Moore’s Ford bridge in Georgia and a detailed state-sponsored report on the 1898 
Wilmington “race riot” (Wilmington Race Riot Commission 2006). While these and 
other similar initiatives are not formally truth and reconciliation commissions, their 
goals are similar (see Ifill 2007; Cunningham 2008). 


2. Transcripts are included in “Black White Perceptions: Race Relations in Greensboro,” 
a report prepared by the NCAC. Unfortunately, the report includes only detailed 
summaries of statements, compiled by a note taker at the meeting. While presumably 
the statements were fully transcribed—the NCAC publication includes references to 
page numbers from those transcripts—repeated searches and inquiries to Greensboro 
officials failed to produce a copy. While this data limitation opens up the possibility 
that the note taker filtered testimony, it seems unlikely that he/she would have done 
so in a way that would have systematically altered invocations of the broad thematic 
categories in our coding scheme below. 


3. See www.greensboroTRC.org. The third and final hearing dealt with the question, 
“What does the past have to do with the present and the future?” Though we are 
interested in this general question in a theoretical sense, there is no analogue to this 
theme in the narratives contained in the 1980 data, so we chose not to incorporate 
it into this analysis. 


4. Formally, clusters of themes in the high-density diagonal blocks are considered 
“structurally equivalent,” meaning that they exhibit relatively equivalent patterns of 
similarity and dissimilarity to the overall population of themes (i.e., occupy similar 
positions in the overall structure of themes) (Hanneman and Riddle 2005). In its 
strictest sense, equivalence represents a situation in which all themes within a block 
reach convergence (i.e., have identical ties to all other themes). In practice, this 
standard rarely yields robust solutions, so we employ a lower convergence criterion, 
or tolerance, of .2 (Borgatti, Everett and Freeman 1999). 


5. Emergent events also served to shape constituency representation in the latter period. 
For example, the fact that four lawyers testified in 2005 vs. none in 1980 is because 
of the occurrence of trials against the November 3rd Klan/Nazi perpetrators after the 
1980 hearings. 


6. A set of contemporary race-based scandals involving the Greensboro Police 
Department came to light during the GTRC process (Williams 2007), which likely 
served as another “subsequent event” reinforcing participants’ perspectives on themes 
related to November 3rd. 


7. One former Morningside resident, Ervin Brisbon, did take part in the 1980 
meeting examined here, due to his role as a member of the officially-sanctioned 
“Citizens Review Committee.” Predictably, his was among the strongest critiques of 
institutional inequities in Greensboro; in the meeting, he summarized his take on 
race relations in the community by referring to them as “real bad” (NCAC 1980:11). 
Sex, Anger and Depression 


Robin W. Simon, Wake Forest University 
Kathryn Lively, Dartmouth College 


A social problem that has preoccupied sociologists of gender and mental health is 
the higher rate of depression found among women. Although a number of hypoth- 
eses about this health disparity between men and women have been advanced, none 
consider the importance of subjectively experienced anger. Drawing on theoretical 
and empirical insights from the sociology of emotion, we hypothesize that: (1) 
intense and persistent anger is associated with more symptoms of depression, 
and (2) sex differences in the intensity and persistence of anger are involved in the 
sex difference in depressed affect. Analyses of data from the 1996 GSS Emotions 
Module provide support for these two hypotheses and strongly suggest that women’s 
intense and persistent anger plays a pivotal role in their high rate of depression. We 
discuss the extent to which sex differences in these emotions are a function of social 
factors, biological factors, or a complex interaction between them. We also com- 
ment on the implications of our findings for future theory and research on gender, 
emotion and mental health. 





Introduction 


A major problem that has long puzzled gender and mental health scholars is 
the relatively high rate of depression found among women. Sociologists have 
developed several hypotheses to explain this health disparity between genders, 
focusing on women’s greater exposure and vulnerability to stress as well as 
gendered-responses to stressors. However, while these hypotheses shed light on 
social structural, social psychological and socio-cultural factors that contribute 
to the female excess of depression, we offer an alternative set of hypotheses. 
Drawing on theoretical and empirical insights from the sociology of emotion, 
we hypothesize that: (1) intense and persistent feelings of anger are associated 
with more symptoms of depression, and (2) sex differences in the intensity and 
persistence of anger are involved in the sex difference in depressed affect. Sex 
differences in the subjective experience of these emotions may be a function of 
social factors, biological factors or an interaction between them. Although some 
mental health researchers have speculated about the significance of anger for the 
prevalence of depression among women, the links between sex, anger and this 
mental health problem have not been investigated in the general population of 
adults. In this research, we evaluate our hypotheses with data from a nationally 
representative sample. We expect that women’s intense and persistent anger plays 
an important role in their high rate of depression. 


Background 
The Sex Difference in Depression 


One of the most consistent and oft cited findings of research on mental health 
is that women have a higher rate of depressive disorders and report more symp- 
toms of depression than men. Epidemiological studies based on non-clinical 
populations of adults find that women are twice as likely as men to experience 
this mental health problem (Kessler et al. 2003). The female excess of depressed 
affect is the leading cause of disease-related disability among women and is as- 
sociated with a host of personal, social and economic problems for themselves 
as well as their families. 

Not surprisingly, sociologists have developed a number of hypotheses about 
this health disparity. Several decades ago, Gove (1972) argued that the higher rate 
of emotional disturbance among women is due to their roles in society, which are 
more stressful than men’s. Building on Gove’s sex-role theory of mental illness, 
research has evaluated the exposure hypothesis, which posits that gender inequality 
in the family and workplace differentially exposes men and women to role-related 
stress. However, while this research has enhanced our understanding of the re- 
lationship between role involvements and emotional well-being, it indicates that 
differences between men’s and women’s roles contribute to, but do not explain, 
women's mental health disadvantage (Thoits 1986). 

The inability of the exposure hypothesis to account for the gender gap in de- 
pression led to the development of the vulnerability hypothesis, which claims that 
the stressors to which individuals are exposed have a greater impact on women’s 
mental health. This hypothesis focuses on social psychological rather than struc- 
tural factors, such as women’s insufficient coping and social support resources, 
which render them more vulnerable than men to the adverse emotional effects 
of stress (see Aneshensel 1992 and Thoits 1995 for reviews). However, while this 
research has expanded our knowledge about social factors that both mediate and 
moderate the relationship between sex, stress and emotional distress, it indicates 
that women are not more vulnerable than men in general. Rather, some stressors 
are more distressing to women whereas others are more distressing to men (Pearlin 
and Lieberman 1979; Simon 1998). 

Scholars have recently turned their attention to the gendered-response hypothesis, 
which argues that women are neither more exposed nor more vulnerable to stress 
than are men, but that males and females express emotional upset with different 
types of mental health problems. A growing body of work finds that women tend 
to respond to stress with internalizing problems such as depression, while men tend 
to respond with externalizing behaviors such as antisocial conduct and substance 
abuse (Aneshensel et al. 1991; Lennon 1987; Simon 2002; Umberson et al. 1986). 

Rosenfield (2000, 2005) attributes gendered-expressions of distress to gender- 
differentiated structures that develop in adolescence. She finds that females tend 
to develop an “other-focused” self that predisposes them to express distress with 
depression, whereas males tend to develop an “ego-focused” self that predisposes 
them to express distress with substance abuse problems. Simon (2002, 2004) at- 
tributes gendered-expressions of distress to Americans’ emotion culture, which in- 
cludes gender-linked norms about the experience and expression of emotion. She 
argues that depression is a culturally acceptable emotion for females, but not for 
males. Men’s higher rate of substance problems reflects their tendency to manage 
(i.e., suppress) inappropriate feelings of depression with mood-altering substances. 

However, while the gendered-response hypothesis begins to unravel the com- 
plex set of factors that contributes to sex differences in both the experience and 
expression of emotional upset, other factors may be involved in the female excess 
of depression. It is possible that intense and persistent subjectively experienced 
anger (more common among women than men) also plays a role in depressed 
affect. Although some mental health scholars suggest that sex differences in anger 
underlie the sex differences in depression, most sociologists who study anger 
have focused on this emotion as an outcome of social disadvantage rather than 
as a mediator of the relationship between disadvantaged statuses and depression 
(Mabry and Kiecolt 2005; Ross and Van Willigen 1996; Schieman 1999, 2000). 


Sex, Anger and Depression 


The emotion culture of the United States includes gendered norms about anger; 
while anger is an acceptable emotion for males, it is an inappropriate emotion for 
females. Emotions scholars tell us that gendered norms about anger in the United 
States emerged in the early 19th century, a period when males became identified 
with the public sphere of paid work and females with the private sphere of the 
family (Cancian and Gordon 1988; Shields 2002; Stearns 1992). Stearns and 
Stearns (1996) document that although anger control became a cultural ideal 
for men and women, it was especially important for women who were charged 
with creating a retreat from the competitive workplace. Gendered norms about 
anger, and their corresponding expectations that females should avoid and suppress 
these feelings, persist despite the influx of women into the labor force during 
the second part of the 20th century. 

A consequence of these emotion norms is the widespread belief that anger is experienced 
and expressed more frequently by males than by females. Gender, mental 
health and emotion researchers, particularly those who view emotions as socially 
constructed, assume that females are socialized to suppress rather than express anger 
(Campbell 1991; Hochschild 1979). Some scholars have suggested that the greater 
prevalence of depression among women is due to their tendency to turn anger 
inward rather than outward as do men (Frankel 1991; Tavris 1982; Weissman and 
Paykel 1974). This argument is based on a “catharsis” model of emotion rooted 
in psychoanalytic theory and assumes that the uninhibited expression of anger is 
crucial for the development and maintenance of emotional well-being; when angry 
feelings are suppressed, they turn inward and people feel depressed. 

In contrast to prevailing cultural beliefs and scholarly assumptions about 
women’s anger, recent sociological research based on national data reveals that 
men and women do not differ in the frequency with which they experience anger 
(Simon and Nath 2004). Numerous psychological studies report similar findings 
(Kring 2000; Thomas 1993). Moreover, while there are some differences in the 
ways in which men and women manage anger (e.g., women tend to talk with 
others while men tend to use substances), there is no evidence that women are 
more likely than men to suppress these feelings (Lively and Powell 2006; Simon 
and Nath 2004). Indeed, the tendency for men to use mood-altering substances to 
manage anger suggests that they are as likely as women to suppress these negative 
emotions—albeit in different and gendered ways. Research does, however, find sex 
differences in other dimensions of anger. Ironically, women report more intense 
and persistent anger than do men (Simon and Nath 2004), a finding that is also 
echoed in several psychological studies (Averill 1982; Brody et al. 1995). 

Drawing on theoretical and empirical insights from the sociology of emo- 
tion, we argue that women’s more intense and persistent anger—rather than their 
presumed tendency to suppress these feelings—is involved in their high rate of 
depression. That is, we posit that intense and persistent subjectively experienced 
anger mediates the well-documented relationship between sex and depression. At 
the same time emotion norms discourage women from experiencing and express- 
ing anger, sociological research finds that their unequal status in the family and 
workplace disproportionately exposes them to anger-eliciting social interactions. 

Sociologists of emotion have theorized about the link between anger and de- 
pression and provide clues about why intense and persistent angry feelings are 
associated with depressed affect and why women report more of these emotions 
than men. These theorists focus on the structural—rather than cultural—basis of 
anger and depression. Kemper’s social interactional theory (1978) argues that 
people experience intense negative emotions such as anger and rage in interac- 
tions that they perceive as unjust. If such interactions persist, these emotions may 
become more enduring emotional states (or moods) such as depression. Because 
disadvantaged persons are more likely than their advantaged peers to be subjected 
to unjust interactions—and are less likely to have the power to alter them—they 
are more likely to experience intense and persistent anger as well as more enduring 
feelings of depression (also see Barbalet 2001). 

Collins’ theory of interaction rituals (2004) also argues that people’s structural 
location and subsequent social interactions affect their transient emotions (e.g., 
anger and joy) and moods (e.g., depression and happiness), the latter of which 
he refers to as emotional energy. Intense transient emotions that arise in social 
interactions and which persist influence an individual's overall emotional energy. 
In the case of anger, people who experience intense and enduring emotions that 
emerge in unjust interactions also experience a degradation of emotional energy 
(i.e., an increase in feelings of depression). 

Like Kemper, Collins posits that intense transient feelings of anger and endur- 
ing feelings of depression are more common among disadvantaged than advan- 
taged persons. However, unlike Kemper, Collins theorizes that the relationship 
between transient emotions and emotional energy is bi-directional rather than 
unidirectional. Just as individuals whose anger is intense and persistent may expe- 
rience decreased emotional energy, persons with low emotional energy may select 
into unjust social interactions that elicit intense and enduring anger—a supposi- 
tion that is consistent with affect control theory (Heise 1979), theories about self- 
verification (Swann and Brown 1990), and symbolic interaction more generally. 

Although they emphasize somewhat different emotional processes, Kemper 
and Collins provide compelling reasons to expect an association between intense 
and persistent anger and depression—which include feelings of sadness, loneli- 
ness, fearfulness, restlessness and worry. These emotions are key components of 
symptom scales that assess the prevalence of depression in the general population 
such as the CES-D (Radloff 1977) and SCL-90 (Derogatis and Cleary 1977). 

Support for these ideas can be gleaned from research that documents the emo- 
tional consequences of gender inequality in the family and workplace for women. 
Hochschild’s seminal qualitative studies of female airline attendants (1983) and 
wives in dual-income families (1989) show that intense and persistent anger from 
repeated unfair social interactions was transformed into depression. Subsequent 
quantitative studies document that involvement in an inequitable division of 
household labor (Lennon and Rosenfield 1994; Ross et al. 1983) and service 
occupations (Erickson and Wharton 1997) are associated with more depressive 
symptoms. Women report more depression than men because they are more likely 
to experience unjust social interactions in the family and workplace. 

The latter studies did not investigate whether women in unfair work and family 
roles report more intense and persistent anger than men and women in more 
equitable situations. However, Sprecher (1986), Ross and Van Willigen (1996), 
and Lively et al. (2008) found that perceptions of inequity in social relationships 
are associated with more anger for both genders. Here again, women report 
more anger than men because they perceive greater inequity in their social roles 
and relationships. 

While these studies provide insight into the structural basis of anger and de- 
pression in the workplace and family, women’s unequal status in society may 
expose them to unfair anger-eliciting social interactions in other life domains as 
well (Griffiths 1995; Ridgeway and Smith-Lovin 1999). In other words, women’s 
high rate of depression may reflect unresolved, intense anger that is rooted in their 
unequal experiences across a variety of social contexts. 

Additional support for our ideas comes from studies of clinically depressed 
persons who report intense and persistent anger (Picardi et al. 2004; Newman 
et al. 2006). For example, Taylor’s (1996) study of women suffering from post- 
partum depression revealed intense and persistent anger towards their unhelpful 
partners, their helpless children, the male-dominated medical profession and 
cultural norms that expect new mothers to be “happy.” Hagan and Foster’s (2003) 
recent longitudinal study of gendered pathways to disadvantage based on the 
general population also shows that intense and persistent anger in adolescence 
is associated with the development of depression among females and substance 
abuse among males in early adulthood. 

In addition to research on the contribution of social factors to sex differences 
in anger and depression, research on the biology of affect documents biological 
contributions. Psychophysiological studies find that women have a stronger and 
longer emotional response to affective imagery than do men (Bradley et al. 2001; 
Schwartz et al. 1980). Neuroendocrinological research finds that women exhibit 
a more robust and longer lasting increase in certain bio-chemicals following exposure 
to stress than men (McCarthy and Konkle 2005). Medical and psychiatric 
studies have also uncovered a sex difference in the synthesis and utilization of neurotransmitters 
such as serotonin, bio-chemicals that regulate anger and depression 
(Heninger 1997). This research suggests that sex differences in the neurobiological 
and endocrinological systems predispose women to experience more intense and 
persistent feelings of anger and more depression than men. 

It is also possible that social and biological factors interact in complex ways 
to produce sex differences in the experience of anger and depression. That is, 
socio-cultural and social-structural influences on men’s and women’s feelings may 
reinforce biologically based sex differences in emotion. Support for this idea comes 
from recent sociological research on the joint influence of social and genetic fac- 
tors on happiness (Schnittker 2008) and alcohol dependence (Pescosolido et al. 
2008). We do not have the data to explore the joint influence of social and biological 
factors on sex differences in anger and depression. Nevertheless, theoretical 
and empirical insights into the cultural and structural basis of emotion, as well 
as developments in the biology of emotion, strongly suggest that an examination 
of the relationship between sex, anger and depression may provide insight into a 
problem that has long puzzled sociologists of gender and mental health. 

Going beyond existing explanations, we evaluate whether (1) intense and persistent 
angry feelings are associated with more symptoms of depression, and (2) sex 
differences in the intensity and duration of anger are involved in the sex difference 
in depressed affect. In addition to an examination of these two hypotheses, we also 
explore the extent to which women’s higher level of depression is due to their tendency 
to suppress rather than express anger, as some mental health scholars suggest. 
Data and Methods 
Data 


Our analyses employ data from the 1996 Emotions Module of the General Social 
Survey. The GSS, which has been conducted regularly since 1972, is based on a 
nationally representative sample of adults living in households in the United States. 
About half (N = 1,460) of the 2,904 respondents who were interviewed that year 
were asked a variety of questions about their emotions. Our analyses are based on 
the subsample of respondents who identified an event that made them angry in 
the past month (N = 1,125). 


Measures 


Symptoms of Depression 

These are measured with questions that asked respondents how many days in the 
previous week they felt: 1-sad, 2-blue, 3-lonely, 4-fearful, 5-restless, 6-worried, 
7-happy, 8-overjoyed and 9-excited. Items 7 through 9 were reverse coded. We 
divided the sum of responses by the number of items included in the measure; 
scores range from 0 to 6.33 days (alpha = .71). Note that although this measure is 
not identical to the CES-D and SCL-90, it includes the same emotions. 
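
The construction of this index can be sketched in a few lines. The following is a minimal illustration, not the authors' code: it assumes each item is recorded as a count of days from 0 to 7, that reverse coding means 7 minus the reported days, and that the item names, function names and example respondent are hypothetical. It also shows the standard formula for Cronbach's alpha, the reliability statistic reported above.

```python
from statistics import variance

NEGATIVE = ["sad", "blue", "lonely", "fearful", "restless", "worried"]
POSITIVE = ["happy", "overjoyed", "excited"]  # items 7-9, reverse coded


def depression_score(days):
    """Mean days per item after reverse coding the positive items.

    `days` maps each item name to the number of days (assumed 0-7) the
    respondent reported that feeling in the previous week.
    """
    total = sum(days[item] for item in NEGATIVE)
    total += sum(7 - days[item] for item in POSITIVE)  # assumed reversal rule
    return total / (len(NEGATIVE) + len(POSITIVE))


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents' item scores.

    Each row holds one respondent's scores, already reverse coded.
    """
    k = len(rows[0])
    item_vars = [variance([row[i] for row in rows]) for i in range(k)]
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical respondent: sad on 3 days, happy/overjoyed/excited every day.
respondent = {"sad": 3, "blue": 0, "lonely": 0, "fearful": 0, "restless": 0,
              "worried": 0, "happy": 7, "overjoyed": 7, "excited": 7}
print(depression_score(respondent))
```

Dividing by the number of items keeps the score in units of days, which is why the reported range has a maximum below 7.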


Intensity and Duration of Anger 

These are measured with single-item variables based on the following questions: 
“How intense was your anger?” (0 = not at all intense to 10 = very intense) and 
“How long did your anger last?” (1 = a few seconds to 6 = continuously). 


Anger Management Techniques 

Respondents were asked if they did one or more of the following things to change 
their angry feelings in the past month (yes = 1): 1-talked to the person I was angry 
at, 2-talked to someone else about how I felt, 3-yelled or hit something to let out 
my pent-up feelings, 4-left the situation, 5-tried to change the situation by doing 
something, 6-planned how to end the relationship with the person who made me 
angry, 7-thought about how to get revenge, 8-fantasized about a magical solution 
to the problem, 9-went out to get some exercise to make me feel better, 10-had a 
drink or took a pill, 11-prayed for help from God, 12-tried to forget it by doing or 
thinking about something else, 13-tried to think about the situation in a different 
way, 14-tried to accept the situation as it was, 15-waited for the feelings to pass, 
and 16-did something else. 


Socio-demographic Variables 

All analyses include respondents’ age (in years), education (in years), household 
income (in dollars), and race (white = 1; black = 1; other = 1). To reduce missing 
cases, we assigned imputed mean scores for household income to respondents 
with missing data (N = 124). Dichotomous variables for employment, marital and 
parental status (employed = 1, married = 1, parent = 1) are included in all analyses 
as well. While feminist scholarship reminds us that women experience unjust 
social interactions in other social roles and life domains (Griffiths 1995), these 
experiences are unfortunately not easily captured in survey data. Because research 
indicates that poor health is associated with more depression and women report 
more health problems than do men (Williams and Umberson 2004), all analyses 
also include self-assessed health (1 = poor to 4 = excellent). Finally, we measure sex 
as a dichotomous variable (female = 1). Although this variable does not capture 
the power structures that create gender or the enactment of gendered identities in 
social interaction (Risman 1998), it allows us to assess whether sex differences in 
anger are involved in the sex difference in depressed affect. 
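
The mean imputation applied to household income can be illustrated with a short sketch. This is an assumption about the mechanics rather than the authors' procedure: missing values are represented here as `None`, and the function name and income figures are made up for illustration.

```python
def impute_mean(values):
    """Replace missing entries (None) with the mean of the observed ones."""
    observed = [v for v in values if v is not None]
    mean = sum(observed) / len(observed)
    return [mean if v is None else v for v in values]


# Made-up household incomes in dollars; the second respondent did not answer.
incomes = [20000, None, 40000, 30000]
print(impute_mean(incomes))
```

Substituting the mean retains cases that would otherwise be dropped and leaves the variable's mean unchanged, although it understates its variance.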

It is important to mention that although we are advancing an alternative hy- 
pothesis about the female excess of depression, it complements existing hypotheses 
that attribute women’s higher rate of depression to their greater exposure and 
vulnerability to stress as well as gendered-responses to stressors. However, these 
studies did not investigate whether women’s more intense and persistent anger 
mediates the relationship between sex and depression. Certain types of stressors 
(e.g., events and situations that repeatedly expose individuals to unjust social 
interactions) may be depressing because they involve unresolved, intense anger. 

Also, by identifying dimensions of anger that mediate the relationship be- 
tween sex and depression, our research extends prior research that has focused on 
anger as an outcome of social disadvantage. Although they are both considered 
to be distressing emotions, anger and depression are distinct primary emotions 
that have different etiologies and evolutionary functions (Ekman 1999; Ortony 
et al. 1988). Moreover, feelings of anger tend to have a specific target whereas 
feelings of depression tend to be directed towards the self. In fact, anger is not 
included in depression screening scales such as the CES-D and SCL-90 because 
it is not considered to be a symptom of depression. In our sample, the intensity 
and duration of anger are only modestly correlated with depressive symptoms 
(.22 and .25, respectively). 

Finally, although our cross-sectional data do not allow us to assess the causal 
direction of the relationship between intense and persistent anger and symptoms 
of depression, they provide much needed insight into the associations among sex, 
anger and depressed affect in the general population at a single point in time. 
Keep in mind that at the same time intense and persistent anger may transform 
into depression (Kemper 1978), feelings of depression may lead to intense and 
persistent anger (Collins 2004). It is, of course, possible that intense and persistent 
anger and depression are concurrent emotions as research on clinically depressed 
persons suggests (Newman et al. 2006; Picardi et al. 2004; Taylor 1996). 

Table 1: Socio-demographic Characteristics of the Analysis Sample by Sex 

Socio-demographic Characteristics          Total         Male          Female
Age (mean years)                           42.8          42.4          [illegible]
  (standard deviation)                     [illegible]   (19.0)        [illegible]
Race (%)
  White                                    [illegible]   84.8          80.1
  Black                                    13.0          9.2           15.8
  Other                                    [illegible]   [illegible]   [illegible]
Education (mean years)                     [illegible]   13.9          13.5
Household income ($1,000s)                 [illegible]   $32,749       [illegible]
Employed (%)                               [illegible]   [illegible]   64.8
Married (%)                                47.1          [illegible]   [illegible]
Children less than 18 in the home (%)      38.8          [illegible]   43.3
Self-assessed health (mean score)          [illegible]   [illegible]   [illegible]
  (standard deviation)                     [illegible]   [illegible]   [illegible]
N                                          1,050         447           603

Note: Numbers in parentheses are standard deviations. 

To examine the relationship between sex and both the intensity and duration 
of anger as well as symptoms of depression, our first model for each dependent 
variable only includes sex. To investigate whether sex differences in these emotions 
are a function of differences between men’s and women’s socio-demographic 
characteristics, Model 2 includes their age, race, education, household income 
as well as their employment, marital and parental status, and self-assessed health. 
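
The logic of comparing nested models can be illustrated as follows. This is a rough sketch on simulated data, not a reanalysis of the GSS: the variable names, effect sizes, sample size and the small least-squares helper are all assumptions made for illustration. It fits one model with sex alone and a second that adds a further predictor (here a simulated anger score), then compares the coefficient for sex across the two.

```python
import random


def ols(X, y):
    """Return OLS coefficients (intercept first) via the normal equations."""
    rows = [[1.0] + list(r) for r in X]
    k = len(rows[0])
    # Augmented matrix [X'X | X'y].
    A = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
         + [sum(r[i] * yi for r, yi in zip(rows, y))] for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(k):
        p = max(range(c, k), key=lambda r: abs(A[r][c]))
        A[c], A[p] = A[p], A[c]
        A[c] = [v / A[c][c] for v in A[c]]
        for r in range(k):
            if r != c:
                f = A[r][c]
                A[r] = [vr - f * vc for vr, vc in zip(A[r], A[c])]
    return [A[i][k] for i in range(k)]


# Simulated data (assumption): sex shifts anger, and anger alone shifts the outcome.
random.seed(0)
n = 2000
female = [random.randint(0, 1) for _ in range(n)]
anger = [f + random.gauss(0, 1) for f in female]
depress = [a + random.gauss(0, 1) for a in anger]

# Model with sex only, then the same model with anger added.
b_sex_only = ols([[f] for f in female], depress)[1]
b_sex_adj = ols([[f, a] for f, a in zip(female, anger)], depress)[1]
print(round(b_sex_only, 2), round(b_sex_adj, 2))
```

In this simulation the coefficient for sex shrinks toward zero once anger enters the model, because the simulated effect of sex operates entirely through anger. With real data, a coefficient that persists after other variables are added indicates that those variables do not account for the sex difference.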


Analytic Strategy and Sample 


Our analyses are based on respondents who had complete information on all 
variables in the models (N = 1,050). Means and standard deviations for all of the 
emotion variables appear in appendices A and B. Auxiliary analyses (available) 
indicate that two-thirds (66%) of the respondents reported that they felt angry at 
least one day in the prior week. Consistent with prior research, these analyses also 
indicate that there is no sex difference in the frequency of angry feelings. 

The socio-demographic characteristics of the analysis sample by sex appear in 
Table 1. Bivariate analyses (available) indicate that women are older and have less ed- 
ucation and lower household incomes than men. Similar to other national samples, 
women are less likely than men to be married and employed but are more likely to 
be residing with minor children; they also assess their health as poorer than do men. 


Results 

Sex Differences in the Intensity and Duration of Anger 

We first assess whether the men and women in our sample differ with respect to 
the intensity and duration of anger. Table 2 contains the results of analyses in 


which these dimensions of subjectively experienced anger are regressed — first on 
respondents’ sex and then on their socio-demographic characteristics. 
Table 2: Effects of Sex on the Intensity and Duration of Anger 

[Table body illegible in source. Column groups: Intensity of Anger (Models 1 and 2) and Duration of Anger (Models 3 and 4). Surviving row labels: Black (0, 1)a, Household income, Self-assessed health, Adjusted R².] 

Notes: Numbers shown are unstandardized OLS regression coefficients. Numbers in 
parentheses are t-ratios. N = 1,050. 
aWhites are the reference category. 
*p < .05 **p < .01 ***p < .001 (two-tailed tests) 


Consistent with recent sociological research based on the same data as well as 
psychological and biological studies, women report that the anger they experi- 
enced in the past month was more intense than men’s (Model 1). Moreover, this 
sex difference persists when respondents’ socio-demographic characteristics are 
held constant (Model 2). In addition to gender, age is associated with the intensity 
of anger; younger adults report more intense anger than their older peers. 

Also consistent with previous research based on the same data as well as psychological
and biological studies, women report that the anger they experienced in the
past month lasted longer than did men’s (Model 3), a difference that persists when their
socio-demographic characteristics are controlled (Model 4). Here again, age is the only other
variable associated with the duration of anger. In fact, Table 2 results suggest that other
factors may be responsible for the greater intensity and persistence of women’s anger.¹

Supplementary analyses (available) based on alternative specifications of our 
socio-demographic variables indicate that divorced persons report more intense (but 
not more persistent) anger than their married counterparts, but personal income, 
the number of hours employed in the prior week, the presence of children younger
than 6 in the home and the number of preschool age children in the home are not
associated with the intensity and duration of anger. Marital unhappiness increases 
the intensity (but not the duration) of anger among married respondents. These 
analyses further indicate that the inclusion of these variables in the models does 
not reduce the coefficients for sex to non-significance (see the first two columns
of Appendix C).² Auxiliary interactional analyses (available) indicate that there is
no sex difference in the association between socio-demographic characteristics and 
the intensity and duration of anger as well as no socio-demographic variation in 
anger among women and men. 


The Sex Difference in Depression 


Do individuals who report more intense and persistent anger report more symp- 
toms of depression than persons whose anger is less intense and persistent? Are sex 
differences in the intensity and duration of anger involved in the sex difference in 
depressed affect? The answers to these questions are in Table 3, which contains the 
results of analyses in which symptoms of depression are regressed on respondents’ 
sex (Model 1) and socio-demographic characteristics (Model 2). Model 3 includes 
these variables as well as both the intensity and duration of respondents’ anger. 

Consistent with the extensive body of research on depression, women report 
more symptoms than men (Model 1), which is not explained by their socio- 
demographic characteristics (Model 2). Younger, less educated, non-employed 
and unmarried adults also report more symptoms than their older, more educated, 
employed and married counterparts, which is consistent with prior research as 
well. Persons residing with minor children report neither more nor fewer symptoms
than those not living with minor offspring, and those who enjoy better health 
report fewer symptoms than their less healthy peers. 

Supplementary analyses (available) reveal that divorced persons and individu- 
als living with children less than 6 years old report more depression than the 
married and those not living with preschool age children. However, personal 
income, the number of hours worked in the past week, and the number of 
preschool age children in the home are not associated with symptoms. These 
analyses further reveal that the inclusion of these variables in the models does 
not reduce the coefficients for sex to non-significance (see the last column of 
Appendix C). Additionally, marital unhappiness, which is associated with more
symptoms, reduces the sex difference in depression among the married to non-significance.
This finding suggests that marital unhappiness helps explain the
sex difference in depression among the married. Auxiliary interactional analyses 
(available) indicate that there is no sex difference in the association between any 
of the socio-demographic variables and depression with two exceptions: The 
association between symptoms and being divorced and widowed is greater for 
women than for men; divorced and widowed women also report more symptoms 
than married women. These findings echo recent research on the relationship
between gender, marital status and mental health (Simon 2002; Umberson et al.
1996), which finds that marital status is more closely associated with depression
among women (and substance abuse among men).

Table 3: Effects of Sex, the Intensity and Duration of Anger, and Anger Management
Techniques on Depression

                                                      Symptoms of Depression
                                              Model 1   Model 2   Model 3   Model 4
Female (0, 1)                                 ...       ...       ...       ...
                                              (3.52)    (2.83)    (1.85)    (2.74)
Black (0, 1)a                                           -.09      -.14      -.08
                                                        (.81)     ...       ...
Other (0, 1)a                                           -.24      ...       ...
                                                        (1.49)    (1.74)    ...
Age                                                     ...       ...       ...
                                                        (4.33)    (3.76)    (3.73)
Education                                               ...       ...       ...
                                                        (3.93)    (4.23)    (4.06)
Household income                                        -.00      -.00      .00
                                                        (.27)     (.54)     (.20)
Employed (0, 1)                                         -.01      -.01      .02
                                                        (.13)     (.17)     (.24)
Married (0, 1)                                          -.24**    -.21      -.22**
                                                        (3.12)    (2.81)    (2.87)
Children less than 18 in the home (0, 1)                .08       .08       .08
                                                        (1.02)    (1.02)    (1.02)
Self-assessed health                                    ...       ...       ...
                                                        (7.69)    (7.14)    (7.51)
Intensity of anger                                                ...
                                                                  (3.16)
Duration of anger                                                 ...
                                                                  (5.96)
Talked to the person (0, 1)                                                 ...
                                                                            (2.04)
Talked to someone else (0, 1)                                               .03
                                                                            (.44)
Yelled or hit something (0, 1)                                              ...
                                                                            (1.27)
Tried to rethink the situation (0, 1)                                       -.11
                                                                            (1.39)
Had a drink or took a pill (0, 1)                                           ...
                                                                            (2.04)
Tried to forget it (0, 1)                                                   .02
                                                                            (.19)
Tried to change the situation (0, 1)                                        .01
                                                                            (.09)
Prayed for help from God (0, 1)                                             .07
                                                                            (.81)
Fantasized about a magical solution (0, 1)                                  .45
                                                                            (3.22)
Went out to get some exercise (0, 1)                                        .30
                                                                            (2.95)
Waited for feelings to pass (0, 1)                                          -.01
                                                                            (.07)
Tried to accept the situation (0, 1)                                        ...
                                                                            ...
Left the situation (0, 1)                                                   ...
                                                                            ...
Thought about how to get revenge (0, 1)                                     .15
                                                                            (.97)
Planned how to end the relationship (0, 1)                                  ...
                                                                            ...
Did something else (0, 1)                                                   ...
                                                                            (.54)
Intercept                                     2.14      4.06      ...       4.89
Adjusted R²                                   .01       ...       .18       ...

Notes: Numbers shown are unstandardized OLS regression coefficients. Numbers in
parentheses are t-ratios. N = 1,050. An entry of ... is illegible in the source;
significance markers appear only where they are legible.

a Whites are the reference category.

*p < .05 **p < .01 ***p < .001 (two-tailed tests)
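
The tables in this article report t-ratios in parentheses and flag significance with asterisks at the .05, .01 and .001 levels (two-tailed). The sketch below shows one way a reader could map a reported t-ratio onto those markers. It is an illustration, not the authors' procedure: the function names `two_tailed_p` and `stars` are hypothetical, and the standard normal distribution is assumed as an approximation to the t distribution, which is reasonable for a sample of 1,050.

```python
import math


def two_tailed_p(t_ratio):
    """Two-tailed p-value for a t-ratio under the normal approximation.

    Assumption: with N = 1,050 the t distribution is close to the
    standard normal, so 2 * (1 - Phi(|t|)) = erfc(|t| / sqrt(2)).
    """
    return math.erfc(abs(t_ratio) / math.sqrt(2))


def stars(t_ratio):
    """Return the significance marker used in the table notes."""
    p = two_tailed_p(t_ratio)
    if p < 0.001:
        return "***"
    if p < 0.01:
        return "**"
    if p < 0.05:
        return "*"
    return ""


# t-ratios taken from the Table 3 rows for Married (Model 2),
# Children less than 18 in the home, and Self-assessed health (Model 2).
for t in (3.12, 1.02, 7.69):
    print(t, stars(t) or "n.s.")
```

Under these assumptions, a t-ratio of 3.12 falls below the .01 threshold but not the .001 threshold, which agrees with the two asterisks printed beside the Married coefficient in Model 2.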

The most intriguing results of Table 3 are in Model 3; consistent with our first 
hypothesis based on theoretical and empirical insights about the structural basis of 
emotion, individuals whose anger was more intense and persistent during the previ- 
ous month report significantly more depression in the prior week than those whose 
anger was less intense and persistent. Moreover, the inclusion of both dimensions of 
anger in the model reduces the sex coefficient for depression to non-significance. This 
finding provides support for our second hypothesis; sex differences in the intensity 
and duration of subjectively experienced angry feelings are involved in the sex differ- 
ence in depressed affect. Although our data do not allow us to assess this possibility, 
these findings—which are consistent with prior research based on clinical and non- 
clinical samples (Hagan and Foster 2003; Hochschild 1983, 1989; Newman 2006; 
Picardi et al. 2004; Taylor 1996)—strongly suggest that intense and persistent anger 
has etiological significance for the development of depression in the general popula- 
tion of adults and sex differences therein. Additional interactional analyses (available) 
indicate that the associations between intense and persistent anger and depression 
do not differ for women and men. These results suggest that there would not be a
sex difference in depressed affect if women’s anger were less persistent and intense.³
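
The comparison of Models 2 and 3 follows a nested-regression logic: the sex coefficient is estimated first without and then with the anger measures, and a drop toward zero is read as evidence that anger is involved in the sex difference. The sketch below illustrates that logic on synthetic data only. The variable names, effect sizes and the helper `ols` are assumptions made for illustration; nothing here reproduces the survey data or the authors' estimates, and the data are constructed so that sex affects depression only through anger.

```python
import random


def ols(y, columns):
    """OLS coefficients [intercept, b1, b2, ...] from the normal equations."""
    n = len(y)
    X = [[1.0] + [col[i] for col in columns] for i in range(n)]
    k = len(X[0])
    # Augmented system (X'X | X'y).
    A = [
        [sum(X[r][i] * X[r][j] for r in range(n)) for j in range(k)]
        + [sum(X[r][i] * y[r] for r in range(n))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(k):
        p = max(range(c, k), key=lambda r: abs(A[r][c]))
        A[c], A[p] = A[p], A[c]
        pivot = A[c][c]
        A[c] = [v / pivot for v in A[c]]
        for r in range(k):
            if r != c:
                f = A[r][c]
                A[r] = [vr - f * vc for vr, vc in zip(A[r], A[c])]
    return [A[i][k] for i in range(k)]


random.seed(0)
n = 1050  # same size as the analysis sample; the values are synthetic

# Hypothetical data-generating process: women (1) report more anger,
# and depression depends on anger but not directly on sex.
female = [random.randint(0, 1) for _ in range(n)]
anger = [1.0 * f + random.gauss(0, 1) for f in female]
depression = [0.8 * a + random.gauss(0, 1) for a in anger]

b_total = ols(depression, [female])[1]          # sex only
b_direct = ols(depression, [female, anger])[1]  # sex plus anger

print("sex coefficient without anger:", round(b_total, 2))
print("sex coefficient with anger:   ", round(b_direct, 2))
```

In this constructed example the sex coefficient is sizable when anger is omitted and close to zero once anger enters the model. As with the cross-sectional analyses reported here, such a pattern cannot by itself establish causal direction.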

As a final step in our analysis, we explore whether the ways in which respon- 
dents managed their angry feelings are associated with depression and the extent 
to which sex differences in the use of anger management techniques are involved 
in the sex difference in depressed affect. To this end, Model 4 of Table 3 includes 
the anger management variables. 

We want to point out that logistic regression analyses (shown in Appendix D)
indicate that there is a significant sex difference in only 3 of the 16 anger management
items. Women are more likely than men to manage their angry feelings by
talking with someone else about how they felt and praying to God for help. Men, 
on the other hand, are more likely than women to manage anger by having a drink 
or taking a pill. These findings are similar to Simon and Nath’s (2004) study based 
on the same data as well as psychological research, which finds that females tend to 
express anger verbally while males tend to express these feelings behaviorally (Brody 
and Hall 1993; Kring 2000). The tendency for women to manage their anger with 
prayer also echoes findings from studies of gender differences in coping and social 
support (Thoits 1991). That men are more likely than women to manage their anger 
with mood-altering substances is also consistent with research on the gendered- 
response hypothesis, which finds that males tend to express emotional upset with 
externalizing problems such as substance abuse. Overall, while we cannot say with 
certainty whether men and women use these emotion management techniques to 
suppress angry feelings, there is no evidence that women are less likely than men to 
express these emotions. In fact, the evidence points to the contrary. 

Model 4 indicates that only 3 of the 16 emotion management strategies are 
significantly associated with symptoms of depression. However, these strate- 
gies—which include having a drink or taking a pill, fantasizing about a magical 
solution to the problem, and going out to get some exercise—are associated with 
more rather than fewer symptoms. While these findings are interesting, the cross-sectional
nature of our data prevents us from sorting out whether these anger
management techniques are ineffective for reducing depression or if depressed per- 
sons are more likely than their non-depressed peers to use these techniques. Most 
important for our hypotheses, Model 4 reveals that the inclusion of the emotion 
management items in the model does not reduce the sex coefficient for depressive 
symptoms to non-significance. It thus appears that differences in some of the ways men
and women manage anger are not involved in the sex difference in depressed affect.


Conclusions and Discussion 


Although sociologists have developed a number of hypotheses about the well- 
documented female excess of depression, none consider the importance of sub- 
jectively experienced anger for understanding the prevalence of this mental health 
problem in the general population of adults and sex differences therein. The lack 
of research is surprising because mental health scholars have suggested that the 
female preponderance of depression is due to women’s tendency to suppress rather
than express anger. The idea that suppressed anger underlies women’s high rate 
of depression reflects a social constructionist view of emotion, which posits that 
emotion norms encourage females to turn their anger inward rather than outward 
as do men. However, while this idea is provocative, recent sociological research indicates that
women are not less likely than men to experience and express anger. Ironically,
this research, as well as research on the psychology and biology of emotions, also
reveals that women report more intense and persistent anger than do men.

Drawing on theoretical and empirical insight from the sociology of emotion
about the structural rather than cultural basis of anger and depression, we hypoth- 
esized that intense and persistent angry feelings are associated with more depressive 
symptoms. We also hypothesized that sex differences in the intensity and duration 
of anger are involved in the sex difference in depressed affect. Using data from a na- 
tionally representative sample, we obtained empirical support for our hypotheses. 

Similar to other studies, we found that women report more intense and persis- 
tent anger than do men. However, we were unable to account for sex differences 
in these dimensions of anger despite the inclusion of socio-demographic charac- 
teristics and self-assessed health—for which significant sex differences exist. We 
also found minimal socio-demographic variations in these dimensions of anger 
among women and among men. There are several possible reasons for our inability 
to explain women’s more intense and persistent anger. 

The first possibility is that unmeasured variations in men’s and women’s social 
experiences are responsible for sex differences in these two dimensions of anger. 
An important next step for research is to examine whether women’s intense and 
persistent angry feelings stem from their greater exposure to repeated unjust social 
interactions across a variety of life domains or are due to their greater vulnerability 
to such interactions. A sense of powerlessness among women to alter the injustice 
they regularly experience—whether from the inequitable division of household 
labor, the types of jobs they tend to hold, and/or their unfair wages—may also 
contribute to their more intense and persistent feelings of anger relative to men. 
We believe—as do other feminist-oriented emotions scholars—that women’s in- 
tense and persistent anger reflects the cumulative effect of everyday injustice they 
experience in their social roles and across a variety of life domains (Griffiths 1995). 
Although we could not assess this idea, the use of measures of lifetime stress 
exposure would be a start for investigating this possibility (Turner et al. 1999). 

A second possibility is that women are biologically predisposed to experi- 
ence anger and depression more strongly and for longer duration than are men. 
Although we focused on social factors, we noted that biological research finds sex 
differences in anger and depression that are biologically based. Whether these sex 
differences reflect psychophysiological, neuroendocrinological or other biochemical
factors is still unclear. What is clear is that sociologists of gender, emotion and
mental health cannot ignore these findings if they want to understand the sex 
difference in depressed affect. 

A third possibility is that socio-cultural and structural factors interact with 
biological propensities in complex and not particularly well-understood ways to 
produce sex differences in anger and depression. Biological research indicates that 
social factors influence biological processes (McCarthy and Konkle 2005). At the 
same time, recent sociological research shows that genetic factors affect emotions 
and health-related behaviors (Pescosolido et al. 2008; Schnittker 2008). This and
other research also reveals that social influences on emotion processes are even
greater once genetic factors are held constant. These findings suggest that the
magnitude of the influence of the socio-demographic variables on sex differences 
in the intensity and persistence of anger may have been greater if our models 
included certain biological factors. 

Social scientists have been reluctant to consider the contribution of biological 
factors to sex differences in emotion since biological arguments about women’s
emotions have been used to justify their social disadvantage (see Shields 2007 for a
discussion of this issue as well as the use of socially constructed beliefs about men’s
and women’s emotions to justify gender inequalities). However, Fausto-Sterling 
(1992), a feminist biologist, argues that the failure to acknowledge the complex 
web of interactions between the biological and social environment impedes sci- 
entific understanding of sex and gender. We concur that a multi-disciplinary 
approach—which focuses on the interplay between biological and social fac- 
tors—would expand our knowledge about sex differences in anger and depression. 

We also found that women report more symptoms of depression than do men, 
which is not explained by socio-demographic characteristics or self-assessed health. 
Our most intriguing result, however, is that individuals who report more intense 
and persistent angry feelings report more depression than those whose anger is less 
intense and persistent. Moreover, sex differences in the intensity and persistence 
of anger help explain the sex difference in depressed affect. These findings strongly 
suggest that women’s more intense and persistent angry feelings play an important 
etiological role in their high rate of depression. 

In addition to providing support for our hypotheses, our findings support 
Kemper’s social interaction theory and Collins’ theory about interaction rituals. 
Recall that these theorists argue that socially disadvantaged persons are more likely 
than their advantaged peers to experience situationally based anger, which if intense 
and prolonged, may transform into enduring emotional states or moods such as 
depression. It is, of course, also possible that persons with low emotional energy 
(i.e., depression) select into situations that elicit intense and enduring anger. Given 
the cross-sectional nature of our data, we obviously could not sort out the causal 
direction of this relationship. Longitudinal survey data and data from daily diaries 
would allow researchers to evaluate these different possibilities. Researchers could 
also assess whether intense and persistent anger and depression are concurrent emo- 
tions—a finding that is reported in studies of clinically depressed adults (Newman 
et al. 2006; Picardi et al. 2004; Taylor 1996). While we could not assess whether 
anger is causally linked to depression, our findings are consistent with Hagan and 
Foster's (2003) longitudinal study, which shows that intense and persistent anger in 
adolescence is associated with the development of depression among females and 
substance problems among males in early adulthood. 

As a final step in our analyses, we explored whether sex differences in the techniques
used to manage anger are involved in the sex difference in depressive symptoms.
Consistent with other studies, we found that women tend to talk with others,
while men tend to use mood-altering substances. The latter finding suggests that men
use mood-altering substances in an attempt to dampen, transform and/or suppress
all unpleasant emotions, not only those that are culturally inappropriate for males
such as depression. However, there was no evidence that women are less likely than
men to express their anger as the social constructionist view of emotion suggests. 
There was also no evidence that sex differences in anger management help explain 
the sex difference in depressed affect. These findings belie the assumption long held 
by mental health scholars that the greater prevalence of depression among women 
is due to their tendency to turn their anger inward rather than outward as do men. 

Although we have advanced an alternative hypothesis about the relationship 
between sex and depression, our work builds on and complements existing hypoth- 
eses. While these hypotheses attribute women’s higher rate of depression to their 
greater exposure and vulnerability to stress as well as gendered-responses to stressors, 
researchers have not considered whether women’s more intense and persistent anger 
mediates the relationship between sex and depression. We contend that certain types 
of experiences, particularly those that repeatedly expose individuals to unjust social
interactions, are depressing because they involve unresolved, intense anger.

By identifying dimensions of anger that mediate the relationship between 
sex and depression, our research also extends prior work—which has focused on 
anger as an outcome of social disadvantage. In doing so, our study points to some 
promising new lines of inquiry that would benefit from greater integration of 
insights from the sociologies of emotion and mental health as we have attempted 
to do here (also see Simon 2007). For example, while mental health research shows 
that exposure to stress is associated with higher levels of depression, it is likely that 
some stressors (e.g., a divorce, the inequitable division of household labor, and 
unfair financial compensation for work done outside the home) are associated 
with intense and persistent anger—particularly when the event or situation is 
perceived as unjust. A consideration of the intensity and duration of individuals’ 
anger—as markers of the injustice they experience in everyday social interactions 
across a variety of life domains—may also help explain other well-documented 
group (e.g., socio-economic status) differences in depression. 

We noted that our analyses, which focused on sex differences in anger and de- 
pression, do not capture the power structures that create gender or the enactment 
of gendered-identities in social interaction. Gender scholars urge researchers to 
avoid contributing to the social construction of the dichotomy between men and 
women (Shields 2007). To this end, we examined whether there are variations 
in anger and depression among women and among men who hold different sta- 
tuses. Although we found minimal within-gender variation, research should assess 
whether variations in gender-role ideology and gendered-identities contribute to 
variation in these emotions among women and among men. 

We want to emphasize that our findings are only generalizable to men and
women in the United States. Hopcroft and Bradley (2007) recently documented
cross-cultural variation in the gender gap in depression and showed that it tends
to be larger in societies with more rather than less gender equity. These authors
attribute this seemingly paradoxical finding to a variety of social factors that characterize
high-equity societies such as the United States, including the prevalence
of single parenthood among women (which exposes them to economic hardship)
and difficulties they experience from combining work and family roles. These
authors also suggest that women in high-equity societies expect equality, which
creates a sense of relative deprivation. In light of the lower expectations women
in low-equity societies have for fair social interactions, we suspect that they also
experience less intense and persistent anger than women in gender-equity societies.

While our findings indicate that intense and persistent anger is associated with
symptoms of depression, it is ironic that these feelings seem to be necessary for the
collective redress of large-scale social inequalities. Collective behavior researchers
tell us that the success of social movements often lies in the ability of movement
organizers to instill in disadvantaged individuals a sense of righteous anger, that is, anger
related to positive emotions such as pride rather than negative feelings of depression
(Britt and Heise 2000; Jasper 1988; Taylor and Reitz 2010). It is obviously
beyond the scope of this article to assess the social conditions in which intense and
persistent feelings of anger are transformed into feelings of righteous anger and
pride rather than depression. We hope that social movement scholars will explore
this interesting paradox.


Notes 


1. In an attempt to identify social factors that help account for sex differences in the
intensity and persistence of anger, we examined the role context of the anger-eliciting 
event and target of respondents’ anger. These analyses (available) indicate that women 
are more likely to be angered by an event in the family, while men are more likely 
to be angered by an event at the workplace. Women are also more likely to be angry 
with a family member, whereas men are more likely to be angry with someone at 
work. These analyses further show that persons who were angered by a family event
reported more intense (but not more persistent) anger than those angered by a work 
event. However, they also reveal that sex differences in the role context and target of 
anger do not help account for sex differences in the intensity and duration of anger. 


2. In this table, we only present the sex coefficients for our dependent variables when 
alternative specifications of socio-demographic variables are included in the models. 
For example, the coefficient for sex in the regression of the intensity of anger is .57*** 
when personal rather than household income is in the model. 


3. Supplementary analyses (available) based on the sample of respondents who reported 
that the anger-eliciting event occurred within the previous week (N = 645) rather than 
the previous month produced even stronger results for the associations between the 
intensity and duration of anger and depression than those shown in our tables. In other 
words, the patterns presented in this research offer a conservative test of our hypotheses. 
Additional auxiliary analyses (available) also revealed that while the frequency of anger 
is positively and significantly associated with depressive symptoms, the inclusion of this 
variable in the models does not appreciably reduce the sex coefficient for depression.

Appendix A. Means and Standard Deviations for the Intensity and Duration of
Anger and Depression by Sex

Columns: Total, Males, Females, P Values, Reliability.
[Table body largely illegible in the source. Legible entries: Duration of anger 3.53 (1.48); row labels Sad, Fearful, Worried.]

Notes: Numbers in parentheses are standard deviations.
a These variables were reverse coded; high values indicate infrequent feelings.
*P values, which refer to the difference between men and women, are based on two-tailed
tests. NS = non-significant.

Appendix B. Means and Standard Deviations of Anger Management Techniques by Sex

                                              Total     Males     Females   P Values
N                                             1,050     447       603
Anger Management Techniques
Talked to the Person (0, 1)                   ...       .34       .39       NS
                                              (.48)     (.48)     (.49)
Talked to Someone Else (0, 1)                 ...       ...       .64       ...
                                              (.49)     (.50)     (.48)
Yelled or Hit Something (0, 1)                .08       .09       .08       NS
                                              (.27)     (.28)     (.27)
Tried to Rethink the Situation (0, 1)         ...       .36       ...       NS
                                              (.48)     (.48)     (.47)
Had a Drink or Took a Pill (0, 1)             .06       .08       .05       .05
                                              (.24)     (.27)     (.21)
Tried to Forget It (0, 1)                     .31       .34       .29       .05
                                              (.46)     (.48)     (.45)
Tried to Change the Situation (0, 1)          .27       .26       ...       NS
                                              (.44)     (.44)     (.44)
Prayed for Help from God (0, 1)               .28       .20       .34       ...
                                              (.45)     (.40)     (.47)
Fantasized About a Magical Solution (0, 1)    .07       .07       .07       NS
                                              (.25)     (.25)     (.25)
Went Out to Get Some Exercise (0, 1)          .14       .13       .14       NS
                                              (.35)     (.34)     (.38)
Waited for Feelings to Pass (0, 1)            .29       .29       .29       NS
                                              (.46)     (.46)     (.46)
Tried to Accept the Situation (0, 1)          .46       .45       ...       NS
                                              (.50)     (.50)     (.50)
Left the Situation (0, 1)                     ...       .15       .16       NS
                                              (.36)     (.36)     (.36)
Thought About How to Get Revenge (0, 1)       .06       .06       .07       NS
                                              (.25)     (.23)     (.26)
Planned How to End the Relationship (0, 1)    .10       .08       .11       NS
                                              ...       ...       ...
Did Something Else (0, 1)                     ...       ...       ...       ...

Notes: Numbers in parentheses are standard deviations. An entry of ... is illegible
in the source.

*P values, which refer to the difference between men and women, are based on two-
tailed tests. NS = non-significant.

Appendix C. Sex Differences in the Intensity and Duration of Anger and Depression
With Alternative Specifications of Sociodemographic Variables in the Models

                                                              Female
                                                    Intensity   Duration    Symptoms of
                                                    of Anger    of Anger    Depression
Personal Income a                                   .57***      ...         ...
                                                    (3.62)      (3.31)      (2.91)
Number of Hours Worked b                            ...         ...         ...
                                                    (3.79)      (3.23)      (2.99)
Marital Status c                                    ...         ...         ...
                                                    (3.62)      (2.99)      (2.58)
Children less than 6 in the Home (0, 1) d           ...         ...         ...
                                                    (3.80)      (3.34)      (2.87)
Number of Children less than 6 in the Home (0, 1) d ...         ...         ...
                                                    (3.91)      (3.38)      (2.95)
Marital Unhappiness e                               ...         ...         .08
                                                    (3.16)      (2.13)      (.83)

Notes: Numbers shown are unstandardized OLS regression coefficients. Numbers in
parentheses are t-ratios. All analyses are based on the full sample (N = 1,050) with
the exception of models that include marital unhappiness, which are based on married
respondents (N = 494). An entry of ... is illegible in the source.

a In addition to sex and personal income, these models include race, age, education,
employment status, marital status, children less than 18 in the home, and self-assessed
health.

b In addition to sex and the number of hours worked, these models include race, age,
education, household income, marital status, children less than 18 in the home, and
self-assessed health.

c In addition to sex and dummy variables for marital status (married = 1, divorced = 1,
widowed = 1, never married = 1), these models include race, age, education, household
income, employment status, children less than 18 in the home, and self-assessed health.
The married are the reference category in these analyses.

d In addition to sex and children less than 6 in the home, these models include race, age,
education, household income, employment status, marital status, and self-assessed
health.

e These analyses are restricted to married respondents. In addition to sex and
marital unhappiness, these models include race, age, education, household income,
employment status, children less than 18 in the home, and self-assessed health.
Responses for Marital Unhappiness range from very happy (coded as 1) to not so happy
(coded as 3).

*p < .05 **p < .01 ***p < .001 (two-tailed tests)

The Economics and Sociology of Religious Giving:
Instrumental Rationality or Communal Bonding? 


Jared L. Peifer, Cornell University 


Religious individuals commonly make sizable monetary sacrifices by contributing to 
their congregations. This social action resides in the overlap of religious and economic 
realms of behavior, creating a certain tension. Following a Weberian approach to 
social inquiry, I treat religious giving as social action whereby individuals direct their 
value-rational and instrumental-rational behavior towards others. Using data from 
the American Congregational Giving Survey and the Social Capital Community 
Benchmark Survey, I test hypotheses derived from a rational choice perspective, the 
sense of solidarity one feels, and from the religious meaning of the giver. Rational 
choice hypotheses produce mixed results, the solidarity impact is confirmed, and high 
levels of religiosity have a strong impact on giving. 


Contributing money to one’s house of worship represents a social action that resides 
in the overlap of two distinct realms of social behavior — the religious and economic 
realms. Simply put, religious motives encourage generous giving, while economic 
motives lead one to give less. However, previous research on religious giving has 
generally failed to elucidate this tension. Instead, scholars have separately explored 
the religious motives or rational choice determinants of religious giving. Rational 
choice approaches theorize a calculative and maximizing actor (Finke, Bahr and 
Scheitle 2005; Iannaccone 1990, 1997; Olson and Caddell 1994; Olson and Perl 
2005). Focusing on religious motives, Miller (1999) explores the meaning individu- 
als attach to religious giving by simply asking “Why do you give money to your 
church?” While these approaches are valuable, the existing literature on religious 
giving would benefit from a more “ecumenical approach” (Lechner 2007) that 
appreciates the competing pulls of the religious and economic realms. Religious 
giving is only one example of the more general phenomenon of balancing the 
seemingly non-economic with the economic. Another might be the parent who is 
torn between work and parenting obligations (Blair-Loy 2001) or the difficulty of 
placing a price tag on intimate services, such as hospice care (Zelizer 2005). 

In addition to highlighting the competing individual-level approaches, this 
research also highlights a familiar concept in sociology that has been downplayed 
in the realm of religious giving and the study of economic behavior in general. 
I demonstrate that the sense of community, or level of solidarity, one feels with 
their congregation has an appreciable impact on their level of giving. In fact, 
solidarity and religion have been closely connected in sociology ever since Emile 
Durkheim seemingly conflated the two in The Elementary Forms of Religious Life 
(2001[1912]). Since then, the importance of the sense of community provided 
by a religious group has been emphasized in various ways (Collins 2004; Greeley 
1985; Kanter 1972; Warner 2008). For example, American churches have the 
potential to provide individuals with a sense of community, or a sense of belong- 
ing (Becker 1999). Solidarity has also been shown to induce sacrificial behavior 
directed towards members of the in-group. Social psychological experiments have 
demonstrated how laboratory subjects that successfully perform tasks together are 
more likely to make monetary sacrifices for the newly formed group. This 
sacrificial behavior is attributed to solidarity, or the “emotional buzz” produced from 
the positive social interactions (Lawler 2001). Melding these strains of research 
adds a convincing explanatory factor to the topic of religious giving in particular. 
More generally, the topic of solidarity in relation to giving has the potential to 
generalize to nonprofit organizations, which are perpetually looking for ways to 
induce member contributions. 

Undergirding this analysis is Max Weber's approach to social inquiry, which helps 
parse individual-level motivations (religious and economic) that people bring to 
religious giving. Part and parcel of Weber's approach is the importance of the actor's 
social context (or how social action is directed toward the behavior of others). Such 
an approach yields a sound framework from which to address various types of social 
behavior that dwell in the overlap of competing realms of influence. 

Focusing on giving within the Christian tradition, I use data from the 
American Congregational Giving Study and from the Social Capital Community 
Benchmark Survey of the Saguaro Seminar. I test hypotheses generated from a 
rational choice paradigm, a sense of community viewpoint and finally a religious 
meaning perspective. The findings suggest that the rational choice paradigm pro- 
vides mixed results, and that both the sense of community felt by individuals and 
the religious meaning of individuals induce religious giving. 


Previous Research on Religious Giving 


Every week, religious followers are faced with an economic decision that directly 
affects their bank accounts and relates to their religious life. In most churches 
across America, an offering plate is passed and religious followers must decide 
how much they will contribute. Total giving to religious organizations reached 
approximately $65 billion in 2004 (Ronsvalle and Ronsvalle 2006). For re- 
ligious individuals, this giving can represent a significant proportion of their 
income. The Ronsvalles found that members of 40 selected denominations gave 
an average of 2.6 percent of their income to their church. Looking at the full 
sample of ACGS data, average denominational giving rates are 3 to 9 percent 
of respondents’ household incomes. ACGS respondents give, on average, 5.2 
percent of their income to their church (see Table 1). Such sacrificial giving 
within religious communities deserves a closer look. 

Table 1: Mean Percentage of Income Contributed to Church by Denomination 

Denomination                               Mean %   Standard Deviation %   Frequency 
Assemblies of God                           9.0            9.0               1,430 
Southern Baptist                            6.9            6.4               1,422 
Catholic                                    3.2            7.0               1,681 
Evangelical Lutheran Church in America      3.9            5.1               1,889 
Presbyterian (U.S.A.)                       4.1            4.8               1,861 
Total                                       5.2            6.8               8,283 
n = 8,283 

Source: American Congregational Giving Study, Member Questionnaires (1993) 

There are various economic or rational choice approaches to the study of reli- 
gious giving. One economic approach evokes a household consumption model that 
includes the unique dimension of “afterlife consumption,” where one is oriented 
towards obtaining rewards in the afterlife (Azzi and Ehrenberg 1975). Iannaccone 
(1990:298) notes that the central feature of the household consumption model is 
that, “...families are viewed as quasi-firms engaged in the production of ‘household 
commodities.’ These commodities may be as concrete as meals and laundry or as 
abstract as relaxation and love.” Religious satisfaction is another example of an 
abstract commodity that family members produce from their stock of religious 
human capital (i.e., familiarity with doctrine, rituals and traditions) and other vari- 
ous inputs, such as money and time. Iannaccone (1994) also asserts an alternative 
approach to religious behavior known as the “strictness model,” which posits that 
strictness (or prohibitions on the personal behavior of members, such as no smok- 
ing or alcohol, distinctive dress, etc.) makes churches stronger by weeding out free 
riders. As such, the argument is that stricter churches collect more contributions. 
In a similar vein, Finke, Bahr and Scheitle (2005) find that exclusive congregations 
generate higher levels of giving through congregational beliefs, requirements and 
networks of monitoring. The religious economies approach suggests that congrega- 
tions “compete” with other religious groups in their local geographic area for mem- 
bers (Borgonovi 2008; Finke and Stark 1988; Perl and Olson 2000). This leads to 
the prediction that churches in competitive and pluralistic contexts will have to 
work harder to produce a better religious experience and attract more members. In 
turn, church members are expected to give more generously. 

In addition to these rational choice approaches, sociological research has 
found several other factors that affect giving. Hoge et al. (1996) find high family 
income, high levels of involvement in church, evangelical theology, planning 
one’s giving by the year, and small congregational size positively influence abso- 
lute giving (or the total amount given, ignoring the giver's income). Incidentally, 
Hoge (1994) also identifies a consistent skewness found in absolute-giving 
distributions, in which approximately 20 percent of givers provide about 80 
percent of church funds.¹ The skewness of absolute dollars given has also been used 
to measure the level of free riding within congregations (Olson and Perl 2005). 
In her field study, Miller (1999:38) finds that those who talk about giving “out 
of a sense of responsibility or obligation” to their churches give less than those 
who reported giving out of “love for God, out of obedience to scripture, or to 
meet the needs of others.” Finally, others have suggested that religious giving 
practices are bound up in denominational traditions (Chaves 1999; Hoge 1994; 
Hoge et al. 1996; Hoge and Yang 1994). 

Thus, the empirical literature on religious giving touches on several explana- 
tions. While this research has been instructive, I argue that it has not sufficiently 
highlighted the potential tension between religious and economic perspectives nor 
has it paid enough attention to the role solidarity might play in religious giving. 


Theoretical Background and Hypotheses 


From a Weberian approach to sociology, the social scientist is to be concerned with 
the interpretive explanation of social action. This directs attention to individual 
behavior that is invested with meaning and oriented towards the behavior of other 
social actors (Weber 1978[1922]). That social action is oriented to the behavior 
of others underscores the importance of solidarity. Weber also usefully erects ideal 
types of instrumental rational action and value rational action. Instrumental 
rational action describes an actor that seeks to attain his or her own rationally pursued 
and calculated ends, while value rational action is pursued “independently of its 
prospects of success.” (Weber 1978[1922]:24-25) It is noteworthy that these 
particular motivations might easily lead to conflicting behavioral outcomes. Generally 
speaking, religious values instruct one to give more while economic motives lead 
one to give less. Smith et al. (2008:110) observe a “comfortable guilt” among 
religious givers: “...living with an awareness and feeling of culpability for not giving 
money more generously, but maintaining that at a low enough level of discomfort 
that it was not too disturbing or motivating enough to actually increase giving.” 
This observation speaks to the uneasy tension at this overlap of religion and 
economy. Does a rational choice approach provide a sufficient theoretical framework 
to explain religious giving levels? Does the sense of community an individual feels 
with his or her church affect giving? Do religious beliefs have an impact on giving? 

While previous research on religious giving has frequently addressed absolute 
giving amounts (Forbes and Zampelli 1997; Luidens and Nemath 1994; Lunn, Klay 
and Douglass 2001), fewer studies (Donahue 1994) have considered the percentage 
of one’s income that is contributed to his or her church, which is the approach taken 
here. This operationalization of giving privileges the level of sacrifice individuals 
make and is thus amenable to addressing notions of free riding at the individual level, 
an important concept stemming from the rational choice perspective. 

A trend in the sociology of religion literature features the application of rational 
choice to religious behavior. In contrast to the notion of value rationality, rational 
choice scholars tend to focus on the import of Weber's instrumental rationality. 
Instead of viewing religious behavior as distinctly “irrational” in common parlance, 
they assume it is instrumentally rational. Iannaccone (1995) outlines the rational 
choice project in religious studies by emphasizing how assumptions of maximizing 
behavior help generate predictions for a wide range of religious phenomena. Stark 
and Bainbridge (1987:27) erect a comprehensive theory of religion made up of 
formal hypotheses. A central axiom in their theory is that “humans seek what they 
perceive to be rewards and avoid what they perceive to be costs.” 

Some sociologists of religion have resisted the intrusion of rational choice 
perspectives. For example, Bruce (1999:2) writes a scathing critique of Stark 
and Bainbridge’s rational choice theory in an attempt to “drive a stake through” the 
“vampire’s chest.” For many in the religious studies field, attempts to treat religious 
followers as economic consumers and religious bodies as profit-seeking firms 
are both untenable and troubling (Bryant 2000; Chaves 1995). Much of their 
resistance stems from the attempt of rational choice scholars to explain multiple 
realms of religious behavior from a common economic calculus of maximization. 

Of the various types of religious behavior, the allocation of an individual’s 
scarce dollars directed to a church clearly qualifies as economic behavior (though 
not solely economic behavior). It is therefore appropriate to test the extent to 
which rational choice helps explain religious giving. To be clear, rational choice 
logic shuns the idea of voluntarily giving money away, especially when one 
could free ride. Churches largely rely on the voluntary contributions of their 
members,² and in many congregations, giving is anonymous (Hoge et al. 1996). 
From the perspective of rational choice scholars and many financially struggling 
churches, collecting sufficient contributions creates a free rider dilemma. The 
consistent skewness found in church giving distributions has been argued to 
support this notion (Olson and Perl 2005). The rational choice perspective 
does not necessarily preclude contributing money for services one enjoys. But 
voluntarily giving generously, when one could still enjoy the benefits without 
giving generously, defies rational choice logic. 

Based on these arguments, it is possible to state a number of hypotheses 
related to rational choice theory. Stark and Finke (2000) propose that larger 
churches must have less dense social networks in comparison to smaller churches 
and predict less efficient monitoring of member behavior in large churches. For 
example, because fellow church members in a small congregation can easily 
notice whether or not an individual is present at a worship service, monitoring 
and sanctioning may be motivation enough for an individual not to sleep in 
on Sunday morning. Extending this monitoring prediction regarding size of 
church to religious giving, it would follow that members of smaller churches will 
give more generously (a larger percentage of income) than members of larger churches 
(statements in italics represent a specific empirical test of a broader hypothesis). 

Scheitle and Finke (2008) more directly apply the monitoring mechanism to 
financial contributions, and in doing so assume that giving amounts are readily 
visible in congregations. They measure monitoring through the mean level of 
friendships an individual has within the congregation. However, this assumes that 
financial details are shared among friends, which Wuthnow (1994) finds rarely 
to be the case. Additionally, Hoge et al. (1996) note that in most churches, the 
amount a person or family gives is kept confidential. Thus, I suggest that consider- 
ing whether or not a church practices pledging is a better proxy that lends itself to 
testing the monitoring hypothesis.⁴ Instead of relying on the assumed knowledge 
of friends’ levels of giving and the subsequent sanctioning, being asked by church 
leadership to make a monetary pledge better represents the monitoring concept. 
In pledging churches, individuals are formally asked to commit to giving a certain 
amount for the coming year. To be clear, the actual giving levels are not necessarily 
visible to church leaders. However, a leadership team and clergy are often aware of 
how much people say they will give, thus increasing the visibility of giving which is 
necessary in order for monitoring and subsequent sanctioning to affect individual 
giving. Accordingly, individuals in pledging churches will give more generously than 
individuals in non-pledging churches.⁵ 

In a similar fashion, churches can also deliberately canvass members in order 
to encourage them to support the congregation's operating budget for the year. In 
the ACGS data, canvassing means contacting at least some, if not all, members of 
a church by phone or in person. Such heightened attention to who is planning to 
give is yet another proxy for heightened levels of visibility where monitoring and 
sanctioning would likely operate. This leads to the expectation that individuals 
in churches that canvass members will give more generously than individuals in non- 
canvassing churches. In sum, these rational choice arguments assert that individuals 
give because other religious actors are perceived to be watching. More generally, 


Hypothesis 1.1: Religious giving is driven by the social 
monitoring of other congregational actors. 


As mentioned previously, Iannaccone’s (1994) strictness hypothesis explains 
that strict congregations weed out free riders due to the high costs of membership, 
leaving only the committed who should be generous monetary contributors. 


Hypothesis 1.2: Individuals in stricter churches will give 
more generously. 


Stark and Finke's (2000) clergy-incentive hypothesis posits that state-subsidized 
churches in parts of Europe produce lazy clergy and lazy laity. However, when the 
clergy’s income depends on member contributions the clergy will work harder to 
produce a more inspiring religious experience. This assumes that followers will 
pay more for inspiring religious services. Short of overt state subsidies in the 
United States, some churches do obtain some funding from sources other than 
members’ weekly contributions. For example, churches rent their facilities and 
receive investment income. Assuming the clergy of churches that receive outside 
funding are less motivated to produce an inspiring religious experience, one can 
test the following hypothesis. 


Hypothesis 1.3: Individuals in churches that receive more 
funding from alternative sources will give less generously. 


Following the logic of Stark and Finke's (2000) clergy-incentive hypothesis, 
church members will monitor the decisions and performance of the clergy and 
church leadership. For example, if a member disagrees with a church decision to 
build a new sanctuary, does he or she contribute less? Alternatively, a member that 
really likes the new youth pastor might give more generously. 


Hypothesis 1.4: Individuals who believe their churches are 
using their money wisely will give more generously. 


While previous studies have found volunteers give money more generously than 
non-volunteers (Havens, O’Herlihy and Schervish 2006), some have suggested 
that monetary contributions and time are substitutes (Duncan 1999; Iannaccone 
1990; Iannaccone, Olson and Stark 1995). In a church setting, individuals can 
give of their time (volunteer) or their money. If time volunteered is a substitute 
for monetary contributions, then it would follow that... 


Hypothesis 1.5: Individuals who volunteer more hours to 
the church will give less generously to the church. 


As people decide how much to contribute to their churches, a free rider might 
look around and consider the income (and thus potential donations) of others in 
the church. In a congregation that has many high-income members, an individual 
might feel his or her contribution would not make much of a difference or that 
others will likely give generously. In contrast, amid a low-income congregation, 
that same individual’s contribution will have a larger impact. Olson and Caddell 
(1994) test this hypothesis at the individual level (predicting absolute dollars 
given) and find a significant effect. Namely, individuals give less when they see 
that others have high incomes. However, Olson and Caddell’s data only include 
non-randomly selected congregations from one denomination (United Church 
of Christ). Additionally, the response categories on the questionnaire cap giving 
amounts at “$15 per week or more,” a significant weakness in the dataset. How 
does the following rational choice hypothesis fare in the more extensive data 
utilized in this analysis? 


Hypothesis 1.6: Individuals in churches with a larger 
proportion of wealthy individuals will give less generously. 


Sense of Community 


While both the individual religious and economic motives are important to 
consider, a sociological approach has something additional to say about the 
larger social context of the giving. Swedberg (1998:23) notes, “While economic 
theory... analyzes economic action in general, sociology only analyzes eco- 
nomic action that is also oriented in its meaning to the behavior of others.” An 
economic actor looks inward to consider the tradeoff of giving generously or 
free riding, the religious actor looks upward to meditate on the spiritual sig- 
nificance of giving, and the social actor casts his or her gaze to the surrounding 
congregation. Does he or she feel a sense of belonging and give generously as a 
result? Previous studies on religious giving have paid fair heed to the individual 
characteristics of religious givers. However, this line of research has yet to fully 
elaborate the social context of religious giving.⁶ 

In Elementary Forms of Religious Life, Durkheim (2001[1912]) suggests that re- 
ligious life finds its origins in individuals worshiping representations of the group, 
partially in response to the “collective effervescence” they experience when conven- 
ing together. In Suicide, Durkheim (1951[1897]) suggests that levels of cohesion 
in various religious groups help explain varying rates of suicide. Andrew Greeley 
(1985:144) states all humans are engaged in a “manic ‘quest for community.’ ” In 
Congregations in Conflict, Becker (1999:82) defines a “family church” as fostering 
a “general sense of well being, acceptance, and belonging. The phrases ‘a sense 
of belonging’ and ‘I feel like I really belong there’ recurred in many interviews.” 

There is also evidence that cohesion can cause sacrificial behavior directed to- 
wards the group. Collins (2004:169) predicts “the higher levels of ritual intensity 
in collective activity, the more... individuals will sacrifice to the group.” With 
religious giving being a form of “sacrificing for the group,” Collins’ interaction 
ritual model points us towards the importance of emotions and solidarity. In a 
similar fashion, Lawler’s (2001) affect theory of social exchange also features actors 
that feel an “emotional buzz” from successful interactions. These positive emotions 
are then attributed to the appropriate group, which in turn increases commitment 
to the group in the form of making sacrifices for the group. Addressing broader 
concerns about caring behavior and informal volunteering in a literal neighbor- 
hood setting, Wuthnow (1996:267) highlights how caring behavior comes more 
naturally when the community is strong, “even if it takes time away from the 
business of making ends meet for ourselves and our families. Seeming natural, it 
can happen with less thinking, less introspection, less deliberation.” Following this 
line of thought is the following hypothesis: 


Hypothesis 2.1: The sense of solidarity an individual feels 
from the local congregation increases the individual’s level 
of religious giving to that congregation. 


Unfortunately, the ACGS data do not directly measure the sense of community 
one feels towards his or her church. ACGS respondents are, however, asked to 
react to the following statement, “If I had to change the congregation I attend, 
I would feel a great sense of loss.” While it is unclear how respondents interpret 
“sense of loss,” the term resembles an emotional reaction that may very well tap 
into the sense of community one feels towards their church. It then follows that 
individuals who would feel a greater sense of loss if they were to leave their congregation 
will give more generously. 

While the ACGS data fail to provide a satisfactory measure of perceived 
cohesion, the SCCBS data provide a much better measure of the “sense of 
community” members feel towards their place of worship, rendering the follow- 
ing assertion testable. People who feel a “sense of community” towards their local 
congregations will give more generously. Closely related to the sense of community 
one feels is the amount of trust one has for fellow congregation members. Trust 
is an important element of any cohesive relationship and serves as a satisfactory 
test of the solidarity hypothesis. Individuals who trust more deeply the members 
of their congregations will give more generously. 


Religious Meaning 


Considering the religious context, one would expect many individuals to directly 
include God as a central actor in their giving behavior. If all the possessions of a 
religious follower are perceived as a gift from God, religious giving is merely an 
opportunity to give back to God what already belongs to him (Keister 2008). 
Some evangelical churches teach a “prosperity gospel” which holds “that God will 
reward a generous giver with yet greater financial return in this life.” (Hamilton 
2000:115) Miller (1999:40) cites the following religiously motivated reasons for 
giving: “Because I love God,” “to obey the Bible or God,” “out of thankfulness for 
all I have,” “because everything I have is God's.” 

These examples represent various religious meanings givers might attach to 
their monetary contributions. Just as Weber's proverbial woodchopper may 
be splitting wood to keep the homestead warm, to work off frustration from 
an argument, or to stay fit and healthy, I expect religious givers to attach a 
multiplicity of meaning to their offering plate behavior. Such an interpretive 
approach that privileges individual religious meaning shows strong promise in 
explaining varying rates of giving. As Hoge (1994:102) states, “People strongly 
committed to God and God’s promises will give more money to the church. This 
is axiomatic.” More formally, 


Hypothesis 3.1: Individuals who have strong religious beliefs 
give more generously. 


Two specific ways to empirically test this hypothesis are to assert that individuals 
who claim that their religion is very important to them will give more generously. More 
specific to the Christian tradition, individuals who believe only followers of Jesus 
Christ will be saved will give more generously. 


Data and Methodology 


In order to test most of the preceding hypotheses, I use data from the ACGS, which 
were collected in 1993.⁷ The study randomly selected 125 congregations⁸ from each of five 
different denominations for a total of 625 congregations. The five non-randomly 
selected denominations are Assemblies of God, Southern Baptist, Evangelical Lutheran 
Church in America, Presbyterian U.S.A., and Catholic. The combined data yield 
10,902 respondents with an impressive array of individual-level characteristics for 
each respondent along with a detailed description of each respondent’s congrega- 
tion. Congregational-level data were collected from separate interviews with church 
leaders. Researchers attempted to survey 30 randomly selected members from each 
congregation. About 15 percent of the sample belong to churches with fewer than 
15 respondents. In order to determine if the smaller sample size of respondents in 
some congregations affected the results, I tested an alternative ACGS model using 
only respondents from congregations with more than 14 respondents. These results 
(available upon request) are largely the same as those presented. 

The dependent variable for the ACGS data stems from the proportion of the 
respondent’s household income contributed to his/her congregation through 
regular weekly giving and through capital campaigns. Individuals give on aver- 
age about 64 percent of their total giving dollars to the church through regular 
weekly offerings and about 9 percent to their churches’ capital campaigns for 
a total of 72 percent going directly to their church (see Table 2). The indepen- 
dent variables in this analysis include both congregational-level descriptors and 
individual-level characteristics and straightforwardly operationalize the preceding 
hypotheses (see Appendix 1 for descriptive statistics of the sample of respondents 
included in the main ACGS model). 

The proportion of one’s income given to the church plus a small amount is log 
transformed in order to normalize the dependent variable, allowing for a clearer 
observation of the bulk of the observations that are concentrated on the left side of 
the distribution while pulling in extremely generous givers (see Figure 1). Because 
the log of zero is undefined, a small amount (.001) is added to the dependent 
variable in order to retain in the featured analysis the 126 respondents who give 
nothing. A model that drops the 126 non-givers, an ordinal response model and a 
binary response model are also estimated to verify the robustness of the results 
(available upon request). The binary response model yields puzzling results, 
suggesting that different processes are at play when predicting how much givers 
give as opposed to whether people give at all.⁹ The results from the remaining 
models largely confirm this article’s main results; however, these plausible models 
cast different intervals of confidence around some of the findings.¹⁰ 

Table 2: Mean Giving to Various Causes as a Percentage of Total Giving 

Cause                                       Mean %   Standard Deviation %   Cumulative Mean % 
Regular weekly giving to congregation        63.6          24.5                  63.6 
Capital campaigns for congregation            8.6          13.3                  72.2 
Denominational causes                         7.5          11.3                  79.7 
Religious causes outside of denomination      7.4          12.6                  87.1 
Nonreligious causes                          13.0          16.6                 100.1 
n = 8,301 

Source: American Congregational Giving Study, Member Questionnaires (1993) 
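The log transformation of the dependent variable described above can be illustrated with a minimal sketch. The function name and the three respondents are hypothetical and are not drawn from the ACGS data; only the offset of .001 comes from the text.

```python
import math

# Offset taken from the text: a small amount (.001) is added so that
# respondents who give nothing (share = 0) can still be log transformed.
OFFSET = 0.001


def log_giving_share(contribution, household_income, offset=OFFSET):
    """Return log(share + offset), where share = contribution / income.

    Illustrative helper only; the name and signature are assumptions.
    """
    if household_income <= 0:
        raise ValueError("household_income must be positive")
    share = contribution / household_income
    return math.log(share + offset)


# Hypothetical respondents: (annual contribution, annual household income).
respondents = [(0, 40_000), (1_200, 40_000), (4_500, 45_000)]
transformed = [log_giving_share(c, y) for c, y in respondents]

for (c, y), value in zip(respondents, transformed):
    print(f"share = {c / y:.3f}  ->  log(share + .001) = {value:.3f}")
```

Without the offset, the first respondent (a non-giver) would have to be dropped because the log of zero is undefined; with it, the non-giver is retained at the far left of the transformed distribution.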

The nested structure of the ACGS data, where both individual variables and 
congregational variables are present, makes it appropriate to use a hierarchical 
linear model.¹¹ The goal of this kind of multi-level analysis is to predict the value 
of a dependent variable as a function of predictor variables from more than one 
level (Luke 2004; Snijders and Bosker 1999). The intraclass correlation of the 
intercept-only model is .18 (.26/(.26 + 1.15)), indicating 18 percent of the variance 
of giving rates is explained at the congregational level.¹² Differing from standard 
OLS regression, the hierarchical linear model includes an error term for both the 
individual and group level. This corrects for the potential dependency of residuals 
from individuals within the same congregation, which would be a violation of 
OLS regression assumptions. In order to specify the random intercept model, I 
present an equation at each level. 

Figure 1. Distribution of Logarithm of Proportion of Income Given to Church Plus .001 
(axis label: Proportion of Income Given, Untransformed) 
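The intraclass correlation reported above is simple arithmetic on the two variance components of the intercept-only model. A minimal sketch follows, assuming (as the text implies) that .26 is the congregational-level variance and 1.15 the individual-level variance; the function name is hypothetical.

```python
def intraclass_correlation(between_variance, within_variance):
    """Share of total variance that lies between groups (congregations)."""
    total = between_variance + within_variance
    if total <= 0:
        raise ValueError("total variance must be positive")
    return between_variance / total


# Variance components quoted in the text for the intercept-only model.
icc = intraclass_correlation(between_variance=0.26, within_variance=1.15)
print(f"ICC = {icc:.2f}")  # prints "ICC = 0.18"
```

An intraclass correlation well above zero is the usual justification for a multilevel specification, since it signals that respondents in the same congregation are not independent observations.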


Level 1: $\log(Y_{ij} + .001) = \beta_{0j} + \gamma_{10}x_{1ij} + \cdots + \gamma_{p0}x_{pij} + R_{ij}$


Level 2: $\beta_{0j} = \gamma_{00} + \gamma_{01}z_{1j} + \cdots + \gamma_{0q}z_{qj} + U_{0j}$


In these models, $Y_{ij}$ represents the proportion of income the individual contributes
to church. The parameters $\gamma_{h0}$ ($h = 1, \ldots, p$) are the coefficients for the
individual-level variables $x_{hij}$, while $\gamma_{0h}$ ($h = 1, \ldots, q$) are the
coefficients for the congregational-level variables $z_{hj}$. The subscript $j$ indicates
that a separate level-1 model is being estimated at each level of $j$ level-2 units (or
congregations). Each congregation may therefore have a different average rate of giving,
which is represented by $\beta_{0j}$. The variable $R_{ij}$ is the error term,
representing the individual-level variance in the model.

The level-2 part of the model indicates that the random intercept $\beta_{0j}$ is a
function of level-2 variables $z_{hj}$. The parameter $\gamma_{00}$ represents the value
of the average level-1 dependent variable controlling for level-2 congregational variables.
The parameter $U_{0j}$ represents the error term at the congregational level. Combining
the models leads to the following random intercept hierarchical linear regression
model to be featured in this analysis.


$\log(Y_{ij} + .001) = \gamma_{00} + \gamma_{10}x_{1ij} + \cdots + \gamma_{p0}x_{pij} + \gamma_{01}z_{1j} + \cdots + \gamma_{0q}z_{qj} + U_{0j} + R_{ij}$

The regression parameters, $\gamma_{h0}$ ($h = 1, \ldots, p$) and $\gamma_{0h}$
($h = 1, \ldots, q$), have the same interpretation as unstandardized regression
coefficients for a logarithmic dependent variable, in that a unit increase in $x_h$
(or $z_h$) leads to a $\gamma_{h0}$ (or $\gamma_{0h}$) increase in the
logged proportion of income given to the church, controlling for the remaining
variables in the model. To more easily interpret this, a coefficient of .20 for a variable
would indicate that a unit increase of that variable would lead a tither (who gives
10 percent of income) to give 12.2 percent of his or her income (.10*exp(.20)).
For an individual earning $45,000, that would be an increase of $990 per year
($45,000*.122 - $45,000*.10).
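
The arithmetic behind this interpretation can be checked with a short Python sketch. The values (a tither giving 10 percent, a coefficient of .20, an income of $45,000) follow the illustration above; the helper function is hypothetical and is not part of the original analysis. Like the text's own approximation, it ignores the added constant of .001.

```python
import math


def scaled_giving_rate(base_rate, coefficient, unit_change=1.0):
    """Giving rate implied by a log-outcome coefficient after a unit change.

    Assumes the coefficient acts multiplicatively on the giving rate and
    ignores the .001 offset, as the worked example in the text does.
    """
    return base_rate * math.exp(coefficient * unit_change)


base_rate = 0.10      # a tither gives 10 percent of income
coefficient = 0.20    # illustrative coefficient from the text
income = 45_000       # illustrative annual income in dollars

new_rate = scaled_giving_rate(base_rate, coefficient)

# The text rounds the new rate to .122 before converting to dollars.
extra_dollars_rounded = income * (round(new_rate, 3) - base_rate)
# Without the intermediate rounding the figure is slightly larger.
extra_dollars_exact = income * (new_rate - base_rate)

print(f"new giving rate: {new_rate:.3f}")
print(f"extra giving (rate rounded to .122): ${extra_dollars_rounded:,.0f}")
print(f"extra giving (unrounded rate): ${extra_dollars_exact:,.0f}")
```

With the rate rounded to .122 the sketch reproduces the $990 figure; carrying the unrounded rate through gives roughly $996, so the two differ only by rounding.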

The SCCBS data, which were collected in 2000, are used to test
hypotheses 1.5, 2.1 and 3.1.¹³ The SCCBS study highlights social capital topics of
interest and surveys a random national sample as well as 40 community samples. The total
sample size is 29,233, though the sample used in the main model is smaller due
to survey design. The average community adjusted response rate was 28.9 percent, and the
national sample adjusted response rate was 28.7 percent. The respective cooperation rates,
which represent the share of eligible respondents reached who completed an interview,
are 41.6 and 42.3 percent. Because funding organizations were able to select the sampled
region, communities were not randomly selected. Despite the low response rates
and the non-random community selection, the SCCBS data include respondents
from diverse parts of the country. Furthermore, these data provide the only
operationalization of solidarity within churches that this author is aware of and are
therefore necessary to test the solidarity hypothesis.

Because the SCCBS survey does not solely target religious behavior, it is not 
possible to control for the same number of detailed variables as in the ACGS 
analysis. But similar to the ACGS model, the dependent variable is based on 
the amount one contributes to a religious cause or church as a proportion of 
income. Ample demographic and religious control variables are also included 
(see Appendix 2 for SCCBS descriptive statistics of the sample of respondents 
included in Table 4). Consistent with the ACGS model specification, the SCCBS 
data are analyzed using OLS regression of the log transformation of the proportion 
of income given to religious causes plus .001. In order to maintain similar scope 
conditions as the ACGS model, only SCCBS respondents who are members of 
Christian congregations are included. Results from weighted data, a model that 
drops non-givers, ordinal response and binary response (available upon request) 
largely confirm the hypothesized results presented in Table 4. 
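
As a minimal sketch of how such a dependent variable could be constructed, the Python snippet below computes log(proportion + .001) for a few made-up respondents. The respondent records and field names are hypothetical and are not drawn from the SCCBS or ACGS files. The only element taken from the text is the added constant of .001, which keeps non-givers (a proportion of zero) in the analysis, since the logarithm of zero is undefined.

```python
import math

OFFSET = 0.001  # constant added before logging, as described in the text

# Hypothetical respondents: annual religious giving and household income.
respondents = [
    {"giving": 0.0, "income": 40_000},      # non-giver
    {"giving": 900.0, "income": 45_000},    # gives 2 percent of income
    {"giving": 4_500.0, "income": 45_000},  # tither (10 percent of income)
]


def log_giving_proportion(giving, income, offset=OFFSET):
    """Return log(giving / income + offset)."""
    return math.log(giving / income + offset)


outcomes = [log_giving_proportion(r["giving"], r["income"]) for r in respondents]

for r, y in zip(respondents, outcomes):
    proportion = r["giving"] / r["income"]
    print(f"proportion={proportion:.3f}  logged outcome={y:.2f}")
```

In this sketch a non-giver receives the floor value log(.001) instead of being dropped, which is why models that exclude non-givers are treated as a separate robustness check.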


Results and Discussion 


Findings for the rational choice hypotheses that are able to be tested are mixed 
(see Table 3). Failing to support rational choice hypotheses, the size of church, 
pledging and canvassing status, and church strictness (hypotheses 1.1 and 1.2) do 
not have a statistically significant impact on individual giving after controlling for 
everything else in the model. The non-significant impact of church size suggests 
individuals at larger churches are not more likely to free ride, when free riding is 
measured as a percentage of income given to the church. The non-impact of pledg- 
ing and canvassing also fails to confirm the hypothesis that monitoring within 
churches appreciably induces giving. The strictness variable measures whether or
not the church teaches that Christians should abstain from certain kinds of food,
alcohol and/or tobacco, gambling, or certain kinds of entertainment (such as movies,
nightclubs or dancing).¹⁵ The results indicate that, controlling for everything
else in the model, being in a strict church has no impact on giving.¹⁶

The substitutability of volunteer hours for monetary contributions in the reli- 
gious context (Hypothesis 1.5) is falsified with strong significance in the positive 
direction. Instead of individuals trading time for money, those who volunteer an 
additional hour per week have a 2 percent higher giving rate (exp(.02) - 1). The
SCCBS data confirm that individuals who volunteer at all have 34 percent higher 
Table 3: Multi-Level Regression of Logarithm of Proportion of Income Given to
Church Plus .001


Hypothesis  Level of Measure  Variable Description                     b            z Score
1.1         Congregational    Size of church                           -.00         (-.87)
1.1         Congregational    Pledging church                          -.02         (-.69)
1.1         Congregational    Canvassing church                        .04          (1.22)
1.2         Congregational    Strict church                            .00          (.13)
1.3         Congregational    Alternative funding: investments, fees   -.00*        (-2.08)
1.4         Individual        Budget is appropriate                    .10***       (3.60)
1.4         Individual        Trust leadership                         .08**        (3.17)
1.4         Individual        Enthusiastic about programs              .08**        (3.08)
1.5         Individual        Volunteer hours                          .02***       ([illegible])
1.6         Congregational    Wealthy individuals in church            -.00         (-.47)
2.1         Individual        Individual would feel sense of loss      .06**        (2.64)
3.1         Individual        Religion is very important               [illegible]  (6.94)
3.1         Individual        Followers of Jesus Christ can be saved   [illegible]  (6.01)

Denomination (reference Catholic)
            Congregational    Assemblies of God                        .80***       (10.65)
            Congregational    Southern Baptist                         .48***       (6.93)
            Congregational    Presbyterian U.S.A.                      .19**        (3.08)
            Congregational    ELCA                                     [illegible]  (4.94)
Education (reference high school or less)
            Individual        Some college                             .05          (1.76)
            Individual        College graduate                         [illegible]  (4.44)
            Individual        Graduate                                 [illegible]  (3.10)
Individual Demographics
            Individual        Age                                      .02***       ([illegible])
            Individual        Household income                         -.01***      (-8.89)
            Individual        Household income squared                 .00***       (8.03)
            Individual        Female                                   [illegible]  (-6.16)
            Individual        Married                                  .04          (1.25)
            Individual        Number of children                       -.03*        (-2.35)
Religious Characteristics
            Individual        Skew of church absolute giving           .03          (1.63)
            Congregational    Church encourages the tithe              .00          (.03)
            Congregational    Size of church budget per person         [illegible]  (2.93)
            Congregational    Church mortgage payment                  [illegible]  (3.70)
            Individual        Frequent attendance                      .10***       (24.68)
            Individual        Amount given to non-church causes        .06***       ([illegible])
Intercept                                                              -5.76***     (-63.13)
Variance Component
            Between congregation variance                              .04
            Within congregation variance                               .81
N (Congregations)                                                      588
N (Individuals)                                                        [illegible]

Source: American Congregational Giving Study 1993
*p < .05  **p < .01  ***p < .001

giving rates (exp(.29) - 1) than those who do not volunteer (see Table 4). This finding
confirms previous research on volunteering (Havens, O’Herlihy and Schervish
2006). Failing to confirm Hypothesis 1.6, the percentage of church members with 
incomes of more than $100,000 does not significantly affect individual giving. It 
does not appear to be the case that individuals consider the relative wealth of other 
church members before deciding how much they will contribute. 

In support of rational choice Hypothesis 1.3, individuals give a little less for
each additional percentage point of the church’s total receipts that comes from
investments and fees. Results also support the claim that givers take into account how
their contributions will be used and hold clergy and church leaders accountable for
budget decisions (Hypothesis 1.4). Namely, individuals who think the church
budget is appropriate, trust their leadership and are enthusiastic about their
church program have about 8 to 11 percent higher giving rates (exp(.08) - 1,
exp(.10) - 1). The accrued results from testing hypotheses 1.3 and 1.4 lend support
to the clergy incentive hypotheses. Both the sources of church funds and
congregant response to budget decision makers have an impact on giving.

The solidarity hypothesis (2.1), as tested by the ACGS data, is supported, and
the test benefits from the ability to control for all other variables in the ACGS model.
Individuals who would feel a “sense of loss” if they were to leave their churches have
6 percent higher giving rates (exp(.06) - 1) than those who would not. However, the
ACGS operationalization of solidarity is ambiguous and offers only partial evidence in
favor of the solidarity hypothesis. The SCCBS survey enables a more direct test of
the “solidarity” hypothesis. The SCCBS model (see Table 4) supports the sense of
community hypothesis, in that individuals who report feeling a sense of community
with fellow church members have 34 percent higher giving rates (exp(.29) - 1).
However, the impact of trusting people at church is not significant, indicating trust
may not be a proper operationalization of solidarity.¹⁷ Lastly, both models show
strong support for the religious-meaning hypothesis (Hypothesis 3.1). Controlling
for everything else, individuals who consider themselves more religious and believe
only followers of Jesus Christ can be saved give significantly more.
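
The percentage figures quoted in this section all come from the same conversion, exp(b) - 1. The Python sketch below applies that conversion to the coefficients cited in the text. The dictionary labels are informal shorthand chosen for this illustration, not the variable names used in the ACGS or SCCBS data.

```python
import math


def percent_change(b):
    """Percent change in the giving rate implied by a log-outcome coefficient b."""
    return (math.exp(b) - 1.0) * 100.0


# Coefficients as cited in the text; labels are informal shorthand.
cited_coefficients = {
    "additional volunteer hour per week (ACGS)": 0.02,
    "would feel a sense of loss (ACGS)": 0.06,
    "lower end of the Hypothesis 1.4 range (ACGS)": 0.08,
    "upper end of the Hypothesis 1.4 range (ACGS)": 0.10,
    "volunteers at all / sense of community (SCCBS)": 0.29,
}

changes = {label: percent_change(b) for label, b in cited_coefficients.items()}

for label, change in changes.items():
    print(f"{label}: {change:.1f} percent higher giving rate")
```

Rounded to whole percentages, these values match the 2, 6, 8 to 11, and 34 percent figures reported above. For small coefficients the percent change is close to 100 times the coefficient itself, and the gap widens as the coefficient grows.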

Both models control for variables that need to be considered when look- 
ing at religious giving. Perhaps most importantly, the large magnitude of the 
denomination coefficients emphasizes the importance of denominational affili- 
ation (Chaves 1999). In the ACGS data, individuals in each denomination give 
significantly more than individuals of the Catholic denomination (reference 
category), after controlling for everything else. In the SCCBS data, most non-Catholic
religious denominational categories (Steensland et al. 2000) give more
than Catholics.¹⁸ Women and less-educated individuals tend to give less while
older people tend to give more. Household income has a statistically significant
(though substantively small) curvilinear effect whereby initial increases in
income depress giving, but giving eventually rises as income rises. The ACGS
model finds each child significantly decreases giving rates, while SCCBS yields

1584 © Social Forces 88(4) 


Table 4: Ordinary Least Squares Regression of Logarithm of Proportion of Income
Given to Religious Causes Plus .001


Hypothesis  Variable Description                    b            t Score
1.5         Volunteer at house of worship           .29***       (5.33)
2.1         Sense of community from church          .29***       (3.44)
2.1         Trust people at church                  .05          (1.03)
3.1         Religion is important                   [illegible]  (4.95)

Denomination (reference Catholic)
            Black Protestant                        -.14         (-1.12)
            Evangelical Protestant                  [illegible]  (6.26)
            Mainline Protestant                     [illegible]  (6.42)
            Other Christian                         [illegible]  (5.71)
            Other Protestant                        [illegible]  (4.65)
Education (reference less than high school)
            High school                             [illegible]  (1.25)
            Some college                            [illegible]  (2.70)
            Graduate                                .48***       (3.95)
Age (reference age 18-34)
            Ages 35-49                              [illegible]  (3.30)
            Ages 50-64                              [illegible]  (5.76)
            Ages 65 and older                       .63***       (9.23)
Individual Demographics
            Household income                        [illegible]  (-5.30)
            Household income squared                .00***       (4.37)
            Female                                  -.30***      (-7.81)
            Married                                 .08          (1.82)
            Number of children                      .00          (.26)
Religious Characteristics
            Friend from different religion          .04          (.62)
            Attends every week                      [illegible]  (11.81)
            Participate in church activities        [illegible]  (8.40)
            Participate in religious organization   [illegible]  (5.52)
Intercept                                           -5.56***     (-34.76)
R squared                                           [illegible]
N = 3,932


Source: Social Capital Community Benchmark Survey of the Saguaro Seminar 2000
*p < .05  **p < .01  ***p < .001


no significant impact of children. In the ACGS data, the skewness of absolute 
giving in a congregation (as determined by the absolute giving amounts of 
the congregation's sample respondents) has no impact on individual giving. 
Individuals who attend churches that encourage the tithe give no more or less 
than those in churches that do not. However, individuals at churches with a larg- 
er total budget per person give more. Individuals who attend churches paying 
off mortgages also contribute more. Individuals who give more to non-church 
causes also give more generously to their churches. Both datasets control for 
frequency of the individual’s church attendance, yielding large positive effects on 
giving. The SCCBS data also yield significant positive effects for participating in
additional church activities and being involved in other religious organizations. 


Conclusion 


Religious giving is an example of more general social behavior that resides in the 
overlap of conflicting realms of motivation. Weber’s definition of social action 
nicely addresses these types of social behavior. The ideal types of value-rational 
and instrumental-rational action help sort out the individual-level determinants 
of religious giving. Hypotheses were generated and tested for each ideal type. 
Additionally, the dependent variable in this analysis (proportion of income given 
to church) provides a fitting operationalization of individual-level free riding. 
Instead of observing absolute dollars given, the proportion of income measures 
the giver’s relative sacrifice. Unfortunately, the data at hand are not sufficient to 
test the religious competition theory (Borgonovi 2008; Finke and Stark 1988; 
Perl and Olson 2000), an important concept stemming from the rational choice 
paradigm. That said, the rational choice hypotheses that are able to be tested re- 
ceive mixed support. Namely, individuals do give more when they approve of the 
leadership’s use of the funds and when the church is more dependent on member 
contributions. These findings lend support to the clergy incentive theory (Stark 
and Finke 2000). However, the hypothesized effects of church size, social monitoring,
strictness, volunteering and the income levels of other church members do not find
support. The strong impact of religiosity confirms previous findings that strong
religious beliefs do in fact induce giving. 

I have also demonstrated that the perceived solidarity an individual feels toward 
his or her congregation induces more generous giving. One possible mechanism of 
this process would be that actors feel an emotional buzz from convening together on 
a weekly basis, which increases commitment to the group (Lawler 2001). A benefit
of this hypothesized emotional link between solidarity and religious giving is that 
it bypasses the monitoring (Scheitle and Finke 2008) and strictness (Iannaccone
1994) mechanisms, which fail to garner empirical support in this analysis. In many 
circumstances, religious giving is not a realm of behavior where monitoring is likely 
to operate. Namely, the religious giving environment is often intentionally kept 
anonymous (Hoge et al. 1996) and financial behavior is a sensitive topic that most 
Americans keep to themselves (Wuthnow 1994). Perhaps monitoring and strictness 
induce commitment only to the extent that they create solidarity. The scope of inter- 
est then changes from isolating specific variants of strictness or cases of monitoring 
to identifying sources of solidarity. It seems likely that these various mechanisms are 
not at odds with one another, but interact in complementary ways. 

These findings also show potential generalizability to a broader range of 
non-profit organizations. For example, nonprofit membership associations 
(Tschirhart 2006) that rely on charitable donations from members are similar 
to churches. Nonprofits are perpetually looking for ways to induce generous 
contributions, and the religious giving example provides a case study in which
individuals are confronted with the opportunity to give as often as they attend 
a worship service. While the religious context of giving to the church is impor- 
tant to this case, the impact of solidarity on charitable giving likely operates 
independent of the religious characteristics. 

In sum, religious giving resides in a complex arena of social behavior, wherein in- 
dividual religious and economic motives can potentially pull in opposing directions. 
While giving amounts are not especially visible to the surrounding congregation, 
God is watching, and the religious meaning of givers takes this into consideration. 
At the same time, allocating scarce dollars is an economic decision so the rational 
choice considerations must also be accounted for. And as this research demon- 
strates, the individual’s perceived sense of solidarity also shapes the giver’s behavior. 
In the spirit of an “ecumenical approach to explaining religion” (Lechner 2007), 
each theoretical approach should be considered when addressing this common 
religious and economic experience for a significant portion of Americans. 


Notes 


1. Iannaccone (1997) explains this skewness as an inevitable mathematical outcome of
a low correlation between income and rates of giving.


2. The terms “cohesion,” “solidarity” and “sense of community” are used interchangeably 
for the remainder of this article. 


3. Three quarters of all congregations receive at least 90 percent of their income from 
individual contributions, and 80 percent of all money collected by churches comes 
from individual contributions (Chaves 2004). 


4. An ideal measure to test whether social monitoring and sanctioning explains religious 
giving would be to ask religious individuals whether they know how much others in 
their congregation are giving and whether they think others know how much they 
contribute. Only in churches with individuals who know others’ giving amounts 
would monitoring and sanctioning among members be a possible explanation. 


5. An alternative explanation might be that church leaders institute pledging in 
response to low rates of giving from congregants. This reverse causality possibility 
cannot be avoided with the cross-sectional data at hand and also applies to 
hypotheses 1.1, 1.3 and 1.5. 


6. This is not intended to shortchange the important work that has been done on the
importance of one’s denomination (Chaves 1999; Hoge 1994; Hoge et al. 1996;
Hoge and Yang 1994) or congregation (Schwadel 2002). I also acknowledge that
Iannaccone’s (1994) strictness hypothesis takes on an aspect of the giver’s social
context, though this paradigm still relies on the cost-benefit analysis of individuals.


7. The data were downloaded from the Association of Religion Data Archives,
www.TheARDA.com, and were collected by Dean Hoge, Charles Zech, Patrick McNamara
and Michael Donahue.


8. Eighty-five percent of congregations approached for the study cooperated. Investigators 
selected the remaining 15 percent of the sample by finding congregations that were 
similar to the congregations that refused to participate. 


9. The multi-level logistic regression was performed in Stata using the xtmelogit
command. Observing the binary response model predicting the outcome of giving
nothing vs. giving something produces puzzling results. The results show no impact
of alternative funding, trusting leadership, “feeling a sense of loss,” religiosity or
denominational affiliation. Previous research on charitable giving has
differentiated between the decision to give anything and the level of giving among
givers (James III and Sharpe 2007; Schervish and Havens 1995).


10. The discrepancies that do arise from comparing different plausible models are as
follows. There is no impact of being enthusiastic about church programs in the model
that drops non-givers. Also, there is no impact of trusting leadership and “feeling a
sense of loss” in the ordinal response model.


11. Stata’s xtmixed command was used to predict the continuous response model and
does not allow weights to be attached.


12. Anticipating how the intra-class correlation changes as the model expands, adding
only the denominational affiliation dummy variables to the intercept only model
produced an ICC of .06 (.07/(.07 + 1.15)). Therefore, about 67 percent
((.18 - .06)/.18) of the congregational level variance is explained away by adding the
denominational affiliation variables. The ICC of the full model (Table 3) is .05
(.04/(.04 + .81)), verifying that 72 percent ((.18 - .05)/.18) of the congregational
variance is explained by the full model. The multi-level structure nicely elucidates that
most of the congregational-level impact on giving stems from the denominational
affiliation of the congregation and that nearly three-quarters of the congregational-level
variance is explained by the featured model.


13. These SCCBS data were downloaded from the Association of Religion Data Archives
at www.TheARDA.com. Because the multi-level regression model performed on the
ACGS data cannot use weighted data, the SCCBS results in Table 4 use unweighted
data. However, when weights are included the model results are similar to the findings
reported in Table 4. The SCCBS weights reproduce the population distribution of
gender, age, education, race and ethnicity and are weighted by the number of adults
and telephone lines in the house.


14. This measure of giving differs slightly from the ACGS measure in that the SCCBS
dependent variable includes giving to all religious causes. Table 2 from the ACGS data
shows that individuals give 15 percent of their total giving to non-church yet still religious
causes (8 percent to denominational causes and 7 percent to religious causes outside the
denomination). Therefore, we could roughly estimate that the dependent variable for the
SCCBS data (giving to religious causes) accounts for an additional 15 percent of one’s
total giving amount compared with the dependent variable used for the ACGS data.


15. Missing from this operationalization of strictness is affirmation of distinctive dress,
physical isolation, endogamy or even overt expectations to contribute generously.


16. It is worth noting that ACGS statistics (available upon request) show that the
Assemblies of God and Southern Baptist denominations are the strictest of the
five featured in this analysis. Because these two denominational dummy variables
have large positive effects on giving (see Table 3), one might assert that strictness
associated with these denominations lends support for the strictness hypothesis.
However, the more proximate measures of strictness employed in this analysis
provide a satisfactory and falsifiable test of strictness. Chaves (1999) emphasizes
the importance of denominational affiliation in religious giving without reducing
the effect of denomination to varying levels of strictness. Indeed, a whole
range of structural and doctrinal factors are encompassed in the denominational
category. After controlling for denominational affiliation, the results fail to confirm
the prediction that strict churches are associated with higher contribution rates.


17. When church members and non-church members are included in the analysis (nearly 
doubling the total sample size), the impact of trusting people at church is statistically 
significant. To be clear, only church members were included in this analysis so that 
its results are comparable to the ACGS data. 


18. Jews and respondents identified as “other religion” were omitted from the analysis in 
order to match the ACGS data which only look at Christian congregations. However, 
when these religious affiliations are included in the analysis, the “sense of community” 
impact is still statistically significant, suggesting generalizability to these religious 
contexts as well. 
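

The intra-class correlation arithmetic reported in note 12 can be reproduced with the short Python sketch below. The variance components (.07 and 1.15; .04 and .81) and the baseline ICC of .18 are the figures quoted in that note; the function names are illustrative only and do not correspond to any Stata output.

```python
def icc(between_variance, within_variance):
    """Intra-class correlation: share of total variance that lies between groups."""
    return between_variance / (between_variance + within_variance)


def share_explained(baseline_icc, model_icc):
    """Proportional reduction in the ICC relative to a baseline model."""
    return (baseline_icc - model_icc) / baseline_icc


BASELINE_ICC = 0.18  # intercept-only ICC, as quoted in note 12

denomination_icc = icc(0.07, 1.15)   # model with denomination dummies only
full_model_icc = icc(0.04, 0.81)     # full model reported in Table 3

# The note rounds each ICC to two decimals before computing the shares.
denomination_share = share_explained(BASELINE_ICC, round(denomination_icc, 2))
full_model_share = share_explained(BASELINE_ICC, round(full_model_icc, 2))

print(f"ICC, denomination only: {denomination_icc:.3f}")
print(f"ICC, full model: {full_model_icc:.3f}")
print(f"share explained, denomination only: {denomination_share:.0%}")
print(f"share explained, full model: {full_model_share:.0%}")
```

Following the note's own rounding, the sketch returns shares of about 67 and 72 percent, so most of the reduction in between-congregation variance is already achieved by the denominational variables alone.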
Can Cultural Worldviews Influence Network Composition?


Stephen Vaisey, University of California, Berkeley 
Omar Lizardo, University of Notre Dame 


Most sociological research assumes that social network composition shapes individual 
beliefs. Network theory and research has not adequately considered that internalized 
cultural worldviews might affect network composition. Drawing on a synthetic, dual- 
process theory of culture and two waves of nationally-representative panel data, this 
article shows that worldviews are strong predictors of changes in network composition 
among U.S. youth. These effects are robust to the influence of other structural factors, 
including prior network composition and behavioral homophily. By contrast, there 
is little evidence that networks play a strong proximate role in shaping worldviews. 
This suggests that internalized cultural dispositions play an important role in shaping
the interpersonal environment and that the dynamic link between culture and social 
structure needs to be reconsidered. 





Introduction 


With the recent consolidation of the “cultural turn” in American sociology (Jacobs 
and Spillman 2005; Friedland and Mohr 2003) the perennial tension between 
culture and social structure has returned with renewed urgency. While this prob- 
lem has always been a core sociological concern, what is distinctive about the new 
cultural sociology is its bold attempt to move beyond the well-worn ways in which 
the answer to this issue was formulated in the classical tradition. The new cultural 
sociology rejects the simple privileging of structure over culture and examines the 
ways in which “social being” and “social morphology” are shaped and transformed 
by mobilization of meaningful collective symbols and the deployment of practical 
patterns of appreciation and evaluation in interaction (Alexander and Smith 2003; 
Spillman 2002; Eliasoph and Lichterman 2003). 

This rejection, however, does not in any way constitute a return to a “Parsonian” 
account in which cultural patterns—embodied in ideas, symbols and values—are 
the primary drivers of social structure. In the new cultural sociology, culture and 
social structure must be kept empirically and analytically distinct (Archer 1996; 
see also Kroeber and Parsons 1958). The issue of the structural determination of 
culture and the cultural formation of structure cannot be solved by appealing to 
a single, general formula (Sewell 2005); they must be reopened and subject to 
empirical specification and theoretical reformulation in concrete social contexts. 

In this article, we aim to contribute to this line of inquiry by investigating
the relationship between cultural worldviews (here treated as broad orientations
toward moral evaluation) and social network composition. We follow network
theory (Breiger 2004; Wellman 1988) and recent developments in cultural
sociology in thinking of social structure as “influential and persistent sets of
interrelationships among actors.” (Spillman 1995:132)¹ We also move beyond
the current emphasis on culture as publicly available texts, objects and artifacts
(Geertz 1973), and conceive of it as involving broad orientations toward
“meanings and values” as well (Spillman 1995:131). We focus on what Griswold
(2004) has called “implicit culture” in distinction to the “explicit culture” em- 
bodied in expressive objects. This view of culture is comparable to the broad 
orientations toward the social and physical worlds—usually referred to as “cos- 
mologies” —which have been a core concern in anthropology (Douglas 1978; 
Sahlins 1994; see also Davis and Robinson 2006). 

The case of the relationship between cultural worldviews and social networks 
is an important test case for the analytical approach afforded by the new cultural 
sociology. Network theory, at least in the United States, generally relies on a 
one-sided “morphological determinism” that emphasizes persistent patterns of 
relations as the “efficient” causes of ephemeral cultural contents (Emirbayer and 
Goodwin 1994). Due to this strong presupposition, the effect of culture on network
relations has received surprisingly little empirical consideration.

We advance a model that allows analyzing the interplay of culture and network 
relations. We begin with a critical review of the models of “culture in action” 
found in contemporary network analysis. We then go on to identify plausible 
mechanisms that may link the culture and network domains. As a test of the 
value of our framework, we use panel data from the National Study of Youth 
and Religion to examine changes in ego network composition over a three-year 
period among a sample of over 2000 U.S. adolescents. Specifically, we examine 
predictors of changes in the number of respondents’ strong ties who use controlled 
substances, who get in trouble in school, and who volunteer in the community. 
We focus on these outcomes because much research has focused on the importance 
of peer networks in promoting both deviance (Akers, Krohn, Lanza-Kaduce and 
Radosevich 1979) and volunteering behaviors (McAdam 1986). We find strong 
evidence that moral orientations play a decisive role in shaping future network 
composition, net of previous network composition and other structural controls. 
We conclude by discussing the implications of the model and empirical findings 
for cultural sociology and network theory. 


Culture and Social Structure in Network Theory 


Since its rise to prominence as a full-fledged “paradigm shift” in the late 1970s 
and early 1980s (Wellman and Berkowitz 1988), network analysis has had a 
combative—and in our view problematic—relationship to culture (Gould 2003). 
Wellman (1988:33), for example, in a programmatic statement of the network 
approach, dismisses explanation by way of shared cultural and normative orientations
as “psychological and not sociological in character... [S]tructural analysts
[instead] first seek explanations in the regularities of how people and collectivities
actually behave rather than in the regularities of their beliefs about how they ought
to behave... [A]ccounting for individual motives is a job better left to psychologists.”
Instead of concerning themselves with cultural motives, network analysts
“...interpret behavior in terms of structural constraints on activity.” Instead of
“assuming that inner forces... impel actors in voluntaristic, sometimes teleological,
behavior toward desired goals... [network analysts] treat norms as effects of structural
location, not causes.” (italics added) Gould (2003:258) perceptively notes that
“network analysts have something in common... with materialists who see people 
as servants of historical forces that they did not themselves create.” 

This collapse of culture into social structure is an attempt to resolve the struc- 
ture-culture problem by fiat. By assuming a priori that we can never observe an 
effect of culture on networks, the structuralist approach offers a metatheoretical 
solution to what should be an empirical question. It also leaves an important 
question unanswered: where do social structures come from? If the weakness of 
functionalist sociology was to posit norms and values as the “unmoved mover” 
of social action (Swidler 1986:274), it is fair to say that most network-oriented 
research does the same with relational ties and structural position. It is therefore 
not surprising that we find a rethinking of the relationship between culture and 
social structure in more recent considerations of network theory. 


Cultural Critiques of Network Theory 


Following Archer (1996), Emirbayer and Goodwin (1994) reject the assump- 
tion that networks determine culture and the notion that culture and structure 
are analytically inseparable (i.e., “mutually constituted”). Instead, they suggest 
that researchers should attempt to model the over-time interplay of normative 
commitments and network structures. “Network analysis,” they conclude, “... 
neglects or inadequately conceptualizes the crucial dimension of subjective 
meaning and motivation—including the normative commitments of actors—and 
thereby fails to show exactly how it is that intentional, creative human action 
serves in part to constitute those very social networks that so powerfully constrain 
actors.” (1413, italics added)

Inspired by S.F. Nadel’s observation that “a satisfactory approach to social 
structure requires simultaneous attention to both cultural and relational aspects 
of role-related behavior,” DiMaggio (1993:119) raises a related objection he calls 

“Nadel’s paradox.” Although he interprets the rejection of cultural meanings as a 
“purely tactical” response to practical problems of measurement and operationaliza- 
tion, DiMaggio also argues that early network analysis (e.g., White, Boorman and 
Breiger 1976) carried with it the more radical assumption that it could provide 
a “self-sufficient means of analyzing social systems without recourse to meaning
systems and culturally embedded categories.” (DiMaggio 1993:121)

This assumption has two implications. First, network theory’s lack of explicit
attention to cultural meanings leaves an action-theoretic vacuum that is usually filled
by a rational-actor model. Purely structuralist models, DiMaggio (1993:122) argues,
“treat network membership and the access of each member to relations as fixed, [and]
actors as having a keen sense of their utility functions.” Such models generalize a 
single motivation—instrumental gain—across contexts and ignore the potential role 
of culture in shaping the character of social relations (see also Smith 2003).

Second, network theory relies on assumptions about the culture-action link that 
are rarely made explicit. DiMaggio argues that this implicit framework “bears an 
affinity to a perspective on action... that is distinct both from rational-choice ap- 
proaches and the Parsonsian [sic] tradition,” regarding culture “as a mystifying system 
of post hoc accounts used by actors to normalize or explain interaction rather than 
to shape it.” Thus the “cultural vocabulary” that corresponds to network analysis is 
primarily one of “typifications, scripts, systems of classifications and accounts.” (121)

The tendency to define “motives” as “vocabularies” produced in local interac- 
tions continues even in network theory’s “cultural turn,” as exemplified by its 
leading exponent, Harrison White. For White (1997:64), talk of persons, internal 
motivations and value-orientations is not relevant, because “interactions, ties in 
socio-cultural context, are coming to supplant persons as building blocks—and a
person may come to be seen as a knotted vortex among social networks.” Thus, 
even though White reconsiders the importance of culture in social networks, his 
structuralist sociology continues to conceive of culture as a post hoc commentary 
on ongoing projects of control in concrete social contexts (White 1992; Emirbayer 
and Goodwin 1994). Identities and meaningful narratives are assumed to be the 
product of these enacted control projects rather than the cause. 

Although both DiMaggio’s (1993) and White’s (1992) arguments are con- 
cerned with network structure, the same considerations apply to research on net- 
work composition (McPherson, Smith-Lovin and Cook 2001). On this view, any 
observed association between disposition and behavior is spurious, because both 
are transmitted through localized network connections (Mark 1998; McPherson 
2004). Both Wellman (1988) and Erickson (1982) rely on this “transmission 
plus influence” model to explain cultural homogeneity in informal “cliques.” 
According to this model “people acquire norms, as they do other pieces of
information, through network ties.” (Wellman 1988:35) Any homogeneity of attitudes
and normative orientations among actors is thus understood to be the result of
interactional influence processes that postdate the origins of friendship ties rather
than the product of cultural similarities that predate those ties. Yet this alternative
hypothesis cannot be dismissed so easily. If cultural orientations bias friendship 
selection in favor of culturally-compatible alters, then we will observe cultural 
homogeneity even in the presence of weak or negligible “transmission” effects.
Dismissing this possibility a priori precludes any examination of cultural effects
on the composition of social networks. 
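
The selection argument above can be illustrated with a small simulation. This is a sketch under stated assumptions, not part of the original analysis: the function name, the two tie probabilities and the binary "worldview" attribute are all hypothetical. Worldviews are fixed for every actor, so there is no transmission or influence by construction; ties simply form more readily between actors who share a worldview.

```python
import random


def simulate_selection_only(n=200, p_same=0.30, p_diff=0.10, seed=0):
    """Return the share of ties that connect actors with the same worldview.

    Worldviews are assigned once and never change, so any homogeneity in
    the resulting network comes from biased tie formation alone.
    All parameter values are illustrative assumptions.
    """
    rng = random.Random(seed)
    worldview = [i % 2 for i in range(n)]  # two equally sized groups
    same_ties = 0
    diff_ties = 0
    for i in range(n):
        for j in range(i + 1, n):
            alike = worldview[i] == worldview[j]
            p_tie = p_same if alike else p_diff
            if rng.random() < p_tie:
                if alike:
                    same_ties += 1
                else:
                    diff_ties += 1
    return same_ties / (same_ties + diff_ties)


# Biased selection, zero influence: most ties join like-minded actors.
biased_share = simulate_selection_only()
# Unbiased selection: roughly half of all ties join like-minded actors.
unbiased_share = simulate_selection_only(p_same=0.10, p_diff=0.10)
print(round(biased_share, 2), round(unbiased_share, 2))
```

A cross-sectional snapshot of the first network would show strong ego-alter similarity even though no actor ever changed an attitude, which is why similarity alone cannot be read as evidence of influence.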

Given the confidence with which network theorists advance such claims, one
might expect to find a long series of empirical studies substantiating the power 
of network influence. Though there are many cross-sectional studies that infer 
influence on the basis of positive associations between alter’s and ego’s attitudes 
and orientations (see Haynie 2001 for a recent example), the bulk of them do not 
properly consider that ego and alter may have already displayed substantial cultur- 
al similarity prior to forming a tie (Lazarsfeld and Merton 1954). The few studies 
that do take prior ego-alter cultural similarity into account find that “conformity” 
effects are greatly deflated, sometimes by more than half (Cohen 1978, 1983; 
Kandel 1978). Thus, as Cohen (1978:238) concluded in a much neglected study, 
“while uniformity pressures clearly do operate, they are by no means the sole source
of uniformity in cliques... it is rather likely that pair of relations also achieve much 
of their homogeneity through homophilic choice.” Kandel (1978:435) arrives 
empirically at a similar conclusion: “[p]rior homophily on a variety of behaviors 
and attitudes is a determinant of interpersonal attraction... friendships that will 
dissolve are less similar than friendships in the process of formation.” 

These studies are consistent with Schneider's (1987) influential
attraction-selection-attrition model of organizational behavior, which regards persons’ prior
dispositions as a key variable in determining their choice of social context. Organizational
environments are shaped by the prior choices of compatible persons to join the orga- 
nization and by the higher probability that persons with incompatible dispositions 
will select themselves out. More broadly, recent critiques of structuralist network 
approaches to contact selection and relationship formation in the management lit- 
erature show that such ignored individual-level “psychological” factors as personality 
play a key role in explaining systematic differences in network composition, range 
and local structure (Mehra, Kilduff and Brass 2001). 

Given the thin empirical record of the primary mechanism through which 
social network theorists envision “network effects” on culture (but see McFarland 
and Pals 2005), we argue that the question of reciprocal effects between cultural 
orientations and network composition is much more open than most contemporary
network theory leads us to believe.




Culture, Cognition and the Formation of Social Ties 
A Dual-Process Model of Culture 


Instead of thinking of culture exclusively as disembodied scripts or “vocabularies 
of motive” used to make sense of one’s social position and behavior, we conceive 
of it as operating primarily through embodied and durable schemes of perception, 
appreciation and action. In this model, culture is not primarily linguistic, not 
primarily conscious and not primarily discursive. Instead, it is embodied, tacit, 
largely unconscious and composed of fast and “hot” cognitive-affective complexes 
that play a key role in everyday decisions (Bourdieu 1984; Ignatow 2007; Lizardo 
2004; Vaisey 2008, 2009). 

Our rejection of the “culture as post-hoc rationalization” model (DiMaggio
1993) in favor of this new model is based primarily on recent advances in cognitive 
science. The culture-as-rationalization view is based on implausible assumptions 
about where the cognitive links between culture, motives and conduct should be 
sought. Most scholars have assumed that, to be motivational, culture would have 
to be “stored” consciously as ideologies, propositions, preferences, values or stories 
and have assumed that if cultural contents functioned as network-independent 
motives, we would find consistency between the beliefs people articulate and 
their conduct (see e.g., Collins 1981; Swidler 1986). In the absence of consistent 
evidence for either articulacy or consistency, most network theorists conclude 
that cultural systems of meaning and value cannot motivate action. Instead, as we 
demonstrated above, they regard both action and culture as the spurious product 
of prior network structures. 

There are, however, important reasons to rethink the assumption that cultural 
motives must be conscious (Vaisey 2009). The cognitive and behavioral sciences 
recognize two analytically distinct levels of mental functioning—the conscious and 
the automatic (Nisbett and Wilson 1977; Shiffrin and Schneider 1977; Sloman 
1996; Chaiken and Trope 1999; Smith and DeCoster 2000; Evans 2008). We 
now know that most judgments — including social ones (Chaiken and Trope 1999; 
Smith and DeCoster 2000) —occur below the level of conscious awareness. In the 
social sciences, this insight is central to most versions of practice theory (Bourdieu 
1990; Schatzki, Knorr-Cetina and Savigny 2001). 

Most salient to our argument, there is now strong evidence that moral intu- 
itions vary systematically between nations (see Haidt and Joseph 2004) and be- 
tween subcultures in a given country (Haidt 2007; Haidt and Graham 2007). To 
analyze such differences, Richard Shweder and his colleagues outlined a typol- 
ogy of three analytically distinct areas of moral concern: the ethic of autonomy, 
concerned with harm, rights and fairness; the ethic of community, concerned pri- 
marily with role obligations; and the ethic of divinity, concerned with maintain- 
ing spiritual purity and upholding the “natural” or divine order (Shweder 2003). 
This typology is quite similar to the individualist, community-centered, and 
religious typology developed by Bellah and colleagues (1985). Though Shweder 
(like Bellah) talks about these ethics mainly in terms of “discourse,” psychologi- 
cal research has associated Shweder’s “big three” with different intuitions about 
right and wrong in experimental and observational research (Haidt 2001). These 
studies have demonstrated important differences in moral intuitions between 
(for example) India, Brazil, and the United States, as well as between political 
liberals and conservatives in the United States (Haidt and Graham 2007). This 
research shows that culture can shape intuitions and emotions as well as provid- 
ing useful cultural repertoires or “vocabularies of motive.” 

Taken together, these findings point to a dual-process model of culture that 
regards actors as influenced by deeply internalized cultural schemas and as capable 
of deliberating and rationalizing when required by the demands of social
interaction (see Vaisey 2008, 2009). Though this model is simple, it differs significantly
from conventional models that define culture as epiphenomenal a priori or that 
set a cognitively misguided standard for detecting cultural influence. 

The dual-process model combines Bourdieu’s definition of “culture as embod- 
ied schemes,” which has become central to cognitive anthropology (e.g., Strauss 
and Quinn 1997; Bloch 1991), with the “analytical dualism” of recent cultural 
theory, which regards culture and social structure as analytically distinct. Some 
proponents of this “anti-conflationist” approach regard culture as mainly “objecti- 
fied” and transindividual (e.g., Archer 1996), but this is not a necessary postulate 
of analytical dualism. Our version of the dualist approach distinguishes “subjective 
structures” that have been internalized from the “objective structures” in which 
“encultured” individuals come to be embedded, including objectifiable patterns of 
social relationships (see Bourdieu 1996). Our model thus enables an investigation 
of how individuals come to acquire durable dispositions, and how those disposi- 
tions go on to shape their objective structural environments. 


Cultural Worldviews and the Formation of Social Networks 


Most longitudinal studies reveal surprisingly high levels of instability in personal 
networks, with approximately 30 to 60 percent of the personal network experiencing 
turnover in a year’s time (see Bidart and Degenne 2005; Suitor and Keeton 1997; see 
also Wellman et al. 1997 for a synoptic review of recent longitudinal network stud- 
ies). Suitor and Keeton (1997), using a sample of 56 married women, report that 52 
percent of the alters named by respondents in the original interview were not named 
again at either the 1-year or 10-year follow up. Morgan et al. (1997:14) conclude that
there is “a considerable degree of instability in who is present in the network over 
time. In particular, the average overlap of 55% means that just over half of those 
who were named in one interview or the other appeared in both.” These studies 
show that old ties are constantly being deleted and new ones formed throughout the 
life course (Suitor, Wellman and Morgan 1997) and that friendship selection and 
network “fine tuning” represent important—if ill-understood—processes through 
which people shape their relational micro-environment. 
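
The overlap figure quoted above reads as the share of all alters named at either interview who were named at both. A minimal sketch of that calculation follows, assuming this set-based reading; the function name and the example names are hypothetical and not drawn from any of the cited studies.

```python
def network_overlap(wave1_alters, wave2_alters):
    """Share of alters named in either wave who were named in both waves."""
    first = set(wave1_alters)
    second = set(wave2_alters)
    named_in_either = first | second
    if not named_in_either:
        return 0.0  # no alters named at all; define overlap as zero
    return len(first & second) / len(named_in_either)


# Hypothetical respondent: two of six distinct alters appear in both waves.
wave1 = ["Ana", "Ben", "Cal", "Dee"]
wave2 = ["Ben", "Cal", "Eve", "Fay"]
print(network_overlap(wave1, wave2))
```

On this measure a respondent can keep half of the original alters and still show an overlap well below 50 percent, because newly added ties enlarge the denominator.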

We suggest that cultural worldviews —defined as implicit schemes of percep- 
tion—serve to process information regarding a potential alter’s compatibility 
with ego. In Mary Douglas’ (1978) terms, we regard the friendship choice 
process as inherently “culturally biased.” Egos with different worldviews likely 
have different cognitive and affective reactions to a given alter’s self-presentation 
and interactional style. These reactions inform ego’s cognitive-emotive “choice” 
to pursue (or accept) a relational tie to this alter. This “choice” is nothing like 
the rational and deliberative process outlined in some theories of network for- 
mation (e.g., Bunt, Marijtje, Duijn and Snijders 1999). Nor is it a post hoc 
rationalization of a bond resulting from two people being “thrown together” 
structurally. Instead, we see this as a “fast” and “hot” cognitive-affective process
that determines whether two persons “click” as friends. 

DiMaggio (1993:125-27; emphases added) recognizes the importance of this 
process during the early stages of network formation, suggesting that alters’ “at- 
tributes (including biographical information)... enable... [ego] to make snap judg- 
ments about their suitability.” We suggest that these “snap judgments” reflect, inter 
alia, moral intuitions. DiMaggio makes clear the moral dimension of friendship 
choice when he argues that ego’s judgments are the product of “the ways in which 
[alters] express themselves—the values, norms, or attitudes that can be read into 
their utterances and more elusive aspects of style.” DiMaggio concludes that it is 
possible to “conceive of the assessment of sympathy as a cultural matching process, 
in which actors rely subliminally on verbal and nonverbal cues to estimate cultural 
overlap, experienced as comfort/discomfort and confidence/unease.” 

Though DiMaggio highlights the largely instrumental pursuit of ties in his 
own illustration, there is no reason that this framework cannot be extended to 
relational ties in general. As Bourdieu (1984:241, italics added) remarks, “The 
social sense is guided by the system of mutually reinforcing and infinitely redun- 
dant signs of which each body is the bearer—clothing, pronunciation, bearing, 
posture, manners—and which, unconsciously registered, are the basis of ‘antipathies’
or ‘sympathies’; the seemingly most immediate ‘elective affinities’ are always partly
based on the unconscious deciphering of expressive features.” 


Empirical Implications 


Following the dual-process model of culture, we view the observable cues con- 
nected to the tastes and expressive styles of others as inputs to culturally-biased 
cognitive structures. The degree of fit between social input and cultural bias 
produces emotion-laden judgments of “liking” or “not-liking,” thus serving as 
a crucial determinant of tie “decay” (Burt 2000) or persistence. We hypothesize 
that cultural worldviews will have considerable effects on the over-time composition 
of the focal actor's local relational environment and that these effects will not prove 
to be the spurious byproduct of the personal network's pre-existing characteris- 
tics. Specifically, we expect that ties to alters whose behaviors, tastes or expressive 
styles are incompatible with the focal actor’s moral-cultural worldview will tend 
to decay more quickly than ties with others who exhibit compatible cues, even 
after accounting for prior network composition and other attributes indicative 
of social position. We argue that tacit routines of contact-selection enable actors 
to “seek out social relationships that are compatible with their preferred [cul- 
tural] bias and shun those relations in which they feel less at home.” (Thompson, 
Ellis and Wildavsky 1990:266) This implies that moral-cultural worldviews will 
have an effect on the overall composition of the “core” personal network, increasing
the prevalence of compatible contacts and decreasing the relative frequency of
culturally incompatible alters over time.


Data


The data for the empirical portion of this research come from the 2002 and 2005
waves of the National Study of Youth and Religion. The first wave of the NSYR is 
a random sample of 3,290 English- and Spanish-speaking teenagers (ages 13-17) 
in the United States. The Wave 2 survey was an attempt to contact all of the teen 
respondents from Wave 1, who were then 15 to 20 years of age. The retention rate 
between waves was about 78 percent. We employ an appropriate weight for all analy- 
ses. Because of missing data, the N for each analysis varies between 2,100 and 2,140. 

These surveys are particularly well-suited to the question of cultural influences 
on network composition for three reasons. First, they contain information on 
several measures of network composition at two time points. Second, they provide 
two measures of moral-cultural worldview that have been successfully employed 
in previous research (Baker 2005; Hunter 2000). Third, they contain a wide 
variety of socioeconomic, demographic and behavioral data that might also be 
predictive of changes in network composition, thus allowing adequate controls 
for confounding factors. 


Measures and Models 
Dependent Variables 


Network Composition: The outcomes of interest here are the number of each
respondent’s friends who (1) “do drugs or drink a lot of alcohol,” (2) “have been in
trouble in school for fighting, cheating, or skipping classes,” and (3) “regularly do
volunteer work or community service.” This value was generated by asking the 
respondent to name up to five “closest friends,” and asking which of those friends 
engaged in the activities in question. This data was collected in 2005, during Wave 2. 


Independent Variables 


Lagged Dependent Variables 
These measures reflect the composition of the respondent's social network at Wave 1 
of the survey (2002). They were constructed identically to the measures at Wave 2. 


Moral-Cultural Worldview 

The NSYR provides two ways of measuring moral-cultural worldview. The first was
derived from Hunter’s (2000) operationalization of the expressivist-utilitarian-civic-biblical
typology offered in Habits of the Heart (1985). This question asks, “If you
were unsure of what was right or wrong in a particular situation, how would you
decide what to do? Would you... (1) Do what would make you feel happy (expressive
individualist [chosen by 26 percent]), (2) Do what would help you to get ahead
(utilitarian individualist [11 percent]), (3) Follow the advice of a teacher, parent, or
other adult you respect (community-centered [42 percent]), or (4) Do what you
think God or scripture tells you is right (theistic [21 percent])?” A single item is not
ideal, of course, and the available responses do not cover all conceivable possibilities.
It is nonetheless well-matched to the Bellah typology and was explicitly designed to 
measure this typology in youth samples. Hunter (2000) found that this question 
predicted other survey responses in a large number of domains with a high degree 
of discrimination. We therefore hypothesize that these differences will also lead 
to differences in social networks. Because drug use and “getting into trouble” are 
deviant acts at this age, we expect that the more traditional community and theistic 
worldviews (compared to the individualist ones) will lead to networks with fewer 
alters engaged in such behaviors. On the other hand, because both community and 
theistic moralities are more “collectivist” (Hitlin 2003; Oishi, Schimmack, Diener 
and Suh 1998) we should expect that they will be positively associated with acquir- 
ing or maintaining strong ties to regular community volunteers. 

The second measure of moral worldview is an indicator of moral absolutism 
and relativism that closely resembles that used on the World Values Survey (Baker 
2005). This question asks, “Some people say that morals are relative, that there 
are no definite rights and wrongs for everybody. Do you agree or disagree?” The 
respondent could agree (1) or disagree (0). Because research in this area shows that 
relativists are more likely to accept practices that have been traditionally frowned 
upon (such as abortion or premarital sex; see Baker 2005) and less likely to derive 
a sense of community from others (Ryle and Robinson 2006), we hypothesize 
that relativists will develop or maintain more network ties to controlled substance 
users and maintain fewer network ties to regular volunteers than will absolutists.


Other Controls 

In addition to the usual demographic controls, we include several other factors to
attempt to rule out potential spurious associations. To exclude the possibility that
adult network connections are driving the community worldview response, we
control for closeness to parents and adult network closure around the respondent.
To rule out the possibility that religious networks or institutions are driving the
theistic response, we control for religious service attendance, religious tradition
and the number of close friends who share the respondent's religious beliefs. These
socio-demographic and network controls should account for the meso- and micro-level
context that is usually held to shape beliefs, networks and action in the sociology
of culture (DiMaggio 1997; Lichterman 1996; Swidler 2001). We also control
for relevant behavior at Wave 1 (frequency of using marijuana and getting drunk
for drug using networks; cheating, cutting class or getting suspended for “trouble”
networks; and volunteering for volunteer networks) to account for behavioral
homophily and to isolate the effects of the moral worldviews as much as possible.


Models 


Because the outcomes here are counts, we use Poisson regression to estimate 
changes in network composition. Because we are interested in determining the 
effect of a durable state (moral worldview) on change in network composition
between survey waves, we use a lagged dependent variable model rather than a 
fixed-effects specification (Halaby 2004). Though the longitudinal design ensures 
causal order, the models estimated here assume that the specified lag (two-and- 
a-half to three years) is an appropriate one for detecting the hypothesized causal 
relationships. Although this assumption may not be tenable in all cases, it seems 
plausible for investigating cultural effects on network change in this population. 
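
As a sketch of the specification described above, one plausible form of the lagged dependent variable Poisson model for respondent i is given below. The notation is illustrative and is not taken from the original; in particular, entering the lagged count linearly is an assumption.

```latex
\log \mathbb{E}\left[ Y_{i,2} \mid Y_{i,1}, W_{i,1}, X_{i,1} \right]
  = \alpha + \rho \, Y_{i,1}
  + \boldsymbol{\beta}^{\top} W_{i,1}
  + \boldsymbol{\gamma}^{\top} X_{i,1}
```

Here Y_{i,t} is the number of close friends with a given characteristic at wave t, W_{i,1} collects the Wave 1 worldview indicators and X_{i,1} the Wave 1 controls. A fixed-effects specification removes every predictor that does not vary within respondents over time, so it could not estimate the effect of a durable state such as worldview.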

In addition to estimating coefficients and test statistics, we compare effect sizes 
using percent changes in the estimated count. For dichotomous variables, we 
define effect size as the associated percentage change in estimated count. For the 
other variables, we use the percentage change associated with a one-SD change 
in the predictor (Long 1997). These values will allow comparing the relative net 
strength of each predictor. 
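
The effect sizes described above follow from the log link of the Poisson model: a coefficient b multiplies the expected count by exp(b). A minimal sketch of both calculations is shown here; the function names are illustrative, and the example coefficient and standard deviation are made-up values, not estimates from Table 1.

```python
import math


def percent_change_binary(b):
    """Percent change in the expected count when a 0/1 predictor switches on."""
    return 100.0 * (math.exp(b) - 1.0)


def percent_change_per_sd(b, sd):
    """Percent change in the expected count for a one-SD increase in a predictor."""
    return 100.0 * (math.exp(b * sd) - 1.0)


# Hypothetical values for illustration only.
example_b = -0.25
example_sd = 1.4
print(percent_change_binary(example_b))
print(percent_change_per_sd(example_b, example_sd))
```

Because both quantities are on the same percentage scale, a dichotomous worldview indicator can be compared directly with a one-SD shift in a continuous control.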


Results 
Do Cultural Worldviews Affect Social Network Composition? 


Table 1 shows the results of regression models predicting Wave 2 network com- 
position. The table is largely self-explanatory, but there are several results worth 
highlighting. Unsurprisingly, network composition in 2002 is a good predictor 
of network composition in 2005. We cannot know whether or not the same indi- 
viduals are in the network, but there is a tendency for network characteristics to re- 
main the same. We should not overstate the durability of such networks, however; 
the polychoric correlations (not shown in the table) between Wave 1 and Wave 2 
network composition are .47 for controlled substance use, .29 for trouble and .33 
for volunteering. Although there are clear continuities in network characteristics, 
there is plenty of variation between survey waves. The multivariate models also 
show that prior behavior (e.g., getting drunk, cheating, volunteering) is a good 
predictor of future network composition, indicating an expected tendency toward 
behavioral homophily (consistent with Pearson et al.’s [2006] results). People tend 
to develop ties with individuals who are similar to themselves in terms of habits 
and lifestyle (McPherson, Smith-Lovin and Cook 2001). 

In addition to continuity and homophily, only one other factor was significant 
in all three models: moral worldview. Compared to the community-oriented 
reference category (“follow the advice of an adult”), young people choosing the 
expressivist response (“makes me happy”) are more likely to develop or maintain
strong ties with heavy substance users and “troublemakers” and—along with utili- 
tarian (“get ahead”) respondents—less likely to develop or maintain strong ties 
with regular volunteers. Again relative to the community-centered respondents, 
youth invoking a theistic moral worldview are less likely to develop or maintain 
friendships with controlled-substance users and “troublemakers.” It should be 
noted that these are not simply “religious” youth in the usual sense; religious at- 
tendance and conservative religious tradition play no independent role here. The 
21 percent of teenagers who identify with the theistic worldview are not identical
to the teens one might otherwise call “religious” based on behavioral measures. If 
some form of “social control” is at work here, it is a social control over and above 
adult connections, parent monitoring, religious tradition, networks of religious 
friends and church attendance. These youth seem to be those that have inter- 
nalized schemes of perception and action consistent with theistic culture in the 
United States (Hunter 2000). 

The “effect size” column for each model allows comparing the relative strength 
of the significant predictors. These comparisons seem surprising from a stan- 
dard network approach, but are consistent with the arguments of Emirbayer and 
Goodwin (1994) and the findings of Lizardo (2006). Moral worldview has a 
larger (in some cases much larger) net effect on Wave 2 network composition 
than a change of one (or several) standard deviations in either Wave 1 network 
composition or previous behavior. In this population at least, worldview predicts 
changes in the content of social networks much better than race, sex, household 
income, parents’ education and a host of other factors. To better illustrate these 
results, Figure 1 shows predicted counts by worldview for Wave 2 network char- 
acteristics holding all other covariates at their means. Taken together, these results 
are certainly noteworthy—a single, relatively abstract moral question, asked nearly 
three years earlier, is more predictive of future friendship networks than either 
prior networks, behavioral homophily or demographic characteristics. Such a possibility
is seldom, if ever, acknowledged in the literature on networks and culture
(Erickson 1988; Wellman 1988).


Additional Analysis: Do Social Networks Affect Cultural Worldviews? 


We have argued that cultural sociology and network theory need to move beyond 
deterministic and “conflationist” views of the culture-structure relationship to 
investigate empirically their dynamic relationship. Though the emphasis here 
is exploring the role that moral-cultural worldviews play in shaping network 
change, we also briefly consider the other side of the process: how networks influ- 
ence changes in worldview. From Wave 1 to Wave 2, 54 percent of respondents 
changed their response to the moral worldview question, though only 33 percent 
made the substantively larger change (in either direction) between an individualist 
response and either of the non-individualist responses. With only one item, these 
figures undoubtedly reflect substantial measurement error, but we should be able 
to detect significant patterns nonetheless. 

Table 2 shows the results of a multinomial logistic regression model predicting 
moral worldview at Wave 2 using all of the Wave 1 predictors used in Table 1. 
This model treats the community-centered option as the reference category at both
waves. Because the statistical significance of a single coefficient is dependent on its
difference from the reference category only, we present Wald χ² tests for the joint
significance of each predictor across all three equations. We do not take the time
here to interpret each coefficient, but note only those findings that are directly
relevant to our theoretical concerns.

Figure 1. Predicted Number of Strong Ties at Wave 2 by Worldview
[Bar chart: predicted number of strong ties by type of strong tie (Drugs, Trouble, Volunteer), with one bar per worldview (Expressivist, Utilitarian, Community, Theistic).]

The best predictor of worldview at Wave 2 is worldview at Wave 1. Given our 
claim about the durability of cultural worldviews, this is perhaps not surprising. The
strong tie variables are not significant predictors of these changes, again suggesting 
the relative robustness of cognitive structures to proximate peer influence. There is, 
however, evidence of some structural effects on worldview change. Adult network 
closure around the respondent is associated with a lower probability of either indi- 
vidualist response. A dense network of adult support seems to dispose teenagers to 
adopting or maintaining a less individualistic worldview. A similar pattern emerges 
for parental monitoring, with greater supervision negatively associated with choos- 
ing “what makes me happy.” Church attendance is also a good predictor of the the- 
istic response. Overall we see a fair degree of stability in cultural worldviews with a 
limited, though readily interpretable, amount of structural influence over three years. 


Table 2: Wave 1 Predictors of Wave 2 Worldview

                                   Expressive     Utilitarian    Theistic
                                   b              b              b
Prior Worldview
  Expressivist                     [illegible]    .876***        .025
  Utilitarian                      [illegible]    .848***        .205
  Theistic                         [illegible]    [illegible]    1.242 [stars illegible]
  Community (reference)
Moral relativist                   [illegible]    -.120          -.391**
Prior Network Composition
  Using strong ties                -.009          -.081          .015
  In trouble strong ties           -.006          -.172          -.067
  Volunteering strong ties         .049           -.020          .082
Prior Behavior
  Frequency of drunkenness         .015           [illegible]    .072
  Frequency of smoking pot         .161           [illegible]    .031
  Frequency of cheating            [illegible]    .015           [illegible]
  Frequency of cutting class       .057           .160           .141
  Ever suspended                   -.203          .204           -.243
  Frequency of volunteering        .045           -.025          -.121
Other Network Characteristics
  Same religion strong ties        .018           -.040          .058
  Adult network closure            -.162**        -.168*         .061
  Dating                           .035           .034           -.150
  Number of strong ties            [illegible]    .270*          -.028
Family Characteristics
  Parent monitoring                [illegible]    -.117          [illegible]
  Two-parent biological family     -.165          .029           -.047
  Closeness to parents             -.009          -.034          .024
Religious Participation
  Church attendance                -.054          -.028          [illegible]
  Conservative Protestant          -.307          .182           .536
  Black Protestant                 -.255          -.356          -.239
  Mainline Protestant              .350           .411           [illegible]
  Catholic                         -.120          [illegible]    -.440
  Jewish                           -.177          -1.478         .671
  Mormon (LDS)                     -.448          -.025          .682
  Other religion                   .466           .654           1.065*
  Indeterminate religion           .488           1.164*         .632
  No Religion (reference)
Demographic and Other Characteristics
  Gender (female = 1)              -.049          -.286          -.444**
  Age (W1)                         -.065          -.161*         -.001
  Black                            -.411          .499           .247
  Other race                       -.287          .395           .113
  Southern residence               -.100          -.173          -.015
  GPA                              -.028          .064           .197
  Household income                 .000           -.027          -.016
  Parent education                 -.019          .066           -.007
Constant                           -.260          -1.343         -2.442*

N                                  2,098
χ²                                 650.31

Note: "Community" response is W2 reference category. *p < .05 **p < .01 ***p < .001 (two-tailed).
[The Wald test column and the cells marked [illegible] could not be recovered from the source.]


Discussion and Conclusion


Our main objective has been to go beyond cultural critiques of network theory
and offer a simple dynamic model of cultural worldviews and network formation
and change. To illustrate the value of this model, we investigated some of its key
implications empirically. Consistent with a “dual process” model, our results
strongly suggest that cultural beliefs play an independent causal role in the
transformation of the local interactional environment.

These results have important implications for conceptualizing the dynamic 
interplay of culture and social structure. With few exceptions, contemporary 
network theory relies on the metaphor of networks as durable “conduits” or “pipes” 
through which ephemeral cultural contents are transmitted (Borgatti 2005). We 
have argued that this conduit model, while applicable in some circumstances, 
cannot be the basis of a general theory of networks and culture for two reasons: 
first, personal networks are hardly sturdy “structures” but are in a constant state 
of flux. Moreover, tastes, dispositions, and worldviews may be more durable and 
stable than network ties (Lizardo 2006). Second, because individuals are con- 
stantly dropping old ties and adding new ones, processes of contact selection and 
relationship formation are continually in progress. Current network theory does 
not have—and given its metatheoretical presuppositions, cannot provide—a coherent
account of self-selection into relational environments (Cohen 1983; Schneider
1987). Nor can it account for how actors select among possible candidates for 
entry into their intimate circle of relationships. By contrast, we have shown that 
implicit cultural worldviews seem to play a crucial role in this contact selection 
process,⁷ and we have argued that embodied culture serves as a kind of relational
filter that shapes the evolution of personal networks.⁸

It is important to be clear about our definition of culture. While we agree—following
Geertz (1973)—that a lot of culture is public and symbolic, we do not view
culture solely through the “text” metaphor. We see such an approach as abdicating
the primary responsibility—and promise—of a cultural sociology, which is to
understand how culture shapes action. Though our approach is consistent with 
assertions about the “autonomy” of culture, we do not understand this autonomy 
as “autonomy from” some other social domain (e.g., the economy, the polity) but
rather as a species of causal autonomy. That is, we see culture as autonomous 
insofar as it figures as a causally relevant, “locally independent” factor that shapes 
patterns of conduct and affiliation. 

We in no way deny the causal autonomy of the objective relational environ- 
ment. In fact, we see the causal autonomy of objective structures as implied by 
the causal autonomy of culture. By structuring one’s exposure to systematic pat- 
terns of experience, objective environments have their own effects on dispositions, 
sometimes serving to cement them, but at other times—as when environments 
are suddenly transformed—leading to their transformation. It is certainly not our
intention to endorse a new “culturalism” in which cultural patterns are the only 
efficacious element and social structure is an epiphenomenon. Instead, the main 
analytical payoff of distinguishing culture from social structure and taking each 
seriously is to enable investigations of the causal interplay of each domain over
time. By investigating the conditions in which culture is decisive for shaping social
structure, our approach will necessarily also lead to a better understanding of those 
contexts within which “network effects” on culture are most likely to be observed. 

The network paradigm has an “elective affinity” with a problematic definition 
of culture as primarily linguistic and discursive, composed of ad hoc rationaliza- 
tions of the effects of social structure. We agree that post hoc rationalizations for 
action are ubiquitous, but we do not agree that culture is reducible to justifications
(Boltanski and Thévenot 2006), “accounts” (Scott and Lyman 1968) or
“vocabularies of motive” (Mills 1940). “Sense-making” discourse is the tip of the
cultural iceberg; submerged in less-accessible regions is a realm of implicit culture 
encoded in non-linguistic form, stored in procedural and not declarative memory, 
with the potential to shape our perceptions and reactions to events, including a 
fairly mundane type of event—friendship selection. 

More generally, empirical research in a wide variety of domains is needed to es- 
tablish the conditions under which cultural matching processes tend to determine 
the fate of newly-formed relational connections and under which conditions this 
process is dampened and overridden by “structural” factors. This is entirely within 
the spirit of recent calls to go beyond the view of social networks as “conduits” and 
to focus on “mechanisms of relation formation in conversational settings” (Mische
2003:259). Although most sociologists now agree both that “networks matter” and
that “culture matters,” we think that it is time to analyze the cultural and cognitive 
processes related to the formation of social networks and vice versa. The ultimate 
goal of research in this area will be to develop powerful models of the dynamic 
interplay between culture, cognition and social structure. We hope this study has 
taken a useful step in that direction. 


Notes 


1. There are other contemporary theoretical traditions of “network theory.” Most 
influential have been recent attempts by Castells (1997) and Urry (2002) to cast 
the network metaphor as a broad way to characterize recent cultural and structural 
transformations in late-modern societies. While we recognize this tradition of 
theorizing as important, our main concern in this article is with the more empirically 
oriented line of research associated with “American” network analysis (see Freeman 
2004 for an enlightening history). 


2. White (1997:59-60) recognizes and welcomes this development, noting that “[T]he
recent resurgence of ‘rational actor’ models is not inconsistent with my view since
there is little that is specifically human about rational actors. Without persons being
presupposed as actors, attention necessarily shifts to confluences of observable
processes-in-relations. Out of these emerge actors and locations of social action.”


3. The wording of this question was slightly modified from Hunter’s original by
Christian Smith, the principal investigator of the NSYR. 


4. Despite Baker's (2005) assertion that the Hunter question and the relativism question are
interchangeable measures of absolutism, the two are only moderately associated (y = .172).


5. See the Appendix for descriptive statistics and definitions for all variables.


6. One reviewer pointed out that the between-wave change in moral worldview was 
“closer to completely random than completely stable.” This is true. If the wave 1 and 
wave 2 responses were completely random, we would expect 29.7% to give the same 
specific answer and 52.6% to give the same category of answer (individualist or
non-individualist) at both waves. The actual consistency—46% and 66%, respectively—is
better than this, but only covers 24% and 32% of the distance between total randomness 
and total stability. This does not undermine our argument, however, for two reasons: first, 
there is an unknown quantity of real change in the data, which is to be expected with 
respondents of this age; second, measurement error—especially with a single subjective 
item—makes approaching perfect consistency unrealistic under any conditions. 


7. This is of course not to deny that individuals sometimes consciously seek out new friends
and sometimes even consciously consider the pros and cons of certain relational 
partners. However, our main argument is that this type of friendship selection process 
is the exception and not the rule (DiMaggio 1993). Furthermore, the dual-process 
model predicts that even when individuals are processing information in the more 
traditional “rule-based” form (Sloman 1996), the biases induced by incorporated 
moral and cognitive dispositions will still lead to biases in decision-making (e.g., 
partial consideration of the evidence, differential weighting of attributes, etc.). 


8. In this respect, “homophily” may in fact be the end result of this process of culturally
mediated contact selection rather than being its cause (Lewis et al. 2008).


Appendix


Most of the control variables used in the analyses (see Table A1) are either self- 
explanatory or can be found in the documentation at youthandreligion.org. We 
note variables constructed especially for these analyses: 

Parent closeness is the maximum value of the closeness variable reported by the 
respondent for either parent. The resulting value was standardized (in the full 
sample) to have mean = 0 and SD = 1. 

Network closure was constructed from three variables that were asked of each
respondent's social network. The respondent was asked which of the reported
friends (1) “(do/does) your [PARENT TYPE] not really know that well”; (2) “have
parents who know YOU by name”; and (3) “have parents who know your [PARENT
TYPE] well enough to call (him/her/them) on the phone.” These responses were
combined (the first was reverse coded) to give a sense of how much adult networks
were closed around the respondent. The resulting sum (0-15) was divided by 3 to
make it comparable to the other network measures.
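
The construction described above can be sketched in a few lines of Python. This is a minimal illustration and not the authors' code: the function name, argument names and yes/no coding are hypothetical, and the cap of five friends is an assumption inferred from the stated 0-15 range of a sum over three items.

```python
from typing import Sequence

MAX_FRIENDS = 5  # assumption: three items summing to 0-15 implies at most five friends


def network_closure(
    parent_barely_knows_friend: Sequence[bool],
    friend_parents_know_respondent: Sequence[bool],
    friend_parents_know_parent: Sequence[bool],
) -> float:
    """Return a 0-5 closure score from three per-friend yes/no items."""
    items = (
        parent_barely_knows_friend,
        friend_parents_know_respondent,
        friend_parents_know_parent,
    )
    n_friends = len(items[0])
    if any(len(item) != n_friends for item in items) or n_friends > MAX_FRIENDS:
        raise ValueError("each item needs one answer per friend, for at most five friends")
    # Reverse code the first item so that a higher value always means more closure.
    reversed_first = [not answer for answer in parent_barely_knows_friend]
    total = (
        sum(reversed_first)
        + sum(friend_parents_know_respondent)
        + sum(friend_parents_know_parent)
    )
    # Divide the 0-15 sum by 3 to put it on the scale of the other network measures.
    return total / 3


# Hypothetical respondent who named two friends.
score = network_closure([False, True], [True, True], [True, False])
```

Dividing by 3 means the score can be read as an approximate count of friends around whom the adult network is closed.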

GPA was constructed from the variable “grades” (y91), which asked, “What 
kind of grades do you usually get in school?” The original responses were 10 
ordinal categories ranging from “all As” to “mostly Fs” with an additional category 
for “mixed” (n = 159). The GPA measure used in these analyses was rescaled from
the 10-point ordinal scale to a range of 0 to 4, with the “mixed” responses set to
the sample mean.
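
A rough sketch of this recoding follows. It rests on several assumptions that the appendix does not state: that the ten categories are coded 1 (“all As”) through 10 (“mostly Fs”), that the rescaling is linear, and that the sample mean is taken over the non-“mixed” responses. The function name is hypothetical.

```python
from typing import List, Optional


def gpa_scale(grade_codes: List[Optional[int]]) -> List[float]:
    """Map ordinal grade codes (1 = "all As" ... 10 = "mostly Fs") to a 0-4 scale.

    None stands for the separate "mixed" category.
    """
    # Linear rescaling is an assumption; the appendix only says the scale was rescaled.
    rescaled = [None if code is None else 4 * (10 - code) / 9 for code in grade_codes]
    observed = [value for value in rescaled if value is not None]
    sample_mean = sum(observed) / len(observed)
    # "Mixed" responses receive the mean of the non-mixed responses.
    return [sample_mean if value is None else value for value in rescaled]


# Hypothetical respondents: one "all As", one "mostly Fs", one "mixed".
example = gpa_scale([1, 10, None])
```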

Parent education is the highest level of education for either parent, measured 
on a 12-point ordinal scale. 


Multidimensionality and Gravity in Global Trade, 1950-2000 


Min Zhou, Harvard University 


The expansion of global trade in the post-war period is subject to various interpre- 
tations. Some stress the trade-promoting role of the novel features in the world 
economy; some insist on the role of traditional factors, such as geographic distance, 
political difference and cultural dissimilarity, in continuously depressing trade flows; 
others even argue that the importance of these traditional factors has been on the rise. 
To adjudicate the divergence, this article applies the gravity model to a large data set 
on global bilateral trade from 1950 through 2000. Although the global institutional 
factor does promote bilateral trade and global economic activity indeed becomes 
more trade-generating, the trade-depressing effects of geographic distance, political 
difference and cultural dissimilarity remain strong. Moreover, geographic and cultural 
proximity actually generates greater gravity over time that draws countries together, 
which may trigger fragmentation in global trade along geo-cultural lines. 





Introduction 


A central characteristic of the period after World War II has been the dramatic expan- 
sion of global trade. In terms of trade the world is indeed becoming more tightly 
connected—countries have increasingly larger numbers of trading partners, and the
world trade network has become denser (Kim and Shin 2002). Trade is a type of 
multidimensional interaction, such that economic, geographic, cultural, political 
and global institutional factors are all potentially associated with its strength (Irwin 
and Kasarda 1994; Frankel 1997, 1998). Although trade does not directly result 
from these influences, scholars have long recognized that distance or difference in 
geographic, political and cultural dimensions depresses trade between countries, 
while countries’ overall economic activity and the institutionalization of global gov- 
ernance are positively associated with bilateral trade flows. The explosion of global 
trade has triggered some critical theoretical questions. For example, have economic 
forces and institutionalized global governance increasingly promoted trade flows? 
Has global trade been freed of the geographic distance, political difference and 
cultural dissimilarity that have traditionally fragmented global trade? 

The core question is how the effects of the multidimensional factors on global 
trade have changed over time. Centered on this core question, three competing 
perspectives can be identified within the globalization scholarship. I call them 
the integration, continuity and fragmentation perspectives, respectively. Their 
fundamental distinction is their different predictions about the direction of
change over time in the effects of various factors on bilateral trade flows. First,
the integration perspective stresses that new transportation and communications 
technologies (Castells 2000; Dicken 2003) and increasing institutionalization of 
global economic governance (Murphy 1994; Fligstein 2005) have energized the 
explosive growth in global trade in the postwar period. According to this perspective,
global economic activity has become increasingly trade-oriented (Gereffi
and Korzeniewicz 1994; Feenstra 1998), and the institutionalized global gover- 
nance has become a driving force for bilateral trade (Ingram, Robinson and Busch 
2005). Meanwhile, traditional restrictions imposed by geographic distance,
cultural dissimilarity and political difference have decreased drastically, leading 
to the “death of distance” (Cairncross 1997), the prevalence of the neoliberal 
ideology, and the emergence of a global culture of consumerism (Omae 1990; 
Sklair 1995; Cairncross 1997; Friedman 2005). Second, despite acknowledging 
the importance of technological and institutional innovations, the continuity per- 
spective envisions unchanged multidimensional complexities underlying global 
trade. It contends that the persistent trade-depressing impact of traditional factors 
such as geographical distance (Krugman 1991a; Frankel 1997; Hummels 2007), 
differences in political regimes (Bliss and Russett 1998; Mansfield, Milner and 
Rosendorff 2000; Milner and Kubota 2005), and cultural dissimilarity in lan- 
guages, colonial histories and civilizational traditions (Huntington 1996; Rauch 
1999; Guiso, Sapienza and Zingales 2004), continues to matter for global trade. 
Last but not least, such theories as “new economic geography” and “clash of civi- 
lizations” go even further by claiming that the geographic, political and cultural 
factors have become increasingly important for global trade (Krugman 1991a;
Fujita, Krugman and Venables 1999; Russett and Oneal 2001; Huntington 1993, 
1996). According to this fragmentation perspective, global trade is actually be- 
coming more and more fragmented along geographic, political and cultural lines. 

To adjudicate the divergence among the three perspectives, this study analyzes 
a large data set about trade flows between pairs of countries (or dyads), which cov- 
ers 174 countries from 1950 through 2000. Using the gravity model, it estimates 
how the five sets of factors (economic, geographic, cultural, political and global 
institutional) influence global bilateral trade and, more importantly, how their 
respective influence changes over time. 


Change Over Time in Multidimensional Influences on Trade 


Theories on globalization generally agree that two fundamental changes have
occurred to the infrastructure of the world economy. On the one hand, the great 
leap in transportation and communications technology has profoundly altered the 
terrain of global trade, both in the sense of geographical reach and organization of 
economic activities (Perraton et al. 1997; Castells 2000). Improvements in transportation
and communications systems enable the disagglomeration of economic
activities and foster trade over wider distances, thereby rendering global economic
activities more trade-oriented. On the other hand, the institutionalization of
global governance opens up new markets and provides a set of general norms and 
rules that facilitate global transactions (Fligstein 2005). It also connects countries 
more closely through institutional ties, which are easily transformed into trade 
relations (Ingram, Robinson and Busch 2005). Consequently, the integration per- 
spective argues that the changing nature of the world economy and the enhanced 
institutionalization of global governance boost global trade. There are no apparent 
disputes among globalization scholars over the positive effects of the economic 
and global governance factors; however, empirically it is unclear how their effects 
change over time and whether they are growing in magnitude as the integration 
perspective predicts. The more prominent contention concerns the change over 
time in the effects of geographic, political and cultural factors. Competing predic- 
tions can be developed based on the integration, continuity and fragmentation 
perspectives on globalization. 


Changing Nature of the World Economy 


The change in the technological and institutional infrastructure of the world econ- 
omy in the postwar period alters the way in which global economic activities are 
conducted, which in turn has great implications for global trade. According to many 
globalist theories, economic activities are becoming increasingly trade-oriented, and 
this enhanced trade orientation is spreading to almost all industries and sectors. 

First, economic activities have become increasingly trade-generating in the 
postwar period. Firms are increasingly able to divide the production process 
into different stages and locate them over the world according to comparative 
advantage, or so-called “slicing up the value chain” (Krugman 1995), thereby 
increasing trade in inputs, semi-finished manufactures and final products. In 
this “disintegration of the production process” (Feenstra 1998), manufacturing 
or services activities done abroad are combined with those performed at home. 
The delocalization of production leads to more world trade as intermediate inputs 
cross borders several times during the production process, which is termed “intra-mediate
trade” (Feenstra 1998). A number of studies have called attention to the
prevalence of internationalized production in the world economy (Gereffi and
Korzeniewicz 1994; Friedman 2005). 

Second, the enhanced trade orientation is not limited to a handful of industries. 
If we see the whole range of economic activities involved in the design, production 
and marketing as “commodity chains,” the formation of denser producer-driven 
global commodity chains in capital- and technology-intensive industries and 
buyer-driven global commodity chains in labor-intensive industries extends the 
intensified trade-orientation to both traditional and modern industries (Gereffi
and Korzeniewicz 1994; Gereffi 1999).

Third, the influence of internationalized economic activities is not confined to 
tradable sectors, but extends to the whole economy. The “interlocking relationship”
between tradable and non-tradable production is becoming deeper and
denser—non-tradable production often requires tradable inputs and non-tradable
outputs are often sold to the tradable sector (Held et al. 1999). 

Based on these observations, the overall economic activity, for which GDP is 
the conventional proxy, is argued to be increasingly trade-generating. Although it 
is not surprising that GDP is positively related to trade, this relationship should 
be enhanced by the intensified trade-orientation within the world economy. The 
first hypothesis is to empirically test this enhanced trade-generating capability 
of global economic activity over time. In the gravity model the product of two 
countries’ GDP is commonly used to assess the effect of two countries’ overall 
economic activity on their bilateral trade, so Hypothesis 1 is proposed as follows: 

Hypothesis 1 (integration argument): The product of two countries’ GDP has
a positive effect on bilateral trade volume, and this positive effect increases in 
magnitude in the postwar period. 
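
One way to read Hypothesis 1 is as a claim about an elasticity that grows across periods. The sketch below is a deliberately simplified, bivariate version of a log-linear gravity equation, fitted by ordinary least squares; it omits distance and every other covariate that a full gravity model would include. The function name and all numbers are invented for illustration and are not the article's data or estimates.

```python
import math
from typing import Sequence


def gdp_product_elasticity(
    trade: Sequence[float], gdp_i: Sequence[float], gdp_j: Sequence[float]
) -> float:
    """OLS slope of log(trade) on log(GDP_i * GDP_j) for a set of dyads."""
    x = [math.log(a * b) for a, b in zip(gdp_i, gdp_j)]
    y = [math.log(t) for t in trade]
    x_bar = sum(x) / len(x)
    y_bar = sum(y) / len(y)
    cov_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    var_x = sum((xi - x_bar) ** 2 for xi in x)
    return cov_xy / var_x


# Made-up dyads: trade is generated with a known elasticity in each period.
gdp_i = [10.0, 50.0, 200.0, 800.0]
gdp_j = [20.0, 40.0, 100.0, 300.0]
early = [0.5 * (a * b) ** 0.6 for a, b in zip(gdp_i, gdp_j)]
late = [0.5 * (a * b) ** 0.9 for a, b in zip(gdp_i, gdp_j)]

beta_early = gdp_product_elasticity(early, gdp_i, gdp_j)
beta_late = gdp_product_elasticity(late, gdp_i, gdp_j)
# Hypothesis 1 expects a positive elasticity that is larger in the later period.
supports_h1 = beta_early > 0 and beta_late > beta_early
```

Because the illustrative trade values are generated without noise, the fitted slopes simply recover the elasticities built into each period; with real dyadic data the comparison would also need standard errors.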


Institutionalization of Global Governance 


Besides the changing nature of the world economy, the institutionalization of 
global governance is the other prominent change that boosts global trade. The 
institutionalization of global governance in the postwar period has been intensified
by the growing network of international governmental organizations (IGOs), which
are considered as agents for this widespread institutionalization. Numerous IGOs 
were established in the postwar period (Jacobson, Reisinger and Mathers 1986). 
Even after adjusting for the new sovereign countries (the number of IGOs could 
be inflated by the increase of countries), the density of IGO networks (the ratio 
of the number of IGOs to that of countries) still has increased dramatically in the 
postwar period (Wallace and Singer 1970; Pevehouse et al. 2003). 

As these IGO networks become both more extensive and more intensive, they 
accelerate the growth of global trade, according to the integration perspective. 
Two lines of literature explain how IGOs promote trade. The first literature places
stress on economic IGOs, which contribute to global trade through negative 
integration and positive integration (Scharpf 1995; Fligstein 2001, 2005). IGOs 
aimed at negative integration promote the dismantling of trade barriers, such as 
tariffs on imported goods, capital controls, import quotas, rules that disadvantage 
foreign companies in doing business, or standards that are manipulated to prevent 
foreign products from entering the domestic market. On the other hand, removing
barriers alone is not enough to expand the world market. Certain regulations
and governance, or positive integration, are called for. Global governance of
world trade cuts down transaction costs associated with economic exchange, and 
provides institutional underpinnings of new markets. 

The second literature notes that non-economic IGOs, such as social and
cultural IGOs (SCIGOs), also bring substantial increases in global trade (Ingram,
Robinson and Busch 2005). SCIGOs increase awareness, understanding and positive
sentiment between people from different countries, thereby enlarging selection pools
and positively affecting consumers’ preference. They also produce positive inter- 
personal associations between citizens of different countries (Ingram, Robinson 
and Busch 2005), and thus increase bilateral affinity (Russett and Oneal 2001). 
The increased understanding and appreciation of another country and its culture 
enhance preference for and trust in its products, thereby increasing consumption 
of goods from that country (Guiso, Sapienza and Zingales 2004). 

Therefore, whether or not they are established for the purpose of promoting
economic exchanges, all IGOs are part of the institutionalization of global
governance and provide governance and inter-country ties that potentially boost 
trade. To gauge the overall effect of IGOs on trade, I use the total number of 
joint-membership IGOs between countries. Using total numbers takes care of
the multiplexity of IGO involvement and the interdependence of overlapping 
institutions, while using joint membership is due to the fact that IGO influence 
often requires that both countries be subject to the same IGOs (Russett and 
Oneal 2001; Ingram, Robinson and Busch 2005). As a proxy for the intensity of 
involvement in global institutions, IGO membership should have a positive effect 
on trade, and the increased density and effectiveness of this governance should 
make this effect increase over time. 

Hypothesis 2 (integration argument): The total number of joint-membership 
IGOs has a positive effect on trade between countries, and the magnitude of this 
positive effect increases in the postwar period. 
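
The joint-membership measure can be illustrated with a simple set intersection. This is a hedged sketch: the function name, the country labels and the IGO labels are placeholders, not entries from the data set used in the article.

```python
from typing import Dict, Set


def joint_igo_count(memberships: Dict[str, Set[str]], country_a: str, country_b: str) -> int:
    """Count the IGOs to which both countries in a dyad belong."""
    return len(memberships[country_a] & memberships[country_b])


# Hypothetical membership rosters; country and IGO labels are placeholders.
memberships = {
    "A": {"IGO_1", "IGO_2", "IGO_3"},
    "B": {"IGO_2", "IGO_3", "IGO_4"},
    "C": {"IGO_5"},
}
shared_ab = joint_igo_count(memberships, "A", "B")
shared_ac = joint_igo_count(memberships, "A", "C")
```

Counting only shared memberships reflects the point made above that IGO influence often requires both countries to be subject to the same organization.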


Geographic Factors 


One prominent feature of trade is its spatial component. As early as the 1950s, 
the “trade matrix” literature empirically found a strong negative correlation 
between trade and distance (Beckerman 1956). Although the estimate produced by
this “trade matrix” technique is rough because it does not hold other influences
constant, the importance of distance has been so widely accepted that one cannot get
very far into an empirical analysis of bilateral trade without recognizing the inhibiting
effect of distance. In the integration perspective, however, geography has be- 
come less a friction in global trade because the reduction in transportation and 
communications costs entails a “compression” of space, a shrinking of the world 
(Harvey 1989; Dicken 1998), or even “the death of distance” (Cairncross 1997).
The continuity perspective does not agree with the diminishing importance 
of geography. Scholars have noticed that roughly a quarter of global trade by 
value occurs between countries that share a land border, and that half of global 
trade takes place between partners less than 3,000 kilometers apart (Hummels 
2007). Moreover, these proportions have remained largely constant over recent 
decades. While the artificial factor of preferential trade policy may play a role, 
some scholars assert that the dominant explanation for the high regional concentration
of trade must be geographical proximity (Krugman 1991a; Frankel
1997). This “local bias of trade” is so strong that “they make overwhelmingly
clear that distance still matters and still creates natural trading blocs” (Krugman
1991b:20). Such theories as “new economic geography” call for a “geographic
turn” in studying the world economy (Krugman 1991a, 1995; Fujita, Krugman 
and Venables 1999). 

Several recent studies even suggest that distance is becoming more important 
for several reasons. According to this fragmentation perspective (see Hummels 
2001, 2007; Leamer and Storper 2001), countries are trading greater volumes 
of goods that are highly sensitive to distance and there is a shift in the composi- 
tion of global trade toward industries with high distance sensitivity. In addition, 
modern productive activities are often divided into stages and located on a global 
scale, thereby boosting global trade of intermediate inputs; meanwhile, the time- 
dependent nature of these modern productive activities raises the cost of time. 
Hence, a large increase in global trade comes from imported inputs traded only 
among proximate countries. As a result of these changes, geographic distance not 
only still matters, but its effect should be on the rise. 

Based on these apparent differences among the three perspectives, three com- 
peting hypotheses are proposed as follows: 

Hypothesis 3-1 (integration argument): The effect of geography (distance and 
nonadjacency) on bilateral trade diminishes over time in the postwar period. 

Hypothesis 3-2 (continuity argument): The effect of geography (distance and 
nonadjacency) on bilateral trade remains largely unchanged over time in the 
postwar period. 

Hypothesis 3-3 (fragmentation argument): The effect of geography (distance 
and nonadjacency) on bilateral trade increases over time in the postwar period. 


Political Factors 


Political scientists have long argued that a country’s political institutions affect 
its trade policies, which in turn influence its international trade relations. In 
particular, a country’s regime type is related to its trade openness — democratic 
countries are more likely to carry out liberal trade policies than their autocratic 
counterparts (Mansfield, Milner and Rosendorff 2000; Milner and Kubota 2005), 
and regime change toward democracy induces trade liberalization (Milner 1999; 
Milner and Kubota 2005). Some economists such as Rodrik (1994:69) concur 
with this argument, claiming that “historically sharp changes in trade policy have 
always been preceded (or accompanied) by changes in the political regime.” In 
terms of bilateral trade, it is argued that trade flows continue to be significantly 
greater in democratic pairs than mixed pairs (composed of an autocracy and a 
democracy) or autocratic pairs, because groups of democracies are better able to 
liberalize trade than groups of autocracies or groups composed of both democracies
and autocracies (Dixon and Moon 1993; Bliss and Russett 1998; Morrow,
Siverson and Tabares 1998; Mansfield, Milner and Rosendorff 2000).

In this continuity literature, two patterns with respect to how political regimes
affect bilateral trade can be identified. One is the “regime difference” argument, 
which claims that differences in political regimes impede bilateral trade. For ex- 
ample, through formal modeling Mansfield, Milner and Rosendorff (2000) show 
that due to institutional differences (such as the legislature’s role in making trade 
policies) between democracies and autocracies, democratic pairs tend to agree on 
lower trade barriers. The other is the “weak-link” argument. The so-called weak 
link refers to the less democratic one of the two countries involved in a trade dyad. 
According to this argument, in bilateral trade it is the degree of democracy of the 
less democratic partner that would most influence trade relations between them. 
The more democratic it is, the more ready its even more democratic partner will 
be to trade with it (Oneal and Russett 1997; Bliss and Russett 1998). 
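
The two arguments translate into two different dyad-level variables. The sketch below shows one plausible operationalization, assuming each country has a numeric democracy score on which higher values mean more democratic. Taking the absolute difference as the regime-difference measure is an assumption made here for illustration, and the names and scores are hypothetical.

```python
from typing import NamedTuple


class RegimeMeasures(NamedTuple):
    regime_difference: float
    weak_link: float


def regime_measures(democracy_a: float, democracy_b: float) -> RegimeMeasures:
    """Compute both dyadic regime measures from two democracy scores."""
    return RegimeMeasures(
        # Regime difference: how far apart the two regimes are.
        regime_difference=abs(democracy_a - democracy_b),
        # Weak link: the score of the less democratic partner.
        weak_link=min(democracy_a, democracy_b),
    )


# Hypothetical scores on an assumed scale where higher means more democratic.
dyad = regime_measures(8.0, -3.0)
```

Both measures are symmetric in the two countries, so the order in which the partners are listed does not matter.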

However, there are also theories suggesting a declining role of political factors 
in global trade over time. The institutionalization of global governance, embodied 
in international institutions such as the World Trade Organization, International 
Monetary Fund and World Bank, exerts external pressures and spreads neoliberal 
ideas that push all countries toward trade liberalization, regardless of their politi- 
cal regimes at home. Moreover, the failure of inward-looking economies and the 
success of export-oriented growth models have been attracting countries to open 
up to foreign trade, and this attraction appeals to both democratic and autocratic 
countries (IMF 1997; Dicken 2003; Friedman 2005). In this light, regime types 
should make little difference for economic liberalization (Nelson 1990; Remmer 
1990). This integration perspective predicts a decrease in the effect of political 
regime type on bilateral trade flows. 

In contrast, the prevalence of neoliberalism in today’s world economy, accord- 
ing to some political scientists (Russett and Oneal 2001), may actually make the 
effect of political factors on trade more salient. Because they are more exposed
to information from the outside world and follow international norms or rules more
closely, democratic countries are more likely to be affected by the neoliberal ideol- 
ogy and more easily yield to external pressures that advocate free trade. As a result, 
compared with their autocratic counterparts, democratic countries trade increas- 
ingly more with each other. Thus, this perspective implies an increasing effect of 
political factors on trade. 

Taken together, again we come up with three more or less contradictory hypotheses: 

Hypothesis 4-1 (integration argument): The effect of political regime (regime dif- 
ference and weak link) on bilateral trade diminishes over time in the postwar period. 

Hypothesis 4-2 (continuity argument): The effect of political regime (regime 
difference and weak link) on bilateral trade remains largely unchanged over time 
in the postwar period. 

Hypothesis 4-3 (fragmentation argument): The effect of political regime (regime 
difference and weak link) on bilateral trade increases over time in the postwar period. 

Cultural Factors
Multidimensional differences in value and social action have been identified
among national cultures across the globe (Hofstede 2001), which may have an 
impact on inter-country economic activities. In the field of international busi- 
ness, a large number of studies have examined the effect of cultural differences on 
economic activities in global markets. Different cultural backgrounds are found 
to be the catalyst for cultural biases in economic exchange, as larger cultural 
differences make it increasingly difficult to understand, control and predict the 
behavior of others (Elsass and Veiga 1994), which hinders the development of 
trust and complicates effective pairwise interactions (Neal 1998). Thus, the risk 
of malfeasance and uncertainty inherent in trade causes trade to be concen-
trated among peoples with a common culture and long-term relationships (Guiso,
Sapienza and Zingales 2004).

These studies underpin the continuity perspective, which stresses the continuing
importance of cultural similarities for bilateral trade flows. In particular,
common languages, former colonial ties and civilization types are most frequently 
employed as proxies for cultural similarities. Trade is facilitated by the ease of com- 
munication in a common language (Bliss and Russett 1998; Rauch 1999); studies 
of historical statistics have revealed the relative stability of historical colonial ties, 
such that trading preference within former colonial empires is still discernible 
(Louis and Robinson 1994; Frankel 1998); sharing the same religion-based civili- 
zation implies similar social values and behavioral patterns that facilitate commu- 
nication, foster trust and generate similar lifestyles and tastes (Huntington 1993, 
1996). Costs associated with global trade increase with cultural differences, and 
cultural differences remain an impediment to bilateral trade flows. 

Nevertheless, many integration theories foresee a decline in the importance of 
cultural factors. For example, new transportation and communication technologies
give rise to a networked global society and make communications among
different cultures less and less problematic (Castells 2000; Friedman 2005); in- 
stitutionalized global governance has greatly reduced uncertainty associated with 
trade and boosted trust across markets with different cultures (Fligstein 2005; 
Ingram, Robinson and Busch 2005). There are also a few globalist scholars who 
maintain that national cultural differences are becoming irrelevant because a 
global culture is in the making and the world is becoming populated by cos- 
mopolitan consumers (Levitt 1983). This “globalizing consumerism” brings in 
a standardization of cultural tastes and desires driven by symbols, images and 
the converging aesthetics of the lifestyle and the self-image (Sklair 1995), and all 
accustomed differences in national and regional preferences will be gone (Levitt
1983). In a word, the integration perspective predicts a diminishing effect of
cultural factors on bilateral trade flows.

On the other side of this debate, some scholars argue that the effect of cultural
factors is on the rise and the world economy is becoming more and more
fragmented along cultural lines. They reject the notion of a global culture in the
making but foresee a scenario of cultural fragmentation (Smith 1990; Appadurai 
1996; Huntington 1993, 1996). According to this fragmentation perspective, 
cultural differences are real and deep-rooted, and national cultures and values 
change in “path-dependent” rather than convergent ways (Inglehart and Baker 
2000). Moreover, national cultural differences are likely to be reinforced through 
frequent interaction with people from other cultures, and any potential “global 
culture” provokes resistance in national cultures (Appadurai 1996). As a result, as
the world economy becomes more liberal and countries have more freedom to 
choose their trade partners, this reinforced cultural factor naturally draws together 
countries with the same cultural background. One prominent account is the “clash 
of civilizations” theory, which claims that sharing the same religion-based civiliza- 
tion is the foundation for successful economic integration, and that the world is 
actually fragmenting into civilization blocs (see Huntington 1993, 1996). 

Accordingly, three competing hypotheses arise from these perspectives: 

Hypothesis 5-1 (integration argument): The effect of cultural factors (common
languages, former colonial ties and civilization types) on bilateral trade diminishes 
over time in the postwar period. 

Hypothesis 5-2 (continuity argument): The effect of cultural factors (common 
languages, former colonial ties and civilization types) on bilateral trade remains 
largely unchanged over time in the postwar period. 

Hypothesis 5-3 (fragmentation argument): The effect of cultural factors (com- 
mon languages, former colonial ties and civilization types) on bilateral trade in- 
creases over time in the postwar period. 


Data and Method 
Data 


The data used here span 50 years, from 1950 through 2000 at five-year intervals.
The dataset has 11 time points and covers 174 countries altogether.¹ To accommodate
the definition of IGOs, I keep only sovereign states in the analysis. Note
that several countries do not have data at all 11 time points, mainly because they
were either established after 1950 (e.g., Kenya and Russia) or ceased to exist
before 2000 (e.g., East Germany and the Soviet Union). The unit of observation is a
“dyad-year,” that is, a given pair of countries in a given year.


Dependent Variable 

Bilateral trade volume: The data come from Rose (2004), who took the information
from the IMF’s Direction of Trade Statistics.² They cover bilateral merchandise
trade from 1950 through 2000. Bilateral trade data are recorded in U.S.
dollars, and the raw data are converted into constant 2000 U.S. dollars so
that they are comparable both across countries and over time. As a precaution
against possible inconsistency in export and import data reporting, bilateral
trade volume is created by averaging all four possible measures, namely,
exports from country i to country j, imports into i from j, exports from j to i,
and imports into j from i.
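
The averaging step can be sketched as follows. This is a minimal illustration only: the function name and the choice to skip missing reports (passed as None) are assumptions and are not taken from Rose (2004) or from the article.

```python
from statistics import mean


def dyad_trade_volume(exp_i_to_j, imp_i_from_j, exp_j_to_i, imp_j_from_i):
    """Average the reported measures of trade between countries i and j.

    Assumption: a missing report is passed as None and skipped, so the
    average is taken over whatever reports are available.
    """
    reports = [exp_i_to_j, imp_i_from_j, exp_j_to_i, imp_j_from_i]
    available = [r for r in reports if r is not None]
    if not available:
        return None  # no report at all for this dyad-year
    return mean(available)
```

Averaging over the available reports dampens the discrepancy that arises when the two partners record the same flow differently.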


Explanatory Variables 
(1. GDP: The GDP data are obtained from the United Nations Common Database, 
and are in constant U.S. dollars in 2000 based on the World Bank estimates. 


(2. IGOs: The data on IGO membership are from “International Governmental 
Organizations Data Set” (version 2.1; Pevehouse, Nordstrom and Warnke 2003; 
Wallace and Singer 1970). The data present the count of common IGO member-
ships for each pair of states in a given year (e.g., Thailand-India in 1990).³


(3. Geographic properties (distance and contiguity): The data on distance are 
from Rose (2004); the adjacency data come from the Correlates of War Direct 
Contiguity data set (version 3.0; Stinnett et al. 2002; Gochman 1991). Distance 
between countries is measured by linear distance between national capitals.⁴ I
use two categories of contiguous dyads, one for land contiguity and one for
water contiguity. Land contiguity is defined as the intersection of the homeland
territory of the two states in the dyad, either through a land boundary or a river,
such as the Rio Grande in the case of the U.S.-Mexico border. Water contiguity
is defined as the two states in the dyad being separated by water but within 400 miles
of each other.


(4. Cultural properties (civilization type, common language and common colonial 
history): The categorization of each country’s civilization membership comes from 
Henderson (2004). Countries are divided into 10 civilization groups, namely 
the African, Buddhist, Confucian, Hindu, Islamic, Japanese, Latin American, 
Orthodox, Western and other civilizations, based on major world religions and 
Huntington's theory (1993). Common language and colonial history are compiled
from Rose (2004), supplemented by the CIA’s World Factbook (2008); these variables
indicate whether the two countries in a given dyad have the same primary language
and whether they share a common colonial history.⁵


(5. Political regime (regime difference and weak link): The democracy score of 
each country comes from the Polity IV dataset compiled by Marshall and Jaggers 
of the Polity Project, which contains coded annual information on regime char-
acteristics.⁶ I draw on the 21-point measure of a state’s democratic characteristics.⁷
A larger score indicates a more democratic regime. I then create two measures to
characterize the polity of both members of a dyad jointly. To create the measure
of “regime difference,” I first obtain the absolute value of the difference in the two
countries’ regime scores, and then add 1 to this value in order to avoid 0, whose
logarithm is undefined. The second measure, “weak link,” identifies
the lower of the two regime scores in the dyad.
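
A minimal sketch of the two dyadic regime measures follows. The function names are hypothetical. The shift of the Polity score from the -10..10 range onto 1..21 is an assumption, made so that the weak-link score is positive before logs are taken; the article states only that both measures range from 1 to 21.

```python
import math


def shift_polity(score):
    """Map a Polity score in [-10, 10] onto 1..21 (assumed rescaling)."""
    if not -10 <= score <= 10:
        raise ValueError("Polity score must lie between -10 and 10")
    return score + 11


def regime_difference(score_i, score_j):
    """Absolute difference in regime scores plus 1, so the log is defined."""
    return abs(score_i - score_j) + 1


def weak_link(score_i, score_j):
    """Rescaled score of the less democratic member of the dyad."""
    return min(shift_polity(score_i), shift_polity(score_j))


# Hypothetical dyad: a full democracy (10) paired with an autocracy (-7).
ln_pdiff = math.log(regime_difference(10, -7))
ln_pweak = math.log(weak_link(10, -7))
```

Two identical regimes give a regime difference of 1 and therefore a log of 0, which is why the constant is added before the transformation.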


Gravity in Global Trade © 1629 


Controlled Variables 

The population data are obtained from the United Nations Common Database. 
Population is a conventional control variable, which is a proxy for the possibility of 
self-sufficiency. A country with a large population is more likely to have its needs 
met in its domestic markets, while a country with a small population depends 
more frequently on international markets for various needs. The data on regional
trade arrangements (RTAs) come from the World Trade Organization, which records
all existing trade arrangements.⁸ The RTA variable is coded as binary, indicating
whether there are any RTAs between a pair of countries. Although some RTAs are
indeed IGOs (e.g., the EU and NAFTA), many are not (e.g., the US-Israel RTA,
Laos-Thailand RTA and Canada-Chile RTA).⁹ I control for this variable to rule out
the possibility that the observed trade concentration along geographic, political 
or cultural lines is wholly caused by existing discriminatory trade arrangements. 


The Gravity Model 
To test the hypotheses, I need to estimate the effects of relevant variables on 
bilateral trade flows. It has long been recognized that bilateral trade patterns are 
well described empirically by the so-called “gravity equation.” With a functional 
form reminiscent of the law of gravity in physics, a simple version of the gravity 
equation is typically specified as: 

$$T_{ij} = A \frac{Y_i Y_j}{D_{ij}},$$

where $T_{ij}$ is the total value of trade flows between country i and country j, the
Ys are their respective national incomes (often using GDP as a proxy), $D_{ij}$ is a
measure of the distance between them, and A is a constant of proportionality. The
standard procedure is to take the logarithms of the original multiplicative gravity
equation, expressing the dependent variable as $\log(T_{ij})$, so as to be able to estimate
it in linear form. The double-log specification allows the interpretation of the coef-
ficients as percentage changes. Extensions to this basic gravity model have been
widely employed to discern the marginal explanatory power of other trade-related 
variables of interest (see Frankel 1997, 1998; Rose 2004; Ingram, Robinson and 
Busch 2005). The gravity model has both sound theoretical foundations and 
general empirical success (Feenstra, Markusen and Rose 2001). Many economists 
have shown that this model is consistent with a wide range of economic theories 
(see Anderson 1979; Helpman 1987; Bergstrand 1989; Deardorff 1998), and it has been
called “one of the great success stories in empirical economics” (Feenstra,
Markusen and Rose 2001:431).
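
The log-linearization of the simple gravity equation can be illustrated with a short sketch. The data below are synthetic and noiseless, and the constant A = 2.5 is arbitrary; none of it comes from the article's dataset. The sketch shows only that ordinary least squares on the logged equation recovers an income coefficient of 1 and a distance coefficient of -1.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200

# Synthetic dyads (assumed values, for illustration only).
gdp_product = rng.uniform(1e3, 1e6, n)   # Y_i * Y_j
distance = rng.uniform(100.0, 1e4, n)    # D_ij
A = 2.5                                  # constant of proportionality

# Multiplicative gravity equation: T_ij = A * Y_i * Y_j / D_ij
trade = A * gdp_product / distance

# Taking logs gives ln T = ln A + 1 * ln(Y_i Y_j) - 1 * ln D,
# which is linear in the parameters and can be fitted by OLS.
X = np.column_stack([np.ones(n), np.log(gdp_product), np.log(distance)])
coef, *_ = np.linalg.lstsq(X, np.log(trade), rcond=None)
intercept, beta_gdp, beta_dist = coef
```

With real data the fitted coefficients are not restricted to 1 and -1, which is what lets the extended model measure how strongly each factor matters.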
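In this research, the general specification of the gravity model is as follows. All
variables are in logarithmic form except dummy variables.

$$
\begin{aligned}
\ln(\mathrm{Trade}_{ijt}) ={} & \beta_0 + \beta_1 \ln(\mathrm{GDP}_{it}\,\mathrm{GDP}_{jt}) + \beta_2 \ln(\mathrm{IGO}_{ijt}) + \beta_3 \ln(\mathrm{Dist}_{ij}) + \beta_4 \mathrm{LAdj}_{ij} \\
& + \beta_5 \mathrm{SAdj}_{ij} + \beta_6 \ln(\mathrm{PDiff}_{ijt}) + \beta_7 \ln(\mathrm{PWeak}_{ijt}) + \beta_8 \mathrm{Civil}_{ij} + \beta_9 \mathrm{Lang}_{ij} + \beta_{10} \mathrm{Colo}_{ij} \\
& + \beta_{11} \ln(\mathrm{Pop}_{it}\,\mathrm{Pop}_{jt}) + \beta_{12} \mathrm{Rta}_{ijt} + \varepsilon_{ijt}
\end{aligned}
$$
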
where i and j denote the countries in a dyad, t represents the time point, and the
variables are:
(1. $\mathrm{Trade}_{ijt}$ is the real value of bilateral trade between i and j in year t;

(2. $\mathrm{GDP}_{it}$ is the real value of GDP;

(3. $\mathrm{IGO}_{ijt}$ is the number of IGOs that country i and country j are simultaneously
members of in year t;

(4. Geographic factors: $\mathrm{Dist}_{ij}$ is the distance between two countries; $\mathrm{LAdj}_{ij}$ is a binary
dummy for contiguity on land; $\mathrm{SAdj}_{ij}$ is a binary dummy for contiguity by sea;

(5. Political factors: $\mathrm{PDiff}_{ijt}$ is the difference in the two democratic scores for the
dyad (i and j) at time t, while $\mathrm{PWeak}_{ijt}$ is the smaller value of the two democratic
scores for the dyad. Both measures range from 1 to 21, and a higher score indicates
a higher degree of democracy;

(6. Cultural factors: $\mathrm{Civil}_{ij}$ is a binary dummy indicating whether the two countries
belong to the same civilization; $\mathrm{Lang}_{ij}$ is a binary dummy for common language;
$\mathrm{Colo}_{ij}$ denotes colonizer-colony relations, and is a binary variable which is unity
if country i ever colonized j or vice versa;

(7. Controls: $\mathrm{Pop}_{it}$ is the population of country i; $\mathrm{Rta}_{ijt}$ is a binary variable that is
unity if i and j belong to the same regional trade arrangement (RTA) at time t;

(8. $\varepsilon_{ijt}$ is the model disturbance for a dyad (i and j) in year t, representing the omit-
ted other influences on bilateral trade.


Estimating Methods 


The following analysis is mainly conducted by a combination of the fixed-effects
model (FEM) and the random-effects model (REM). This method is often called the
two-step fixed- and random-effects model, or 2SFRM (see Wooldridge 2002; Halaby 2004).
The 2SFRM applies the FEM to time-varying variables in stage one, and estimates
the coefficients of time-invariant variables by the REM in stage two. It suits the
purpose of this particular research well and has several prominent advantages over
other conventional methods.¹⁰

The 2SFRM involves mixing estimators that have the desirable properties of 
fixed effects for time-varying explanatory variables with random effects estimators 
for time-invariant explanatory variables. The greatest advantage of the FEM is 
that it allows explanatory variables to be correlated with unit-specific unobserved 
variables. In this research, the FEM generates unbiased and consistent estimates 
of coefficients for time-varying explanatory variables even if they are correlated 
with unobserved variables specific to each dyad. This feature makes the FEM 
preferable to the REM when estimating coefficients for time-varying variables (see 
Allison 1994; Wooldridge 2002; Halaby 2004). On the other hand, within-group 
estimators of the FEM cannot identify the parameters of observed time-invariant 
variables, while the REM can. This matters here because there are several time-
invariant variables, such as geographic and cultural factors, whose effects need to
be estimated. The 2SFRM is a middle ground that yields the advantages of fixed 
effects while identifying the parameters of time-invariant regressors. 
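
The two-stage logic can be sketched in a deliberately simplified form. The code below is not the article's estimator: stage two here is a plain regression of dyad-mean residuals on one time-invariant regressor rather than a random-effects model, there is no AR(1) correction, and each stage has a single regressor. The function name and arguments are hypothetical.

```python
import numpy as np


def two_step_sketch(dyad, x_varying, z_invariant, y):
    """Simplified two-stage estimate for one time-varying and one
    time-invariant regressor (illustrative assumption, not the 2SFRM)."""
    dyad = np.asarray(dyad)
    x = np.asarray(x_varying, dtype=float)
    z = np.asarray(z_invariant, dtype=float)
    y = np.asarray(y, dtype=float)

    ids, idx = np.unique(dyad, return_inverse=True)
    counts = np.bincount(idx)

    def group_mean(v):
        # Mean of v within each dyad.
        return np.bincount(idx, weights=v) / counts

    # Stage 1: within (fixed-effects) transformation removes anything
    # constant within a dyad, including unobserved dyad effects.
    x_dm = x - group_mean(x)[idx]
    y_dm = y - group_mean(y)[idx]
    beta = (x_dm @ y_dm) / (x_dm @ x_dm)

    # Stage 2: regress dyad-mean residuals on the time-invariant variable.
    resid_mean = group_mean(y - beta * x)
    Z = np.column_stack([np.ones(len(ids)), group_mean(z)])
    (const, gamma), *_ = np.linalg.lstsq(Z, resid_mean, rcond=None)
    return beta, gamma, const
```

The within transformation in stage one wipes out every dyad-constant term, which is exactly why a second stage is needed to recover the coefficient on a variable such as distance.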

To address the main hypotheses, we are most interested in the change over time 
in the effects of various factors on trade, which are the coefficients of the interac- 
tion terms between the covariates and time. Obviously these year-by-covariate 
interaction terms are time-varying, so their coefficients will be obtained by the 
FEM in stage one, which yields unbiased and consistent estimates. This
approach is recommended by methodologists (Angrist 1995; Halaby 2004), who
advocate interacting time-invariant explanatory variables with time so that the 
FEM can be fitted. 

All these advantages indicate that the 2SFRM is an ideal method for this par- 
ticular research. For data characterized by repeated observations on the same units 
(dyads here), the assumption of independent error structures often does not hold 
(Tuma and Hannan 1984; Finkel 1995). Standard Lagrange multiplier tests on 
the data identify a first-order autoregressive scheme, or AR(1). Therefore, in the 
2SFRM I employ the FEM and REM with AR(1) disturbances to account for this 
problem. The AR(1) correction adjusts for p, the autocorrelation between t and 
t-1. Its estimates are shown in the tables. In the following analysis, after the AR(1) 
correction, error structures behave normally. To estimate the 2SFRM, I use the
Stata software (Release 10; StataCorp 2007). Although it does not have a built-in
2SFRM estimator, Stata provides convenient routines such as the “xtregar” com-
mand (Stata 2007) for the estimation of the FEM and REM allowing for AR(1)
disturbances, which makes the estimation of the 2SFRM feasible in two stages.
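
As a rough illustration of what the AR(1) parameter captures, the sketch below estimates a pooled first-order autocorrelation from residuals grouped by dyad. It is an assumption-laden simplification and is not claimed to match the estimator that Stata's routines use; the function name is hypothetical.

```python
def ar1_rho(residuals_by_unit):
    """Pooled lag-1 regression coefficient of residuals on their own lag.

    `residuals_by_unit` is a list of residual series, one per dyad, each
    ordered in time. Lags are never taken across different dyads.
    """
    num = 0.0
    den = 0.0
    for series in residuals_by_unit:
        for prev, curr in zip(series, series[1:]):
            num += prev * curr
            den += prev * prev
    if den == 0.0:
        raise ValueError("no lagged variation in the residuals")
    return num / den
```

A value near 0 would suggest little serial dependence, while the estimates of about .46 reported in the tables indicate that residuals for the same dyad are substantially correlated across adjacent time points.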


Results from the Gravity Model 


Descriptive statistics in Table 1 present the correlations between bilateral trade 
and relevant variables. There are positive correlations between trade and GDP, 
common IGO memberships, adjacency, regime weak link, common civilization 
and language, population, and trade arrangements, whereas trade is negatively cor- 
related with distance, regime difference, and common colonial history. All these 
correlations are significantly different from 0 at the .05 level except that between 
trade and common colonial history (p = .17). 

To more accurately examine these relationships, gravity models are estimated 
by the 2SFRM. Besides the dyadic fixed effects embedded in the 2SFRM, I also 
add the year fixed effects to the model, which consist of a comprehensive set 
of dummies for specific years (these year dummies are not shown in Table 2 
to save space). The year fixed effects control for time-varying global influences 
on trade and account for historical influences that affect all dyads, such as the 
global business cycle, oil shocks and so forth (Rose 2004; Ingram, Robinson 
and Busch 2005). With both the dyadic and year fixed effects in place, the coefficients
in the following models indicate the expected change in bilateral trade
associated with a one-unit change in an independent variable for a given dyad in a
given year (Green, Kim and Yoon 2001; Rose 2004). Results are shown in Table 2.

Table 1: Correlations between Bilateral Trade and Dyadic Predictors
[Correlation matrix not recoverable from the source. Surviving row labels: 2 GDP; 4 Distance; 5 Land Adjacency; 6 Sea Adjacency; 7 Polity Difference; 8 Polity Weak; 12 Population; 13 Regional Trade Arrangement.]

Table 2 shows the effects of the five 
sets of factors on bilateral trade. GDP 
and IGO connections have significantly 
positive effects on bilateral trade. Among 
geographic factors, distance shows a sig- 
nificantly negative effect on bilateral trade, 
while land and sea contiguity, as two 
measures of adjacency, show significantly 
positive effects. Within political factors, 
multicollinearity between regime differ- 
ence and regime weak link is identified. 
After trying different model specifications 
with these two variables and testing in- 
dividual significance, I find that regime 
difference clearly has a significant and 
negative impact on bilateral trade, while 
the less democratic regime does not. The 
result here supports the “regime differ- 
ence” argument, but not the “weak link” 
one. In other words, it is the difference in 
regimes that depresses bilateral trade, not 
the other trade partner being undemo- 
cratic. For example, two autocratic coun- 
tries may still develop close trade relations. 
Because the less democratic regime does 
not have an effect and may cause mul- 
ticollinearity, I exclude it from Model 3 
onward. With regard to cultural factors, 
the trade-promoting effect of common 
civilization fails to reach significance; in 
contrast, both common language and 
colonial history have their trade-fostering 
effects significantly established. 

Overall, the findings in Table 2 support the continuous multidimensionality
inherent in global trade. Overall economic activity and the institutionalization of
global governance promote trade flows, whereas distance or difference in geographic,
political and [...] impedes trade rela[tions]. [...] three perspectives, [...]
[fac]tors change over time. I add in a linear [...] [interac]tion term for each
covariate. To facilitate interpretation I trans[form] [...] original form to numerals
between 0 and 10 so that the baseline of the model describes the year 1950. [...]
interaction terms are created by interacting each covariate with the year term. The
coefficients of the year-by-covariate interaction terms here estimate the effect of
each factor on bilateral trade as varying, conditional on the level of the year term.
Hence, they directly measure the change in these effects over time. Table 3 presents
the results of the models that estimate and test the change over time.

Table 2: Unstandardized 2SFRM Coefficients from Gravity Models of Bilateral
Trade: Overall Effects
[Table body damaged in extraction. Legible entries are listed by row, with t-values in parentheses. Entries marked [illegible] could not be read. Where a row lists fewer values than models, its column assignment is not recoverable. Row labels for Models 3 and 4 are inferred from row order. For Model 5 only the fragments "9B7**" and ".060" survive.]

Models 1 and 2
Economic Factors
  ln(GDPi GDPj): .920*** (21.95); [illegible] (22.28)
Global Institutional Factors
  ln(IGO): .618*** (5.86); [illegible] (4.87)
Geographic Factors
  ln(Distance): -1.107*** (-22.88)
  Land contiguity: no entry
  Sea contiguity: no entry
Political Factors
  ln(Difference): -.056** (-3.35)
  ln(Weak): .033 (1.42)
Cultural Factors
  Common civilization: .068 (.66)
  Common language: no entry
  Colonial history: no entry
Controls
  ln(Popi Popj): [illegible] (-12.68); [illegible] (-12.55)
  RTA: [illegible] (3.95); [illegible] (3.73)
Constant
  FE constant: [illegible] (31.67); 4.380 [significance marks illegible] (32.89)
  RE constant: n/a; -1.486*** (-3.61)
ρ in AR(1)
  FE ρ: .461; .464
  RE ρ: n/a; .465
R²
  FE R²: .187; .200
  RE R²: n/a; .049

Models 3 and 4
  ln(GDPi GDPj): .987*** (22.70); .987*** (22.70)
  ln(IGO): [illegible] (4.85); [illegible] (4.85)
  ln(Distance): [illegible] (-22.98); [illegible] (-24.26)
  ln(Difference): -.028** (-3.09); -.028** (-3.09)
  Common civilization: .062 (.60)
  Common language: .484** (3.17)
  Colonial history: .214* (2.07)
  ln(Popi Popj): -1.054*** (-12.94); -1.054*** (-12.94)
  RTA: .548*** (3.81); .548*** (3.81)
  FE constant: [illegible] (32.67); [illegible] (32.67)
  RE constant: -1.412** (-3.42); -1.081** (-2.87)
  FE ρ: .464; .464
  RE ρ: .465; .464
  FE R²: .200; .200
  RE R²: .050; .050

Notes: N = 55,416
Numbers in parentheses are t-values.
*p<.05 **p<.01 ***p<.001 (two-tailed test)
Two constant terms are displayed: the FE constant is from the first-stage FEM while the
RE constant is from the second-stage REM.
Two R² are displayed: the FE R² is the within-dyad R² from the FEM while the RE R² is
the between-dyad R² from the REM.

1634 • Social Forces 88(4)






Under each model presented in Table 3, the main effect of a covariate represents 
its effect in the baseline year 1950, while the effect of its year-by-covariate interac- 
tion term shows how this effect changes over time: 

(1. The economic factor: the trade-promoting effect of GDP has been getting 
larger over time. This finding supports Hypothesis 1 and its underlying integra- 
tion argument. The nature of global economic activities has become more and 
more trade-generating. 

(2. The global institutional factor: the positive effect of IGO membership gets 
smaller over time. That is, the trade-promoting effect of IGO connections has 
been declining over the years. This decline goes against Hypothesis 2 and its un- 
derlying integration argument. Institutionalization of global governance through 
IGOs does promote trade, but its trade-promoting effect has not become more 
effective over time as expected. 

(3. The geographic factor: In general, the fragmentation and continu[ity ...]

[Text missing in the source: the remainder of item (3. and the opening of item (4., the political factor.]

[... re]gime shows no significant main effect, and there is also no significant change in its
effect over time. As shown in Table 2, what matters for bilateral trade is difference in 
political regimes, not the degree of democracy of the less democratic regime in a dyad. 

(5. The cultural factor: The three cultural factors each demonstrate a unique 
pattern over time: (1. The trade-promoting effect of common language has 
been getting larger over time, which supports the fragmentation argument in 
Hypothesis 5. (2. Sharing colonial history started with a trade-promoting effect, 
but its effect has been gradually disappearing, which is compatible with the inte-
gration argument. It was still a positive one as late as 2000, though (see Figure 1).
(3. Belonging to the same civilization had a negative effect on trade in 1950, but
gradually this effect switched its direction and has now become a clearly trade-
promoting factor (see Figure 1). This pattern is consistent with the fragmentation
argument. It also explains why the overall effect of common civilization on trade 
in the postwar period is neutral and insignificant in Table 2. 

To better display these changes over time, Figure 1 is created based on the 
comprehensive Model 4 in Table 3. For each factor, by combining its main effect 
in 1950 and its interaction effect with time, Figure 1 visualizes the trajectory of 
each factor’s effect on bilateral trade over time. 
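
The way a main effect and a year interaction combine into a trajectory can be sketched as below. The coding of the year term as (year - 1950) / 5, which yields 0 for 1950 and 10 for 2000, is an assumption consistent with the 11 five-year time points; the coefficient values in the example are hypothetical and are not taken from Table 3.

```python
def year_index(year, base_year=1950, step=5):
    """Rescale a calendar year so the baseline year maps to 0 (assumed coding)."""
    return (year - base_year) / step


def effect_in_year(main_effect, interaction, year):
    """Effect of a covariate in a given year: main effect in the baseline
    year plus the year-by-covariate interaction times the year index."""
    return main_effect + interaction * year_index(year)


# Hypothetical coefficients: negative in 1950, rising by 0.05 per period.
trajectory = [effect_in_year(-0.2, 0.05, y) for y in range(1950, 2001, 5)]
```

A factor whose main effect and interaction have opposite signs, as in this hypothetical case, changes direction partway through the period, which is the pattern described for common civilization.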

To better illustrate the substantive significance of the effects of various factors, 
Table 4 transforms estimated coefficients in gravity models above into specific 
percentage changes. It presents the percentage change in bilateral trade associated 
with each particular factor, while controlling for other factors. 


Conclusion and Discussion 


Two major conclusions can be drawn from this research. First, global trade in 
the postwar period continues to be multidimensional in nature and the trade- 
depressing effects of geographic distance, political difference and cultural dissimi- 
larity show no sign of fading away. In comparison with the integration perspective, 
the continuity and fragmentation perspectives are closer to reality. Second,
intensified geo-cultural gravity increasingly draws geographically close and cul- 
turally similar countries together, which may trigger a clustering process or even 
fragmentation along geographic and cultural lines at the global level. 

As many globalization theories predicted, the economic and global institu- 
tional factors promote global trade. The integration perspective is also correct in 
predicting global economic activities to be increasingly trade-oriented and trade- 
generating. Nevertheless, this perspective has overestimated the trade-promoting 
effect of institutionalized global governance in practice, which has been declining 
over time. A tentative explanation would be the saturation of the global IGO net- 
works and the decreased effectiveness of new IGOs. The dramatic growth of IGOs 
in the postwar period (Pevehouse et al. 2003; Beckfield 2003) may have reached
the level of declining marginal returns. In addition, newly established IGOs may 
not work as effectively as their earlier counterparts in generating trade relations. 
Table 4: Percentage Change in Bilateral Trade: Overall, in 1950 and in 2000

Columns: Overall (1950-2000), from Model 5 in Table 2; 1950 and 2000, from Model 4 in Table 3 (or Figure 1).
[Some cells were damaged in extraction. Where a row shows fewer than three values, the column assignment is not recoverable.]

                                                        Overall    1950     2000
GDP (the doubling of the product of two countries’ GDP increases their bilateral trade by...)
                                                        98.2       34.9     201.5
IGO (the doubling of the joint IGO memberships between two countries increases their
bilateral trade by...)
                                                        49.2       57.6     19.4
Political Difference (the doubling of the difference in political regime scores increases
bilateral trade by...)
                                                        -1.9       -10.7    4.3
Geographic Distance (the doubling of the distance between two countries increases their
bilateral trade by...)
                                                        -46.8      -41.9    -54.1
Land Contiguity (adjacency on land increases bilateral trade by...)
                                                        94.[illegible]  15.3  127.5
Sea Contiguity (adjacency by sea increases bilateral trade by...)
                                                        54.0       42.5     55.9
Common Language (sharing common language increases bilateral trade by...)
                                                        53.1; [illegible]
Colonial History (sharing colonial historical ties increases bilateral trade by...)
                                                        232 [sic; decimal point likely lost]; 0.9
Common Civilization (belonging to the same civilization increases bilateral trade by...)
                                                        [values illegible]

Notes: All numbers in this table are percentages.
All numbers are statistically significant at the .05 level except that of common
civilization in the overall model.
Based on the setup of the gravity model, the percentage change for GDP, IGO, political
difference and geographic distance is calculated by $(2^{\beta} - 1)$, while the formula for land
and sea contiguity, common language, colonial history and common civilization (all
dummy variables) is $(e^{\beta} - 1)$. $\beta$ is the coefficient in the gravity model.
To avoid clutter, this table only shows numbers for 1950 and 2000. Percentage
changes for other years can be retrieved by using the same formulas.
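
The two conversion formulas in the table notes can be written out as a short sketch. The function names are hypothetical. As a check against the source, the GDP coefficient of .987 reported in Table 2 gives roughly the 98.2 percent shown in the first row of Table 4.

```python
import math


def pct_change_log_regressor(beta):
    """Percentage change in trade when a logged regressor doubles: 2^beta - 1."""
    return (2.0 ** beta - 1.0) * 100.0


def pct_change_dummy(beta):
    """Percentage change in trade when a dummy switches from 0 to 1: e^beta - 1."""
    return (math.exp(beta) - 1.0) * 100.0


gdp_effect = pct_change_log_regressor(0.987)          # GDP, Table 2
regime_diff_effect = pct_change_log_regressor(-0.028)  # political difference
```

The doubling formula follows from the double-log form: if ln(Trade) rises by beta times ln(2) when a regressor doubles, trade itself is multiplied by 2 raised to beta.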


The integration perspective encounters more difficulties in explaining the lin- 
gering influence of other traditional factors in geographic, political and cultural 
dimensions. The continuity and fragmentation theories are more consistent with 
reality. Simply put, distance or difference in geography, politics and culture still
depresses trade and hinders the emergence of a fully integrated global trade system. 
More importantly, in the postwar period the effects of these factors, instead of fading 
away, have intensified in many cases. Although the effect of colonial historical ties 
has been diminishing and the effects of political regime difference and sea contiguity 
have remained largely stable, geographic distance with its related land contiguity and 
common civilization with its related common language have seen their effects on 
trade flows strengthened over time. Global trade may have been fragmenting and,
to be precise, fragmenting along geographic and cultural lines.

This intensified influence of geographic and cultural factors on bilateral trade
calls for a more sophisticated perspective on “gravitational attraction” between 
trade partners. Despite the great improvement in transportation and commu- 
nications technologies and the institutionalization of global governance, when 
choosing trade partners, countries increasingly favor their culturally similar 
counterparts and geographically close neighbors. How does this geo-cultural 
affinity come about against the backdrop of the improved technologies and 
global governance? Theories from the fragmentation perspective provide some 
explanations; however, more empirical research is needed to explore its underly- 
ing mechanisms, and to identify which specific aspects of geography and culture 
are driving up the significance of this affinity, as both geography and culture are 
composed of multiple dimensions themselves (Fujita, Krugman and Venables 
1999; Hofstede 2001). An analysis of trade data at the sector or industry level 
may also help identify which sectors are more sensitive to this geo-cultural af- 
finity. The consequence of this affinity is worth closer attention too. It not only 
fragments global trade along geo-cultural lines at the global level, but also has 
important implications for economic development at the national level. For 
one thing, countries endowed with well-developed geographic “neighbors” and 
wealthy cultural “cousins” may have an edge in developing foreign trade and 
benefiting from economic globalization. This new form of “neighborhood effect” 
calls for more research from both the globalization scholarship and the sociology 
of development. Hence, this article also serves as an invitation for more empiri- 
cal research on the multidimensionality, especially the geo-cultural dimensions, 
within the ongoing economic globalization. 




Notes 
1. The country list is available at http://www.wjh.harvard.edu/~minzhou/Appendix.pdf. 
2. Available at http://faculty.haas.berkeley.edu/arose/RecRes.htm#Trade. 


3. The IGO data here use the following operational coding definition of an IGO: (1) 
An IGO must consist of at least three sovereign states; (2) An IGO must hold regular 
plenary sessions at least once every 10 years; (3) An IGO must possess a permanent 
secretariat and corresponding headquarters. The criteria are also consistent with the 
definition used by the Union of International Associations. 


4. Besides the conventional measure of distance as linear distance between national 
capitals, I also tried linear distance between their largest economic cities. The 
correlation of these two measures is very high (above .91). Modeling with the 
new measure generates similar results, and all findings hold. The rationale behind 
these two measures is as follows: the national capital or the largest economic city 
roughly represents the “economic center of gravity” and captures the “average” point 
of production and consumption in a particular country. Both measures have been 
commonly used by trade economists. 


5. Available at https://www.cia.gov/library/publications/the-world-factbook. 
6. The data were extracted from http://www.cidcm.umd.edu/polity in December 2006. 
Now the data can be found at http://www.systemicpeace.org/polity/polity4.htm. See 
Jaggers and Gurr (1995) for summaries of the Polity Project. 


7. The original score takes on values ranging from -10 to 10. For the purpose of 
logarithmic transformation, the score is readjusted to the range between 1 and 21 by 
adding 11 to the original score. 
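
The rescaling described in note 7 can be sketched in a few lines of Python. This is only an illustration of the arithmetic (shift by 11, then take the natural logarithm); the function name is hypothetical, and the use of the natural logarithm is an assumption, since the note does not state the base.

```python
import math


def shifted_log_polity(score: int) -> float:
    """Shift a Polity score from [-10, 10] to [1, 21] by adding 11,
    then take the natural log (log base is an assumption)."""
    if not -10 <= score <= 10:
        raise ValueError("Polity scores range from -10 to 10")
    return math.log(score + 11)


# The most autocratic score maps to log(1) = 0; the most democratic to log(21).
lowest = shifted_log_polity(-10)
highest = shifted_log_polity(10)
```

The shift guarantees a strictly positive argument, so the logarithm is defined for every score in the original range.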


8. Available at http://www.wto.org/english/tratop_e/region_e/regfac_e.htm. 


9. RTAs often lack one or more of the key features defining IGOs. For instance, many 
RTAs are agreements between two, instead of three or more, countries, and many do 
not hold regular plenary sessions or possess a permanent secretariat and corresponding 
headquarters. 


10. To check the robustness of the findings, I also tried the random effects model and 
the generalized method of moments panel estimator. They produce substantively the 
same results. 


11. Model 2 is not shown in Table 3 due to space limits, but it is available at 
http://www.wjh.harvard.edu/~minzhou/Table3.pdf. 
Effects of Inequality, Family and School on Mathematics 
Achievement: Country and Student Differences 


Ming Ming Chiu, State University of New York at Buffalo 


Inequality, family and school characteristics were linked to student achievement as 
shown by multi-level analyses of 107,975 15-year-olds’ mathematics tests and 
questionnaires in 41 countries. Equal distribution of country and school resources was 
linked to higher mathematics scores. Students scored higher in families or schools 
with more resources (SES, native born, two parents, more educational materials, 
higher SES schoolmates, female schoolmates, class time, educated teachers) or ben- 
eficial intangible processes (communication, discipline, teacher-student relationships). 
Students living with grandparents or siblings (especially older ones) scored lower. 
Physical family resource variables showed similar results across countries, support- 
ing the social reproduction hypothesis for physical resources. In richer countries, 
intangible processes had stronger links with mathematics achievement, suggesting 
that greater availability of public physical resources raises the value of complementary 
intangible processes, which can help explain the Heyneman-Loxley effect of stronger 
family effects in richer countries. 





Students with higher family socio-economic status show higher achievement in 
nearly all countries, but the impact of school characteristics is less clear (Baker, 
Goesling and LeTendre 2002; Heyneman and Loxley 1982, 1983a, 1983b). As 
governments and schools have limited education budgets, developing policies that 
optimize student learning and human capital is essential for long-term economic, 
social and political development (Reich 2001). Moreover, cross-country differ- 
ences in the effects of family and school characteristics suggest that governments 
and school principals must customize their policies and practices to the needs of 
their constituents (Gamoran and Long 2006). Past studies have focused on struc- 
tural inputs such as family structure and teacher qualifications (Baker, Goesling 
and LeTendre 2002; Fuller 1987; Fuller and Clarke, 1994; Heyneman and Loxley, 
1982, 1983a, 1983b). This study extends their work by also analyzing the effects of 
family and school processes (family communication, teacher-student relationship). 
Specifically, I examined how country, family and school characteristics were 
linked to student achievement, by analyzing the mathematics test scores and ques- 
tionnaire responses of 107,975 15-year-old students in 41 countries. This study ex- 
tends past research in three ways. First, both country inequality and country income 
links to student achievement are examined. Second, more diverse measures of family 
and school characteristics (especially processes) are analyzed for correspondence 
with student achievement and for possible differences across countries or across 
schools. Third, this study introduces and tests the hypothesis that in richer countries, 
intangible family and school processes capitalize on these countries’ more abundant 
physical resources to have stronger effects on students’ mathematics achievement. 


Families, Schools, Countries and Student Achievement 
Family Characteristics 


A child’s family members can provide extra resources or compete for them. Family 
members (especially parents) can provide resources and additional learning opportu- 
nities on which the child can capitalize (see figure 1, middle column; Amato 2001; 
Entwisle and Alexander 1995; Horvat, Weininger and Lareau 2003). Families with 
more parents (two vs. one or none) typically have higher SES, more educational 
resources at home (books), and more parent time to engage in beneficial family 
processes (greater parent communication and involvement) (Chiu and McBride- 
Chang forthcoming; Kurdek and Sinclair 1988). In contrast, divorced or separated 
parents have fewer resources and face more challenges in caring for their children, 
who might receive less attention (especially from stepparents). Children who wit- 
ness conflicts between their separated parents might also suffer emotionally, have 
lower academic motivation and show lower academic achievement (Amato 2001). 
Meanwhile, immigrant parents, especially those that speak a foreign language, often 
have less cultural capital to share with their children (cultural communication and 
possessions such as poetry books), which limits their learning opportunities and 
often yields lower academic achievement (Portes and MacLeod 1996). 

Family members who primarily compete for family resources (such as grandpar- 
ents and siblings) might reduce the available resources for a child, yielding fewer 
learning opportunities and less learning (Figure 1; see also Downey 1994). Some stu- 
dents might benefit from their grandparents’ physical, informational, social and emo- 
tional resources, and show higher academic achievement (DeLeire and Kalil 2002). 
However, students who live with poor or ill grandparents compete with them for 
limited family resources (Patillo-McCoy, Kalil and Payne 2003). Furthermore, many 
countries only recently instituted universal education, so undereducated grandpar- 
ents may be the norm. Because the academic achievements of resident grandparents 
and grandchildren are linked, students living with less educated grandparents might 
learn less than other students (Patillo-McCoy, Kalil and Payne 2003). Siblings can 
also compete for family resources, so students with more resident siblings often have 
fewer learning opportunities at home (Downey 1994). While younger siblings com- 
pete for family resources just after birth, older siblings have more family resources and 
learning opportunities compared to younger siblings (Powell and Steelman 1993). 


School Characteristics 


Higher SES students typically attend schools with superior schoolmates, bet- 
ter physical resources and better teachers (Rothstein 2000). By interacting with 
Figure 1. Model of Country and Family Effects on Students’ Mathematics Achievement 

Family 
SES (+) 
1st generation immigrant (-) 
2nd generation immigrant (-) 
Foreign language spoken at home (-) 
Living with no parents (-) 
Single parent (-) 
Blended family (-) 
Resident grandparent (-) 
Number of siblings (-) 
Older siblings (-) 
Number of books at home (+) 
Cultural possessions (+) 
Cultural communication (+) 
Student 
privileged schoolmates, a student benefits from their parental capital, material 
resources, diverse experiences and higher academic expectations (Pritchett 2001; 
Coleman et al. 1966; Jencks et al. 1972; Pong 1997, 1998). These schools often 
have better physical conditions and more educational materials than other schools, 
so students in these schools often do better (Berner 1993; Comber and Keeves 
1973; Fuller 1987; Fuller and Clarke 1994). 

Higher SES students also typically benefit from attending schools with higher 
teacher-to-student ratios and better-qualified teachers (Fuller and Clarke 1994; 
Greenwald, Hedges and Laine 1996; Rivkin, Hanushek and Cain 2005). In 
schools with proportionally more teachers, students often receive more attention. 
Teachers with stronger qualifications typically use teaching processes that are 
more effective, thereby maintaining student discipline and better relationships 
with their students—both of which are linked to higher student achievement (Ma 
and Willms 2004; Rowan, Correnti and Miller 2002; Willms and Somers 2001). 


Country Characteristics 


Richer vs. Poorer Countries 

Students in countries with higher real GDP per 
capita (such as the United States) often benefit 
from the country’s greater resources (Figure 1; 
see also Baker, Goesling and LeTendre 2002). 
Richer countries can raise student learning direct- 
ly through educational spending (books, teacher 
training) or indirectly through higher nutritional 
standards or better health care (UNICEF 2001). 
For example, children in poorer countries (such as 
Albania) often lack basic nutrition, are born pre- 
maturely or face exposure to potentially harmful 
environments (lead poisoning, for example), all 
linked to lower student achievement (Tesman and 
Hills 1994; UNICEF 2001). 

Country wealth might also moderate family or 
school characteristics’ links with student learn- 
ing. As richer countries often have more public 
resources (museums, library books) that can 
substitute for family or school resources, these 
public resources might dilute the importance of 
opportunities provided by families and schools, 
thereby weakening their influence on student 
achievement (Blossfeld and Shavit 1993; Schiller, 
Khmelkov and Wang, 2002). For example, 
school resources are strongly linked to student 
achievement in poor countries, but much less 
so in richer countries where the GDP per capita 
exceeds $16,000 per year (in 1990 U.S. dollars; 
Gamoran and Long 2006; see also Buchmann 
2002; Fuller and Clarke 1994; Heyneman and 
Loxley 1982, 1983a, 1983b). While most schools 
in richer countries have basic educational re- 
sources (teacher, textbook, blackboard), these 
school resources vary far more widely in poorer 
countries (especially schools in Latin American 
countries); greater differences in school resources 

Table 1: Primary Hypotheses Regarding Family Characteristics, Country Characteristics and Mathematics Scores 

Hypothesis                   Condition                       Predicted outcome 
Country Resources 
  Modernization              Richer country                  Weaker family/school links to achievement 
  Social reproduction        Richer country                  Similar family links to achievement 
  Complementary processes    Richer country                  Stronger intangible family/school links to achievement 
Country Inequality 
  Equality achievement       More equal country or school    Higher student achievement 
  Equality dilution          More equal country              Weaker family/school links to achievement 

yield greater differences in student achievement (Alexander 2001; Gamoran and 
Long 2006; Montagnes 2001). 

In a competing view, high SES families use their superior resources to create 
advantages for their children to overcome any modernization effects (such as hir- 
ing tutors for their children). As a result, family characteristics’ links to student 
achievement would be similar across poor and rich countries (Baker, Goesling 
and LeTendre 2002; Hanushek and Luque 2003). However, the data used in 
these studies were mostly from rich countries above the $16,000 GDP per capita 
threshold (Gamoran and Long 2006). (For some scholars, social reproduction 
includes stronger links between family characteristics and student achievement, 
but they do not provide any mechanisms for these stronger links. Thus, social 
reproduction is defined narrowly in this article.) 

Lastly, the widespread availability of public physical resources (public librar- 
ies, museums) in richer countries raises the value of intangible processes (Chiu 
2007). Unlike physical resources (textbooks) or structural inputs (schoolmate 
SES), processes are intangible and dynamic (family communication, teacher- 
student relationship). While divorced parents might proxy for greater commu- 
nication challenges, family SES reflects both physical resources and intangible 
processes (money for books vs. human capital for communication). For example, 
a student benefits from reading an extra book, but that benefit can be magnified 
substantially by discussing it with a parent or a teacher. Indeed, children in richer 
countries (Sweden, the United States) often spend more time with their parents 
due to fewer competing siblings, less parent time on housework and multi-tasking 
parents (Klevmarken and Stafford 1999; Sandberg and Hofferth 2001). Thus, 
family and school processes might be more strongly linked to student achievement 
in richer countries. 


More Equal vs. Less Equal Countries 

Apart from the amount of resources, distribution might affect student achievement. 
Specifically, greater equality within a country might increase overall student 
achievement through diminishing marginal returns or homophily. Consider a 
thirsty woman and two glasses of water. She greatly values the first glass of water 
and drinks it all. Her thirst quenched, she hardly values the second glass of water 
and does not finish it. This lower value of extra resources is diminishing marginal 
returns (Chiu and Khoo 2005). Hence, a poor student with few books likely learns 
more from an extra book than a rich student would. In more equal countries (such 
as Norway), poorer students have more resources and might benefit more from 
them, resulting in higher achievement overall. 
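
The logic of diminishing marginal returns can be made concrete with a small Python sketch. It assumes, purely for illustration, that learning grows with the logarithm of the number of books at home; this functional form, the function names and the book counts are hypothetical and are not estimates from this study.

```python
import math


def learning_from_books(books: int) -> float:
    """Hypothetical concave learning function: log(1 + books)."""
    return math.log1p(books)


def marginal_gain(books: int) -> float:
    """Extra learning from one additional book, given the current count."""
    return learning_from_books(books + 1) - learning_from_books(books)


# A student with few books gains more from one extra book
# than a student who already has many.
poor_gain = marginal_gain(5)
rich_gain = marginal_gain(200)
print(f"gain at 5 books: {poor_gain:.3f}; gain at 200 books: {rich_gain:.3f}")
```

Under any concave function of this kind, moving a book from a book-rich household to a book-poor one raises the total, which is the intuition behind the equality-achievement argument.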

Greater equality might also increase overall student achievement through peo- 
ple’s preference to interact with others of similar SES (McPherson, Smith-Lovin 
and Cook 2001). As a result, students in more equal countries might cooperate 
more often and share more resources, resulting in higher achievement overall. This 
greater cooperation and sharing might also dilute the links between family char- 
acteristics and student achievement in more equal countries (Baker and LeTendre 
2005). Likewise, more equal countries often have more equal distribution of re- 
sources across schools, which might result in less variation in student achievement 
(Gamoran and Long 2006). For simplicity, these weaker links of family and school 
characteristics to student achievement are called the equality dilution hypothesis. 

In short, this study examined country, family and school characteristics and 
their links to student achievement (see hypotheses in Table 1). The modernization, 
social reproduction and complementary processes hypotheses partially compete 
with one another, while the equality-achievement and equality dilution hypotheses 
are independent. As studies have shown that gender and time spent on courses are 
linked to academic achievement, these variables were modeled as well. Boys often 
outscore girls on standardized mathematics exams (TIMSS 1995). Furthermore, 
girls often have better self-discipline than boys and might help constitute school 
cultures of effective discipline and academic learning (Chiu and Chow forthcom- 
ing; Lewis 2001; Skiba et al. 2002). Students who spend more time in class also 
tend to show higher academic achievement (Clark and Linn 2003). 

Other possible hypotheses at the country level include: (1) age of academic 
differentiation, (2) government assistance to children, (3) education system 
centralization, (4) religious context, (5) cultural values, and (6) demographic variables. 
First, family background shows stronger effects on younger children (Blossfeld 
and Shavit 1993). Hence, school systems that separate students at earlier ages 
into different scholastic programs will tend to create more homogeneous groups 
of children within a school (Dustmann 2004). Richer countries can afford more 
school resources to delay the age of differentiation, and more equal countries 
might also tend to reduce the advantage of family background and delay the 
age of differentiation. Second, government spending on students, such as on 
schooling, might provide more educational resources and thereby increase student 
achievement (Chiu 2008). Third, Wößmann (2000) argues that school teachers 
and staff in less centralized school systems adapt to individual student needs more 
often, and raise their academic achievement accordingly. Fourth, the religious 
context of a country might influence student achievement as immigrant children 
from Islamic countries had lower science achievement than their counterparts 
from Christian or non-religious countries (de Heus and Dronkers 2008). Fifth, 
students might score higher in countries with specific cultural values (Hofstede 
2003; Inglehart et al. 2004). Sixth, the demographic context of a country might 
influence the link between family characteristics and student achievement. In their 
study of 11 countries, Pong, Dronkers and Hampden-Thompson (2003) showed 
that children of single parents scored lower in countries with proportionately more 
single-parent families. (As none of these country-level variables were significant, I 
have not discussed them. Results are available upon request.) 
Methods 


Although the data and methodology in this study overlap with other studies, 
the research questions differ substantially. Family and school characteristics were 
tested in this study, rather than gender, learning strategies or self-concept (Chiu, 
Chow and McBride-Chang 2007; Chiu and Klassen 2009, 2010; Chiu and 
McBride-Chang 2006). 


Data 


The Organization for Economic Cooperation and Development’s Program for 
International Student Assessment assessed 107,975 15-year-olds’ mathematics 
achievement and asked students and principals to fill out questionnaires. 
International experts from participating OECD and non-OECD countries de- 
fined mathematics achievement, built assessment frameworks, created test items, 
forward-translated them, backward-translated them and pilot tested them to 
check their validity and reliability (OECD 2002). These experts created math- 
ematics items examining 15 year olds’ ability to understand, use, and reflect on 
mathematics concepts. Participating students completed a two-hour assessment 
booklet and then, a 30- to 40-minute questionnaire. See variables in Table 2. Note 
that the gross domestic product per capita of many of these participating countries 
exceeds the world’s median GDP per capita. 


Methodological Design 


Investigating these research questions across many countries and schools requires 
representative sampling of 15 year olds, precise tests and questionnaire items, and 
suitable statistical models. For each country, OECD (2002) chose 150 representa- 
tive schools based on neighborhood SES and student intake, and then sampled 
35 students from each of these schools (stratified sampling). OECD excluded 
students who were mentally retarded, refused to take the exam, could not physi- 
cally take it, or did not understand the test language (less than 5% of the sample). 
Using suitable weights, OECD (2002) created representative samples of each 
country’s schools and 15-year-old students. 

Using a balanced incomplete block test design, these students were given sub- 
tests (overlapping subsets of all multiple choice and open-ended questions) to re- 
duce both student fatigue and test-learning effects (Baker and Kim 2004). OECD 
fitted the data with a graded response Rasch model, which estimated each test 
item’s difficulty and each student’s test score (Baker and Kim 2004). 

The questionnaire reduced measurement error by using multiple measures for 
each construct and computing a single index from them with a Rasch model (Warm 
1989). As reliability tests showed that participants in all countries responded con- 
sistently to the questionnaire items in each construct, they likely shared similar 
understandings (Brown et al. 2005; OECD 2002; Rindermann 2007; Schulz 2003). 
Multilevel analysis of plausible values was used to model the relationships in 
this nested data (students in schools in countries) because it yielded more precise 
standard errors than ordinary least squares did (Goldstein 1995; Rust and Rao 
1996). Missing questionnaire response data (5%) can reduce estimation efficiency, 
complicate data analyses, and bias results (Peugh and Enders 2004). Thus, Markov 
Chain Monte Carlo multiple imputation was used because it addressed these 
problems more effectively than deletion, mean substitution or simple imputation 
(Peugh and Enders 2004). 


Analysis 


A variance components model tested if the variances were significant at each level. 


$Y_{ijk} = \beta_{000} + e_{ijk} + f_{0jk} + g_{00k}$ (1) 


$\beta_{000}$ is the grand mean intercept of the mathematics score, $Y_{ijk}$, of student i in 
school j in country k. Residuals at the student-, school- and country-levels are 
$e_{ijk}$, $f_{0jk}$ and $g_{00k}$. 
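
To show what a variance components model delivers, the Python sketch below turns the three residual variances (student, school and country) into the share of total variance at each level. The input variances are hypothetical values chosen only so that the shares match the percentages reported in the Results (44, 25 and 31 percent); the function name is likewise an assumption.

```python
def variance_shares(student_var: float, school_var: float, country_var: float) -> dict:
    """Share of total outcome variance located at each level
    of a three-level variance components model."""
    total = student_var + school_var + country_var
    if total <= 0:
        raise ValueError("total variance must be positive")
    return {
        "student": student_var / total,
        "school": school_var / total,
        "country": country_var / total,
    }


# Hypothetical variance components (arbitrary units), not estimates from the data.
shares = variance_shares(student_var=44.0, school_var=25.0, country_var=31.0)
for level, share in shares.items():
    print(f"{level}: {share:.0%}")
```

Non-trivial shares at the school and country levels are what motivate a multilevel model rather than a single-level regression.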

Explanatory variables were entered in sequential sets to estimate the variance 
explained by each set and to show potential multicollinearity across sets (Kennedy 
2004). This study used proxies to control for past achievement. Focusing on the 
smaller time period between past and current achievement (rather than a student's 
entire life) reduced both omitted variable bias and estimation errors due to 
variables (parents’ jobs) whose values change over time (Kennedy 2004). Country 
context might affect family characteristics. As families might choose the schools 
that their children attend, family variables might affect school variables. Hence, 
I entered the variables as follows: past achievement, country, family and school. 

First, I entered a vector of u past achievement proxies: relative grade and 
remedial courses (U). 

$Y_{ijk} = \beta_{000} + e_{ijk} + f_{0jk} + g_{00k} + \beta_{ujk} U_{ijk}$ (2) 

Next, I entered v variables at the country level: GDP per capita and Gini (V). 

$Y_{ijk} = \beta_{000} + e_{ijk} + f_{0jk} + g_{00k} + \beta_{ujk} U_{ijk} + \beta_{00v} V_{00k}$ (3) 


I tested the significance of these predictors with a nested hypothesis test, χ² log 
likelihood (Kennedy 2004). Then, I tested for interaction effects among pairs of 
significant variables in V. Non-significant variables and interactions were removed. 

Next, I added w family variables and gender: SES, first generation immigrant, 
second generation immigrant, foreign language spoken at home, living with no par- 
ents, single parent, blended family, resident grandparent, number of siblings, birth 
order, number of books at home, cultural possessions, cultural communication and 
gender (W). OECD did not collect data on relatives who lived nearby, which 
would have allowed testing of local, non-resident family members. Cultural com- 
munication at home reflects an intangible process. Although families of one parent 
and one step-parent (blended families) need not differ financially from two-parent 
families, blended families might have more challenging interactions (Amato 2001). 
Hence, blended families might serve as a proxy of intangible communication pro- 
cesses. Together, blended families and cultural communication at home help test the 
complementary processes hypothesis. Number of books and cultural possessions are 
physical resources, and the remaining variables aside from gender might reflect both 
physical resources and intangible processes. Interaction terms with country variables 
tested if they moderated links between family characteristics and mathematics scores 
(see Table 1 hypotheses). 

$Y_{ijk} = \beta_{000} + e_{ijk} + f_{0jk} + g_{00k} + \beta_{ujk} U_{ijk} + \beta_{00v} V_{00k} + \beta_{wjk} W_{ijk}$ (4) 


I applied the procedure for V to W. Next, I tested for random effects (Goldstein 
1995), that is, whether the w student-level regression coefficients ($\beta_{wjk} = \beta_{w00} + g_{w0k}$) differed 
significantly at the country-level ($g_{w0k} \neq 0$?). If so, I tested whether they depended 
on the above country characteristics (V). 


$\beta_{wjk} = \beta_{w00} + g_{w0k} + \beta_{w0v} V_{00k}$ (5) 


Then, I added x school variables: schoolmate SES, percentage of girls in school, 
minutes spent in mathematics classes per week, educational resources, percentage 
of mathematics teachers with university degrees, school discipline, classroom 
discipline, and teacher-student relationships (X). All of the variables in X are 
school-level variables except for the last two (classroom discipline and 
teacher-student relationships), which are student-level variables. School discipline, 
classroom discipline and teacher-student relationships are also processes that help 
test the complementary processes hypothesis. Educational resources are physical 
resources, while schoolmate SES, female schoolmates, class time and teachers 
with university degrees are structural inputs that reflect physical resources and 
intangible processes. Then, I applied the procedure for W to X. 


$Y_{ijk} = \beta_{000} + e_{ijk} + f_{0jk} + g_{00k} + \beta_{ujk} U_{ijk} + \beta_{00v} V_{00k} + \beta_{wjk} W_{ijk} + \beta_{xjk} X_{ijk}$ (6) 

To test for diminishing marginal returns, I repeated the above analysis each 
time I replaced a continuous variable with its log (log GDP per capita vs. GDP 
per capita). The variable that accounted for more variance was retained. 
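
The selection rule just described (keep whichever form of a variable explains more variance) can be sketched in Python. For brevity the sketch uses a simple single-level regression with one predictor rather than the multilevel models of this study, and the data are synthetic; the function names and values are hypothetical.

```python
import math


def r_squared(x: list, y: list) -> float:
    """R-squared of a simple linear regression of y on x
    (equal to the squared Pearson correlation)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy ** 2 / (sxx * syy)


def pick_functional_form(x: list, y: list) -> tuple:
    """Compare the raw predictor with its natural log; retain the one
    that accounts for more variance in y."""
    r2_linear = r_squared(x, y)
    r2_log = r_squared([math.log(v) for v in x], y)
    return ("log", r2_log) if r2_log > r2_linear else ("linear", r2_linear)


# Synthetic data in which scores rise with the log of GDP per capita.
gdp = [1000.0, 2000.0, 4000.0, 8000.0, 16000.0, 32000.0]
scores = [400.0 + 20.0 * math.log(v) for v in gdp]
chosen_form, chosen_r2 = pick_functional_form(gdp, scores)
print(chosen_form, round(chosen_r2, 3))
```

A better fit for the logged variable indicates that each additional unit matters less at higher levels, which is the signature of diminishing marginal returns.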

I reported how a 10 percent increase in each continuous variable above its 
mean was linked to mathematics score (result = b * SD * [10% / 34%]; 1 SD ≈ 
34%). As percent increase is not linearly related to standard deviation, scaling is 
not warranted. The mean gain between 9th and 10th grade was 48 points, but the 
standard deviation was very large (SD = 31). Thus, this mean gain of 48 per year 
should be used cautiously when interpreting the results. 
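
The conversion stated in this paragraph (result = b * SD * [10% / 34%]) amounts to the following arithmetic, shown here as a Python sketch. The coefficient and standard deviation in the example are hypothetical and were chosen only to give a round answer; the function name is also an assumption.

```python
def effect_of_10_percent_increase(b: float, sd: float) -> float:
    """Score change linked to moving a predictor 10 percent above its mean,
    treating one standard deviation above the mean as roughly 34 percent:
    result = b * SD * (10 / 34)."""
    return b * sd * (10.0 / 34.0)


# Hypothetical inputs: coefficient of 2.0 and standard deviation of 17.
example_effect = effect_of_10_percent_increase(b=2.0, sd=17.0)
print(f"{example_effect:.1f} points")
```

Because the conversion is a fixed rescaling, it preserves the sign and relative ordering of the coefficients.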

An alpha level of .05 was used. I controlled for the false discovery rate with 
Benjamini, Krieger and Yekutieli’s (2006) two-stage linear step-up procedure, 
as computer simulations showed that their procedure addressed this issue better 
than 13 other methods. 
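
For readers unfamiliar with the two-stage linear step-up procedure, the Python sketch below follows its usual description: a first Benjamini-Hochberg pass at level q/(1 + q) estimates the number of true null hypotheses, and a second pass uses that estimate to set a less conservative threshold. This is a minimal illustration and not the code used in this study; the function names and example p-values are hypothetical.

```python
def bh_reject_count(sorted_p: list, q: float) -> int:
    """Number of rejections from the Benjamini-Hochberg step-up rule
    applied to p-values sorted in ascending order."""
    m = len(sorted_p)
    count = 0
    for rank, p in enumerate(sorted_p, start=1):
        if p <= rank * q / m:
            count = rank  # largest rank meeting the criterion
    return count


def bky_two_stage(p_values: list, q: float = 0.05) -> list:
    """Two-stage linear step-up procedure; returns one reject flag per p-value."""
    m = len(p_values)
    if m == 0:
        return []
    order = sorted(range(m), key=lambda i: p_values[i])
    sorted_p = [p_values[i] for i in order]

    q_stage1 = q / (1.0 + q)
    r1 = bh_reject_count(sorted_p, q_stage1)
    if r1 == 0:
        return [False] * m
    if r1 == m:
        return [True] * m

    m0_hat = m - r1  # estimated number of true nulls
    q_stage2 = q_stage1 * m / m0_hat
    r2 = bh_reject_count(sorted_p, q_stage2)

    reject = [False] * m
    for rank in range(r2):
        reject[order[rank]] = True
    return reject


# Hypothetical p-values, deliberately unsorted.
example_flags = bky_two_stage([0.20, 0.001, 0.04, 0.008], q=0.05)
print(example_flags)
```

In this example the first stage rejects two hypotheses, and the second stage, using the smaller estimate of true nulls, also rejects the test with p = 0.04.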

For robustness, I did the following analyses: (1) predicting achievement without 
prior achievement measures in the same model, (2) two-level regressions (school 
and student levels) for each country with all within-country variables, (3) same 
as (1) but with standardized scores within each country, and (4) same as (2) but 
with standardized scores within each country. The results were similar to those 
reported below. The small sample of countries (N = 41) also limits this analysis’ 
statistical power to identify non-significant country level results (power = .75 
for an effect size of .4 at p = .05). Through analyses of residuals, I also tested for 
influential outliers. 


Results 
Summary Statistics 


This sample included a variety of countries. They ranged from poor, unequal ones 
(Indonesia) to rich, equal ones (Switzerland). See Table 2 for summary statistics 
(see appendix A, Table A1 for correlation-variance-covariance matrices). 


Explanatory Model 


Mathematics scores showed significant variation at all three levels. About 31 percent 
of the difference in students’ mathematics scores occurred at the country level, 25 
percent at the school level, and 44 percent at the student level. Furthermore, country, 
family and school variables all accounted for some of the differences in students’ 
mathematics scores (Table 3). As many variables were controlled, the unique vari- 
ance identified by each explanatory variable was likely more precise and smaller 
than studies with smaller explanatory models (that might suffer more from omitted 
variable bias). Ancillary regressions and statistical tests are available upon request. 


Past Achievement 

Students at a grade level above their expected grade level (indicating that they 
skipped grades or started school early) scored higher in mathematics. Likewise, 
students who attended remedial courses in school scored lower than other 
students (Table 3, Model 1). 


Country Characteristics 
Economic conditions were linked to mathematics scores. If a richer country’s GDP 
per capita exceeded the mean by 10 percent, its students averaged 7 more points 

Table 3: Summaries of 5 Multilevel Regression Models Predicting Students’ Mathematics Scores 


Regressions Predicting Mathematics Scores 


Model 2 


Model 3 








Model 1 
Past Math 
Predictor _Scores 
Relative grade 3200" 
: [aes (46) 
Any remedial course -26.82*** 
| wis) 
logGDPpercapig 
GDP Gini 
SES 


ist generation immigrant nd 

2nd generation immigrant 

Pee Bnotage eae home 
~ Living with no a 

Single parent | 

Blended familyy 

Grendpatant | 

Number of sibli ings 

Birth order 7 

Number of books athome 

Cultural POSSerC NS at home : 

Cultural communication = 

Girl | | 

SchoolmeanSES 

% Girls in school | 

Time spent on math courses 


Log (Educational resources) 





Count 


3250 Ce 


(46) 
"26.83" 
(.53) 
69.46" 
“0. 97) 

471" 
( _ 


% Math teachers with “ pe 


School discipline 


(78) 


7 8808 


- (303) 
Baal 
(1.407) 
-3,892** 


(4.403) 


-9.452*** 
(943) 
-15.29°** 
(1.262) 
A258 
(.679) 
-3.442™ 
"(993) 
“A243 
(634) 


281 


(185) 
2.982"* 
(.244) 


Cures 


(191) 
2.947" 
(.288) 


ss 


(.249) 


-18.11™"* 
(498) 





. Ds ‘agrt 


4335" 


__ (2.67) 


Model 4 Model 5 





School Interactions 


A oS 


(.24) 

8. 96" . 
(.19) 
2.47" 


ey 
18 oT 
(.50) 


(1.04) 
20.37" 


te 
(00). 

5.24" 
( 7 








*(1.85) 
74g" 315 
(51) (.24) 


Table 3 continued 
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Regressions Predicting Mathematics Scores 








Model 1 Model 2 Model 3 Model 4 Model 5 
Past Math 

Classroom discipline . - . 160" Ves 
ti (.25) (20) 
Teacher-student relations Oise on 
: (.24) (.52) 
‘Log GDP per capita 8.98*** 
_ * Relative grade (68) 
Log GDP per capita -11.27°"* 

* Any remedial (.83) 
Log GDP per capita 2147 
/ *SES nota 
Log GDP per capita -10.13*** 
_* Foreign language spoken at home (1.26) 
Log GDP per capita 130" 

_ * Living with no parents 7) 
Log GDP per capita -4.05** 
_* Single parent (1.21) 
‘Log GDP per capita -6.68°"* 

_ * Blended family (1.77) 
Log GDP per capita =“O;00' 

* Grandparent _ (.96) 
Log GDP per capita 1A 
_ * Birth order (.40) 

Log GDP per capita Zaye’ 

* Cultural communication (.37) 
Log GDP per capita -2.65°* 
e Gin (.79) 
Log GDP per capita -15.48"* 

* % Girls in school (4.90) 
Log GDP per capita 304" 

_ * School discipline (he 
Log GDP per capita 10°" 

* Teacher-student relations 38 
% Variance at Level Explained Variance at Each Level 
Country (31%) 14 65 64 0 foie 
School (25%) .20 19 45 63 64 
Student (44%) .06 06 11 “3 14 
Total 12 28 .36 38 40 


OL) | Sh SOO Ss 


N = 107,975 


Notes: Standardized coefficients, and standard errors (in parentheses). 


Each regression included a constant term. 
Blended family = one parent and one stepparent 
m= .05 “p<.01 “"p<.001 
(~ 15% of the mean gain from a year of schooling, 15% = 7 / 48; 7 = ln (1+10%)
* 69.46; log variable computation, ln (1+10%) * b; 48 = mean gain per school
year; Table 3, Model 2). This was the largest country-level link to mathematics
achievement. Compared to GDP per capita, log GDP per capita accounted for
more variance in mathematics scores, showing diminishing marginal returns. If a
less equal country’s Gini exceeded the mean by 10 percent, its students averaged 4
points lower in mathematics (-4 = -1.71 * 8 * [10% / 34%] = b * SD * [10% / 34%];
Table 3, Model 2). Log GDP per capita and Gini accounted for 51 percent of the
cross-country differences in mathematics scores and 16 percent of the total variance.
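
The two conversions used above, ln (1+10%) * b for a log-transformed predictor and b * SD * [10% / 34%] for a linear one, can be checked with a brief sketch. The 34% divisor is read here as the share of a normal distribution lying between the mean and one standard deviation above it; that reading is an assumption, as are the function names, which do not come from the study.

```python
import math

# Assumed reading of the "34%" divisor: share of a normal distribution
# between the mean and one standard deviation above it.
PCT_WITHIN_ONE_SD = 0.34
# Mean gain per school year in score points, as stated in the text.
YEAR_GAIN = 48


def log_effect(b, pct_increase=0.10):
    """Score change when a log-transformed predictor rises by pct_increase."""
    return math.log(1 + pct_increase) * b


def sd_effect(b, sd, pct_increase=0.10):
    """Score change for a linear predictor, using the b * SD * (10% / 34%) rule."""
    return b * sd * (pct_increase / PCT_WITHIN_ONE_SD)


gdp_points = log_effect(69.46)                 # log GDP per capita, Model 2
gini_points = sd_effect(-1.71, 8)              # Gini: b = -1.71, SD = 8
share_of_year = round(gdp_points) / YEAR_GAIN  # 7 / 48

print(round(gdp_points, 2), round(gini_points, 2), round(share_of_year, 2))
```

With the coefficients quoted in the text, the sketch gives roughly 6.6 points for GDP (reported as 7), roughly -4 points for Gini, and about 15 percent of a school year's gain.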


Family Characteristics 
Family characteristics had many of the strongest links to students’ mathematics scores. 
Students whose SES exceeded the mean by 10 percent averaged three more points 
than other students (= 8.83 * 1.04 * 10% / 34%). Native-born students outscored 
first-generation immigrants, second-generation immigrants and students who spoke 
a foreign language at home by 8, 4 and 9 points respectively (Table 3, Model 3). 
Students with two parents outscored those living without parents, with single parents, 
or with blended families by 15, 5 or 3 points respectively. The gap between living with 
two parents and living with no parents was the largest family link to mathematics 
achievement (~ 31% of the mean gain from a year of schooling, 31% = 15/ 48). 
Students living without grandparents averaged 13 more points in mathematics
than those living with grandparents. Students with more siblings averaged 3 fewer 
points in mathematics per extra sibling. Furthermore, students who were born 
earlier than their siblings often had higher mathematics scores. 


Family Physical Resources 

Students’ physical resources at home (books and cultural possessions) were also 
linked to mathematics scores. Students who had more books at home often scored 
higher in mathematics. Meanwhile, students whose cultural possessions exceeded 
their means by 10 percent averaged 1 more point (1 = 2.95 * .98 * 10% / 34%). 


Family Intangible Processes 

Students’ intangible processes (blended family proxy, cultural communication) 
were likewise linked to mathematics scores. Students with two parents outscored 
those living with blended families by 3 points. Meanwhile, students whose cul- 
tural communication exceeded their means by 10 percent averaged 1 more point 
(1 = 3.57 * .98 * 10% / 34%). Together, family characteristics accounted for an 
extra 8 percent of the variance in students’ mathematics scores. 


Gender 


Boys outscored girls by 18 points in mathematics. Gender only accounted for an 
extra .002 of the variance in students’ mathematics scores. 
School Characteristics

Compared with family characteristics, school characteristics had weaker links to 
students’ mathematics scores. Students whose schoolmates’ SES exceeded the 
mean by 10 percent scored 9 more points in mathematics (9 = 43.35 * .69 * 10% 
/ 34%), the largest school link to mathematics achievement (~ 19% of the mean 
gain from a year of schooling, 19% = 9 / 48). Meanwhile, students in schools 
with 10 percent more girls, 10 percent more time in math classes, or 10 percent 
more educational resources than their respective means scored 2, 2, or .5 more 
points (2 = 20.37 * 10%; 2 = .08 * 74.1 * 10% / 34%; .5 = ln (1+10%) * 5.24).
Not counting schoolmates, time spent in mathematics classes had the strongest
school link to mathematics achievement (10% more classroom time is linked 
to ~ 4% of the mean gain from a year of schooling, 4% = 2 / 48). Compared to 
educational resources, log (educational resources) accounted for more variance 
in mathematics scores, showing diminishing marginal returns. In schools with 
10 percent more mathematics teachers with university degrees, students scored 1
more point (= 11.19 * 10%).
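
The diminishing marginal returns implied by the log specification can be illustrated with a small sketch. It uses the coefficient of 5.24 on log (educational resources) quoted above; the baseline resource levels (1 and 10) are hypothetical index values chosen only for illustration, and the function name is not from the study.

```python
import math

# Coefficient on log(educational resources) quoted in the text.
B_LOG_RESOURCES = 5.24


def gain(x_from, x_to, b=B_LOG_RESOURCES):
    """Predicted score change when resources move from x_from to x_to."""
    return b * (math.log(x_to) - math.log(x_from))


# The same 10 percent increase yields the same gain at any baseline.
pct_gain_low = gain(1.0, 1.1)
pct_gain_high = gain(10.0, 11.0)

# The same absolute increase (+1 unit) yields less at a higher baseline.
unit_gain_low = gain(1.0, 2.0)
unit_gain_high = gain(10.0, 11.0)

print(round(pct_gain_low, 2), round(pct_gain_high, 2))
print(round(unit_gain_low, 2), round(unit_gain_high, 2))
```

In this sketch a 10 percent increase is worth about half a point at either baseline, whereas one additional unit of resources is worth several points at the low baseline and only about half a point at the high one.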


School Intangible Processes 

Students whose perceived school discipline, classroom discipline or relationships 
with their teacher were 10 percent higher than their respective means scored 2, .5 
and 1 more point (2 = 7.48 * 1.06 * 10% / 34%; .5 = 1.60 * .99 * 10% / 34%; 
1 = 3.19 * 1.02 * 10% / 34%). Together, school characteristics accounted for 
an extra 2 percent of the variance in students’ mathematics scores (schoolmate 
characteristics accounted for only .1 percent of the variance). Thus, the ratio of 
variance explained by school characteristics over that of family characteristics is 
25 percent (= 2% / 8%). 


Complementary Processes and Social Reproduction Hypotheses 

Nearly all the intangible processes or their proxies (blended family, cultural com- 
munication at home, school discipline and teacher-student relations, but not 
classroom discipline) had stronger links to mathematics scores in richer countries, 
supporting the complementary processes hypothesis. Meanwhile, all of the physical
resources (books and cultural possessions at home, educational resources at
school) had similar effects in richer and poorer countries, supporting the social
reproduction hypothesis for physical resources.

Most of the remaining family and school characteristics’ links with mathematics
scores were also stronger in richer countries. The positive links between relative
grade, family SES, birth order and mathematics scores were stronger in richer
countries. Likewise, the negative links between remedial courses, living with no 
parents, single parents, grandparents, foreign language spoken at home, female and 
mathematics score were stronger in richer countries. Only the positive link between 
percentage of girls at school and mathematics score was weaker in richer countries. 
Equality Dilution Hypothesis

Neither family nor school characteristics showed significant interaction effects 
with country inequality (Gini) on students’ mathematics scores, showing no sup- 
port for the equality dilution hypothesis. Interactions among other variables were 
non-significant and/or accounted for less than .001 of the variance without sub- 
stantially changing the results. 


Differences across Countries and Schools 


Family SES, schoolmate SES and gender showed similar results across countries, 
but other results differed substantially. Students with higher family SES or higher 
SES schoolmates scored higher in mathematics in most countries (93% in both 
cases). Boys also outscored girls in mathematics in 85 percent of the countries. 

Family and school characteristics’ links with mathematics score often differed 
across richer vs. poorer countries. Students with the following intangible processes 
or proxies had higher mathematics scores in some countries, mostly richer ones, 
supporting the complementary processes hypothesis (see tables 4 and A3; blended 
family: 20%; cultural communication at home: 44%; school discipline: 56%; 
teacher-student relations: 49%). All of the physical resources had similar likeli- 
hoods of positive links across richer or poorer countries, supporting the social 
reproduction hypothesis for physical resources (books: 95%; cultural possessions: 
34%; educational resources at school: 29%). 

Native students with two parents outscored other students in mathematics in 
less than half of the countries, primarily richer ones (first generation immigrant: 
32%; second generation immigrant: 32%; foreign language at home: 44%; single 
parent: 34%; living with no parents: 49%). For these immigrant status and fam- 
ily structure variables, the large effect sizes in rich countries and non-significant 
effects in poorer countries averaged out to moderate effect sizes in the 3-level 
analysis of all countries. Also, students with residential grandparents or older 
siblings scored lower than other students in some countries, mostly in richer ones 
(68%, 34%). Only the positive link between percentage of girls in school and 
mathematics score occurred more often in poorer countries (37%). These variables’ 
links to mathematics scores did not differ much in more vs. less equal countries.

Within each country, these variables’ links with mathematics scores did not vary 
much across schools. Each variable’s effect size differed significantly across schools 
in less than 10 percent of the countries. For each significant variable in each 
country, all effect sizes for each school had the same sign. Furthermore, analyses 
of the residuals showed no influential outliers. 


Discussion 


Past studies showed that students in wealthier countries or with higher family SES 
had higher achievement, but the impact of school resources was mixed (Baker, 
Goesling and LeTendre 2002; Heyneman and Loxley 1982, 1983a, 1983b). 
Gamoran and Long (2006) accounted for the mixed results by showing that
school resources’ links with student achievement were much smaller after a country’s
income exceeded $16,000 per year, as the variance of school resources was
much lower (more equal) in these richer countries. This study extends this line 
of research by also analyzing country inequality, family and school processes. 
Students in more equal countries (lower Gini) scored higher in mathematics 
(supporting the equality achievement hypothesis). In addition to educational
resources and structural inputs at home and at school, family and school processes 
were significantly linked to mathematics scores in many countries, especially richer 
ones. These links did not differ with respect to country inequality, showing no 
support for the equality dilution hypothesis. 


Country, Family and School Characteristics 


Country, family and school characteristics were all linked to mathematics scores. 
Students in richer or more equal countries had higher mathematics scores, with 
country GDP per capita showing the largest country-level link to mathematics 
achievement. As richer countries often have more public resources that serve as extra
learning opportunities for their students, these students often scored higher than
students in poorer countries. Moreover, log GDP per capita and log (educational resources
at school) showed a better fit to the data than their linear counterparts, suggest- 
ing that extra country or school resources yield diminishing marginal returns. In 
countries with more equal distributions of income, students also scored higher. 
The last three results support the equality-achievement hypothesis. 

Family SES, immigrant status, structure and processes were all linked to math- 
ematics achievement. Students with higher SES, who were native born, speaking the 
national language at home, living with two parents, without residential grandparents 
or with fewer siblings (especially older ones) scored higher in mathematics than 
other students, consistent with past studies (Amato 2001; Entwisle and Alexander 
1995; Portes and MacLeod 1996). The largest family link to mathematics achieve- 
ment was living with two parents rather than with no parents. Moreover, students 
with more books, cultural possessions or cultural communication at home scored 
higher in mathematics than other students. The latter results suggest that even for 
poor families, their purchases and involvement can help their children learn more, 
either directly from these resources and processes or indirectly by highlighting their 
importance and thereby motivating their children to learn (Chiu 2008). 

School structures and processes were also linked to students’ mathematics scores. 
Students with more advantageous schoolmates (higher SES) or more school re- 
sources (class time, educational materials, mathematics teachers with university 
degrees) had higher mathematics scores, consistent with past studies (Clark and 
Linn 2003; Coleman et al. 1966; Fuller and Clarke 1994; Greenwald, Hedges and 
Laine 1996). Schoolmate SES had the largest school link to mathematics achieve- 
ment. Not counting schoolmates, time spent in mathematics classes had the largest 
school link to mathematics achievement. Note that these significant school results
contrast with the non-significant results in Baker, Goesling and LeTendre’s (2002)
study, which had different school variables (including principal hours and percentage
of experienced teachers, 5+ years). Furthermore, students who enjoyed beneficial
school processes (school or classroom discipline, teacher-student relationships) also
scored higher in mathematics. This last set of results shows that school processes
can help explain why student achievement differs in schools with similar levels of
resources. Consistent with Heyneman and Loxley’s (1982, 1983) results, the differences
in mathematics scores among the relatively rich countries in this study were
explained more by family characteristics than by school characteristics.


Interactions across Countries 


The results did not support the modernization or equality dilution hypotheses, but 
they supported the social reproduction and complementary processes hypotheses. 
Except for female schoolmates, most family and school characteristics showed
stronger links to mathematics achievement in richer countries, thereby rejecting the
modernization hypothesis. As this data set under-represents the poorest countries,
however, this result must be interpreted with care (cf. Gamoran and Long 2006). Similarly,
family and school characteristics’ links to mathematics achievement did not differ 
significantly in more vs. less equal countries, thereby showing no support for the 
equality dilution hypothesis. 

The two physical family resource variables showed similar results in both rich 
and poor countries, supporting the social reproduction hypothesis regarding 
physical resources (subject to the same caveat as the modernization hypothesis 
interpretation, Gamoran and Long 2006). In contrast, both family processes 
showed different results in poor vs. rich countries. Of the family variables re- 
flecting some intangible processes, only sibling size showed similar results across 
countries. Hence, the results support the social reproduction hypothesis regarding 
physical resources, but not processes. 

Nearly all family and school processes had stronger links with mathematics 
achievement in richer countries. In contrast, no physical resource variable had 
stronger links with mathematics achievement in richer countries. Both sets of 
results support the complementary processes hypothesis that students reap greater 
benefits from their intangible processes in richer countries which have abundant 
and available physical resources. Immigrant status, family SES and family struc- 
ture variables with both physical resource and intangible process aspects also 
showed more significant links and often stronger links in richer countries. Note 
that country income interaction variables with family structure had moderate ef- 
fect sizes, but other interaction variables had small effect sizes. Nevertheless, these 
results suggest support for the complementary processes hypothesis.

Together, these results point to the importance of both physical resources and 
intangible processes to students’ achievement. Differences in privileged and disadvantaged
students’ physical resources accounted for part of their achievement gap in
both rich and poor countries. Meanwhile, intangible processes were linked to larger 
achievement gaps in richer countries, which can account for Heyneman and Loxley’s 
(1982, 1983) puzzling results that family effects are stronger in richer countries. 


Research and Policy Implications 


If these results are replicated in future studies, they highlight resource distribution, 
process variables and cross-country comparisons as key foci for research and policy. 
GDP per capita, educational resources at school and their distributions were all 
linked to student achievement, showing the importance of both the amount and 
the distribution of a given resource. Likewise, family and school processes were 
also linked to student achievement. 

The cross-country differences in explanatory variables’ links to student achieve- 
ment highlight the importance of proposing and testing hypotheses that can sys- 
tematically explain these differences across countries. Thus, researchers can model 
student achievement more precisely by examining distributions of resources, pro- 
cess variables and systematic differences across countries. 

The links between unequal resource distribution, physical resources, intan- 
gible processes and student achievement across countries suggest several pos- 
sible policy implications. First, the results imply that greater equality is linked to 
higher student achievement at both the country and school levels, suggesting that 
government and school leaders can improve student achievement through equal 
distribution policies as well as economic growth or larger budgets.

Both physical resources and intangible processes were also linked to student 
achievement, highlighting the importance of parents and educators attending to 
both aspects when helping their children and students learn. Furthermore, the 
similar effects of family and school physical resources across countries suggest that 
economic growth alone will not reduce the achievement gap between privileged 
and disadvantaged students. On the contrary, the stronger links of family and
school processes with achievement in richer countries suggest that economic growth might widen the
achievement gap between privileged and disadvantaged students due to intangible 
processes. To reduce this achievement gap, government and school leaders in richer 
countries can provide disadvantaged students with physical resources, and more 
importantly, intangible resources. In contrast, educators in poorer countries can 
focus on physical resources and structural inputs to improve student learning. 


Limitations and Future Research 


This study had some limitations. First, the students sampled were not fully representative
of all 15-year-olds. Students with very low mathematics achievement
or very poor children might not attend school (UNICEF 2001). Second, this
correlational study does not warrant causal interpretations. Third, the reliabilities
of cultural possessions and cultural communication were low, so results involving
these two measures should be interpreted cautiously. Fourth, remedial courses and
relative grade are rough proxies of a student’s past achievement, so results involving
these measures should likewise be interpreted cautiously. Fifth, the absence
of other possibly relevant variables might result in omitted variable bias. Future
research can address these issues.


Conclusion 


This study demonstrated how country, family and school characteristics were 
linked to mathematics scores. In richer countries or schools with more educational 
resources, students had higher mathematics scores, but with diminishing marginal 
returns. In countries with more equal distributions of income, students scored 
higher in mathematics. Together, these three results support the hypothesis that 
equal distribution of resources improves overall student learning. 

Both resources and processes were also linked to mathematics scores. Students in 
families with more resources (family SES, two parents, native born, books or cultural 
possessions) had higher mathematics scores. In contrast, students with more family 
members who might compete for limited family resources (resident grandparents or 
siblings, especially older ones) had lower mathematics scores. Students with more 
advantageous schoolmates (higher SES) or school resources (class time, educational 
materials, mathematics teachers with university degrees) had higher mathematics 
scores. Furthermore, students who enjoyed beneficial family processes (cultural
communication) or school processes (school and classroom discipline, teacher-student
relationships) also scored higher in mathematics.

Many family and school characteristics’ links with mathematics scores varied 
across countries. Physical family resource variables (books, cultural possessions) 
showed similar results in both rich and poor countries, supporting the social 
reproduction hypothesis regarding physical resources. Meanwhile, intangible 
processes and proxies (blended family, cultural communication, school discipline 
and teacher-student relations) had stronger links with mathematics achievement 
in richer countries. This result supports the hypothesis that greater availability of 
public physical resources raises the value of complementary intangible processes, 
which helps account for Heyneman and Loxley’s (1982, 1983) puzzling results 
that family effects are stronger in richer countries. 
Understanding Economic Justice Attitudes in Two
Countries: Kazakhstan and Kyrgyzstan 


Azamat K. Junisbai, Pitzer College 


Analyzing data from the 2007 Kazakhstan and Kyrgyzstan Inequality Survey, I iden- 
tify and compare the determinants of economic justice attitudes in two formerly 
similar majority-Muslim nations that are now distinguished almost exclusively by 
their dissimilar economic circumstances following the collapse of the Soviet Union. In 
Kazakhstan, where the economy is growing rapidly, the important factors predicting 
economic egalitarianism are connected to people’s perceived ability to do well in the 
future. In contrast, in Kyrgyzstan, which has stagnated in the post-Soviet era, people’s 
immediate economic vulnerability predicts egalitarianism, while their economic pros- 
pects are irrelevant. Finally, the effect of several factors on support for egalitarianism 
appears impervious to the prevailing economic winds: religious orthodoxy, the urban 
vs. rural divide, and membership in a historically privileged ethnic group. These
patterns reflect both the commonalities in the two countries’ histories, demography, 
and religion and their divergent economic trajectories since the collapse of the USSR. 





Introduction 


Sociologists have long been interested in understanding popular attitudes about 
economic justice. Yet, until recently, most studies have been limited to Western 
democracies with predominantly Christian populations. Given that most of the 
world’s population resides elsewhere, large gaps in our knowledge remain. This 
study seeks to contribute to the literature on economic justice by analyzing data 
from nationally-representative surveys fielded by the author in Kazakhstan and 
Kyrgyzstan in 2007. Not only are these two post-Soviet Central Asian countries 
poorer than those in the West, they are also authoritarian, predominantly Muslim 
(Sunni branch), and rapidly transforming into capitalist economies after nearly 
seven decades of state socialism. These factors combine to make them unique 
among the countries in which similar studies have been conducted. 

While Kazakhstan and Kyrgyzstan share important characteristics, they differ 
significantly in one crucial respect. Because of the two countries’ very different 
natural resource endowments, their previously similar economic trajectories began 
to diverge after the collapse of the Soviet Union in 1991. Due to an abundance 
of natural resources, including oil, gas and light metals, Kazakhstan has achieved
a level of economic development far surpassing that of its neighbor. In contrast,
Kyrgyzstan was ill-prepared for the economic burdens of independence
and remains so today. The two countries’ official external debt highlights their 
divergent economic fortunes. In 2000, the government of Kazakhstan repaid its 
International Monetary Fund loans in full, seven years ahead of schedule. In con- 
trast, by the early 2000s, the government of Kyrgyzstan’s external debt was equal 
to the country’s GDP (IMF and World Bank 2006). By late 2006, Kyrgyzstan
had become eligible for debt relief under the World Bank’s Heavily Indebted Poor 
Countries initiative. In 2006, GDP per capita in Kazakhstan was $5,043, but 
only about one-ninth as much—$536—in Kyrgyzstan (United Nations Statistics 
Division 2008), and the percentage of poor and unemployed in Kazakhstan was 
less than half that of Kyrgyzstan (Central Intelligence Agency 2008). In fact, in 
recent years, along with Russia, Kazakhstan has become the destination of choice 
for tens of thousands of Kyrgyz migrant workers. 

This dramatic divergence in the economic fortunes of two otherwise similar 
societies provides a quasi-experimental setting for examination of economic justice
attitudes. Thus, the goals of this investigation are two-fold: (1) to identify
and contrast the factors that drive egalitarian attitudes in two countries which are
distinguished almost exclusively by their dissimilar economic circumstances, and
(2) to evaluate how well patterns found elsewhere hold in these two post-Soviet,
majority-Muslim nations. 


What Do We Know About Economic Justice Attitudes? 


To date, our understanding of economic justice attitudes and their predictors 
has been overwhelmingly based on investigations conducted in Western democracies
(e.g., Robinson and Bell 1978; Hochschild 1981; Svallfors 1997; Davis and
Robinson 1999; Andress and Heien 2001; Arts and Gelissen 2001; Blekesaune 
and Quadagno 2003). This body of work has recently been complemented 
with research based on newly available data from seven Muslim-majority na- 
tions (Davis and Robinson 2006). Given the abundance of studies that have 
been conducted, there has been a striking lack of research on this subject in the 
massive post-Soviet bloc. Davis and Robinson's (1999) study of European coun- 
tries included six post-communist countries surveyed in the 1991 International 
Social Survey Program. The International Social Justice Project survey, origi- 
nally fielded in 1991 and replicated in 1996, represents the last available data 
containing extensive measures of economic justice and perceived fairness of the 
economic system in the post-communist states (Kluegel, Mason and Wegener 
1995; Mason and Kluegel 2000; Kluegel and Mason 2004). However, even the 
ISJP did not include former Soviet republics with majority Muslim populations. 

An overview of published empirical work on principles of economic justice in 
Western societies reveals several recurring themes. In advanced capitalist nations, most 
people prefer that valued resources be distributed according to the principle of equity
or desert (i.e., contribution) rather than the principle of equality (i.e., need). When
asked about the main determinants of the existing pattern of resource distribution, 
people cite education, occupation and effort. When asked to choose between equal 
and unequal distributions of income in society, most people opt for the inegalitarian 
position, pointing to desert, incentives and beliefs about human nature as the princi- 
pal justifications for preserving an unequal distribution (Miller 1992). This preference 
is strongest in countries with liberal welfare states, such as the United States, weaker in 
countries with conservative welfare states, such as Germany, and weaker still in social 
democratic nations like Finland and Norway (e.g., Svallfors 1997). 

There is consistent evidence across all societies studied that the most vulner- 
able members of the population are the most likely to hold egalitarian notions of 
economic justice (Miller 1992; Linos and West 2003). Robinson and Bell (1978), 
analyzing data from the United States and Great Britain in the mid-1970s, referred 
to this as the underdog principle. That is, more vulnerable individuals are more 
likely to favor equality, given that greater equality would benefit them. Robinson 
and Bell found that underdogs —ethnic minorities (nonwhites), those employed in 
low prestige occupations, and those with low family incomes—were more likely to 
hold egalitarian attitudes. Correspondingly, Kelley and Evans's (1993) analysis of 
ISSP data finds that younger people, those with low socioeconomic status, those 
with lower levels of education, the poor and women were more likely to support 
the idea of more egalitarian income distribution. 

Robinson and Bell (1978) hypothesized that young people in the United States 
who had reached political maturity during the 1960s should be more egalitarian 
in their attitudes than older people. This was a period in which there was a “re- 
discovery of inequality” and when movements of blacks, women and students for 
greater equality and rights were in their heyday. Robinson and Bell refer to this 
phenomenon as the egalitarian Zeitgeist principle. 

In addition to objective factors, such as various markers of underdog status, 
previous studies have found that subjective beliefs about the nature of society 
affect attitudes toward redistribution (Linos and West 2003; Luo 1998; Fong 
2001; Funk 2000). For example, analyzing 1992 ISSP data from the United 
States, Norway, Germany and Australia, Linos and West (2003) find that “beliefs 
about mobility” (scales of averaged responses to questions regarding factors most 
important for getting ahead in society) significantly affect popular attitudes to- 
ward redistribution. Those who believed their society to be fair (e.g., do not think 
that connections or rich parents are important for getting ahead) were less likely 
to have egalitarian positions on redistribution. Robinson and Bell (1978) also found
that in the United States belief in the American Dream (i.e., the belief that one 
will become successful in the future) significantly dampened egalitarian attitudes. 

Exploring the relationship between economic justice beliefs and religion, Davis 
and Robinson’s (1999, 2006) moral cosmology theory challenges the conventional 
view that religious modernists are associated with the political left and religious 
traditionalists with the political right. Davis and Robinson hypothesize that mod- 
ernists, who hold the theologically individualistic belief that it is up to individuals 
to determine what is right and wrong and to make their own fates, are economi- 
cally individualistic in holding the poor and unemployed as responsible for their 
economic misfortune. The religiously orthodox, the authors argue, are theologi- 
cally communitarian in believing that the community must uphold divinely- 
mandated laws on what constitutes good and evil. For the orthodox, economic 
beliefs are also communitarian, entailing “watching over” community members 
to ensure that everyone’s economic needs are taken care of (2006:169). Analyzing 
data from 21 European countries and Israel (1999), they found that attitudes 
consistent with economic egalitarianism were more likely to be found among 
religious traditionalists than among religious modernists. Davis and Robinson 
(2006:167) further tested the relationship between religion and attitudes about 
economic justice by analyzing WVS data from seven majority-Muslim nations, 
finding that Islamic orthodoxy, measured as the desire to establish Islamic law 
(the sharia), is associated with “broad economic communitarianism” in each of 
the seven countries. It thus appears that religious traditionalists, regardless of the 
faith tradition in which they are situated, are more likely to hold egalitarian beliefs 
than their modernist counterparts. 

While there has been no systematic research on economic justice attitudes 
in Central Asia, a few studies have included Central/Eastern Europe or Russia. 
Analyzing 1991 Polish survey data, Zagorski (1994) finds that the strongest pre- 
dictor of egalitarian attitudes is the expectation of no future improvement in the 
standard of living, rather than the respondents’ evaluation of their current living 
conditions. This finding is consistent with Robinson and Bell’s (1978) results 
in the United States, where expectations of future financial success, rather than 
one’s current economic situation, mattered in determining people’s economic 
justice attitudes. 

Kluegel et al.’s (1995) analysis of 1991 ISJP data shows Russians and Eastern 
Europeans to be more supportive of active government intervention in the econ- 
omy than people in Western Europe, Japan or the United States. Egalitarianism, 
support for a strong government role in the economy, and deep skepticism about 
merit-based distribution systems were widespread. Most respondents supported 
the idea that resources should be distributed primarily on the basis of need, rather 
than desert. Consistent with findings from Western countries, women and people 
with lower levels of education and income were found to be more egalitarian. 

Mason et al. (2000) find that between the two waves of their ISJP surveys in 
Russia and Eastern Europe—1991 and 1996—enthusiasm for market justice faded, 
while support for egalitarian economic arrangements grew. The authors suggest that 
widespread corruption and problems with privatization produced negative beliefs 
about the fairness of the market economy, reducing support for market justice 
principles. In Russia, unlike the other countries in the ISJP survey, support for 
egalitarian principles was not limited to retirees and the elderly. Most Russians—ir- 
respective of age—felt that connections and access, rather than effort, determined 
one’s economic trajectory and were skeptical about wealth and the wealthy. In 
the 1996 ISJP data, attitudes toward market justice were again found to be shaped 
by personal economic experience, as well as evaluations of one’s present and future 
standard of living. In an interesting parallel to findings from the United States and 
Great Britain (Luo 1998; Fong 2001; Funk 2000), Mason et al. (2000) found a sig- 
nificant and increasingly positive relationship between the perceived fairness of the 
existing economic order (measured as perceived causes for poverty and wealth) and 
inegalitarian economic attitudes. Thus, the perception that the existing economic 
system is fair reduced support for egalitarian economic principles. 


Kazakhstan and Kyrgyzstan after the Fall of the USSR: 
Commonality and Divergence 


Despite its many serious flaws, the Soviet Union did provide ordinary citizens with a 
relatively high degree of economic security and social protection. “Total job security 
and price stability, as well as the guarantee of all basic benefits... did become an 
inviolable norm of Soviet way of life.” (Zaslavsky 1995:49) Integral to the Soviet way 
of life were beliefs shared by Soviet citizens, namely, that the state ought to provide 
full employment, free education and health care, as well as an array of social services, 
and that it must prevent the emergence of economic inequality. 

Western politicians and academics were not the only ones caught off guard by 
the sudden fall of the USSR in late 1991. When the dust settled, entire popula- 
tions of the newly independent states found themselves in a strange new world in 
which the rules of social, political and economic organization had been fundamen- 
tally transformed. Almost overnight, egalitarian ideals were officially discarded and 
the long taken-for-granted social safety net disintegrated. The collapse meant that 
entire populations, with the exception of the “top political and economic elite... 
[who] were able to convert power over resource allocation into ownership of 
important assets,” (Dudwick et al. 2003:22) experienced a dramatic deterioration 
in their quality of life. People from all walks of life who had previously been em- 
ployed, housed and socially integrated into their communities were plunged into 
poverty, as the restructuring economy shed thousands of jobs, and hyperinflation 
wiped out families’ lifelong savings in a matter of months. The sheer scale and the 
abruptness of the transformation that followed the collapse of the Soviet state have 
few if any precedents in modern history (Mason and Kluegel 2000). 

The effects of market transition have been particularly acute in Central Asia, 
traditionally the poorest and most heavily subsidized part of the Soviet Union. 
As new states without prior history of either political or economic independence, 
Kyrgyzstan and Kazakhstan experienced severe economic shocks. The govern- 
ments of both had to dramatically cut back on all types of welfare programs. Free 
medical care, education and housing, taken for granted during the Soviet period, 
had all but disappeared. Overall, people in rural areas suffered the worst 
deterioration of their living standards. Even in relatively more economically advanced 
Kazakhstan, in 2001 agricultural workers earned only 29 percent of industrial 
wages, down from 78 percent in 1991 (United Nations Development Program 
2002). In a little over a decade, societies that had long enjoyed the world’s lowest 
levels of economic inequality became as unequal as some of the most inegalitarian 
Latin American nations (Mikhalev 2003). The economic transformation left many 
people bewildered and disoriented, because what they saw as their governments’ 
failure to meet traditional responsibilities violated the very ideals of economic 
justice that had been internalized during the preceding decades. Combined with 
the absence of viable democratic institutions, the disintegration of the Soviet-era 
social safety net and the resulting widespread impoverishment have been so grave 
as to make some observers of the region wary of the rise of Islamic fundamentalism 
in these formerly secular societies (Rashid 2003). 

Yet, by the early 2000s, the two countries’ vastly different natural resource 
endowments meant that their citizens, who had enjoyed essentially identical standards 
of living under the USSR, found themselves in very different environments. 
In Kyrgyzstan, political instability and poverty culminated in the spring 2005 
overthrow of the government, while Kazakhstan has witnessed greater political 
stability and high rates of economic growth in recent years. Between 1996 and 
2006, Kazakhstan's economy grew at a robust average rate of 6.85 percent a year. 
In contrast, during the same period, the average annual growth of Kyrgyzstan’s 
economy equaled -0.42 percent (United Nations Statistics Division 2008). 

Official infant mortality data provide a different angle from which the rather 
dramatic divergence in the countries’ fortunes is illuminated. In 1999, infant 
mortality rates (deaths of infants under the age of 1 per 1,000 live births) in 
Kyrgyzstan and Kazakhstan were 22.7 and 20.4, respectively. By 2005, Kyrgyzstan’s 
infant mortality rate had increased to 29.7, while that of Kazakhstan had fallen 
to 15.1 (Shokomanov 2006). According to the most recent available data, 43 percent 
of Kyrgyzstan’s population lives at or below the poverty line (National Statistical 
Committee of the Kyrgyz Republic 2006); however, the same is true of only 13.8 
percent of Kazakhstan’s population (CIA World Factbook 2008). 


Hypotheses 


The similarity (shared Soviet history, majority-Muslim populations, explosion of 
economic inequality, collapse of the welfare state) and the contrast (highly diver- 
gent economic trajectories) found in Kazakhstan and Kyrgyzstan today offer fertile 
ground for enriching our understanding of the factors shaping economic justice 
attitudes. While none of the countries in the earlier studies of economic justice had 
the mix of characteristics found in post-Soviet Central Asia, this body of literature is 
instrumental in informing my expectations and supplying testable hypotheses. Here 
I test to see: (1) how well hypotheses derived elsewhere apply on post-Soviet Central 
Asian soil, and (2) which determinants of egalitarian economic justice attitudes, if 
any, are most/least sensitive to the overall economic situation in the country. 

Hypothesis 1: According to the underdog principle, the more vulnerable 
members of the population (e.g., women, the unemployed, people from low- 
income households and those with low educational attainment) are more likely 
to favor egalitarian notions of economic justice than those who are better off. 
Correspondingly, those in working class occupations are likely to be more sup- 
portive of equality than those in middle/upper middle class jobs.* Although in 
the West ethnic minorities are typically considered underdogs, in Kazakhstan and 
Kyrgyzstan this is not the case. Ethnic minorities in Kazakhstan and Kyrgyzstan 
usually occupied privileged positions in government and industry. This is due 
to the fact that under Soviet rule both republics experienced a massive influx of 
Russians and other Slavic peoples who were brought in to work in the industrial 
sector and in administration, while Kazakhs and Kyrgyz were engaged predomi- 
nantly in agriculture. Thus, I expect members of ethnic minorities to be less 
egalitarian in their economic attitudes than the people belonging to titular ethnic 
groups. 

Kazakhstan vs. Kyrgyzstan: I expect the underdog principle to be more 
potent in Kyrgyzstan, where economic opportunities are scarce and pessimism 
regarding the country’s economy is pervasive. In more prosperous Kazakhstan, 
with its oil-driven economic boom and expanding range of economic opportuni- 
ties, the egalitarian effect of the underdog principle may be dampened due to a 
transplanted American Dream effect, wherein people’s economic attitudes are 
framed by their long-term expectations, rather than by their current situation. 

Hypothesis 2: In both countries in my analysis, rural residents suffered the 
worst deterioration in their living standards after the collapse of the USSR and 
are still living in an environment defined by high unemployment and lack of 
economic prospects. In Kazakhstan and Kyrgyzstan, there is a glaring dispar- 
ity between economic opportunities available in cities as opposed to rural areas, 
which has precipitated massive rural to urban migration. I expect rural dwellers 
to be significantly more egalitarian in their attitudes toward redistribution in 
comparison to those living in large cities. 

Kazakhstan vs. Kyrgyzstan: Kazakhstan's superior macroeconomic performance 
is apparent when visiting the two countries’ respective capitals, as well as larger 
cities. However, one would be hard pressed for signs of new-found oil wealth in 
Kazakhstan’s decimated countryside. Thus, I would expect to find no significant 
differences between the two countries in the effect of area of residence. 

Hypothesis 3: Consistent with Robinson and Bell’s (1978) egalitarian Zeitgeist 
principle, I expect that older people—those who lived and worked during the years 
of the Soviet Union, with its strong egalitarian ideology—are more likely to be 
supportive of redistribution than are younger people who came of age after the 
collapse of the USSR. Note that while my expectation is in line with Robinson 
and Bell’s, the effect of age is the reverse of the prediction that Robinson and 
Bell derived by applying this principle to the United States in the mid-1970s. 
When their data were gathered, younger people were those who had come of age 
politically during the egalitarian movements of the 1960s, while older people had 
reached political maturity during less egalitarian times. 

Kazakhstan vs. Kyrgyzstan: If younger people are found to be less egalitarian than 
those who came of age during the Soviet era, I would expect this effect to be stronger 
in Kazakhstan because of the wider range of economic opportunities available there. 

Hypothesis 4: Expectations of future returns/future success reduce egalitar- 
ian attitudes. In his analysis of 1991 Polish data, Zagorski (1994) found that 
subjective expectations are more important than actual earnings in determin- 
ing support for economic justice attitudes. I expect to find less egalitarianism 
among those who can “imagine themselves on the privileged side of an unequal 
distribution.” (Robinson and Bell 1978:138) 

Kazakhstan vs. Kyrgyzstan: In Kazakhstan's rapidly expanding economy, indi- 
vidual expectations about one’s economic prospects are more tangible and should 
carry more weight (be more consequential) than in Kyrgyzstan, where such hopes 
are more ephemeral. Hence, I expect that anticipation of a brighter financial future 
will do more to dampen support for egalitarianism in Kazakhstan than in Kyrgyzstan. 

Hypothesis 5: Beliefs about the fairness of the existing economic order shape 
attitudes toward redistribution (Linos and West 2003; Luo 1998; Fong 2001; 
Funk 2000). Kazakhstan and Kyrgyzstan, where resentment and suspicion toward 
the nouveau riche appear widespread and intense, offer an intriguing opportunity 
to test this finding. I expect that those who believe the economic system to be 
unfair (e.g., wealth gained via connections and dishonesty, poverty a result of 
discrimination) are more likely to embrace egalitarian attitudes toward economic 
justice and redistribution. 

Kazakhstan vs. Kyrgyzstan: In today’s Kyrgyzstan, where most people's economic 
fortunes are falling and poverty is ubiquitous, the salience of beliefs about system 
fairness may be diminished. In contrast, the issue of fairness should have more 
potency in Kazakhstan, where the economy is growing and the anxiety about be- 
ing left behind is less abstract. Thus, I expect that concerns about economic system 
fairness have a greater effect in Kazakhstan than in Kyrgyzstan. 

Hypothesis 6: Religious traditionalists are more likely to support economic egali- 
tarianism than are religious modernists (Davis and Robinson 1999, 2006). Although 
the majority of people in Kazakhstan and Kyrgyzstan are Sunni Muslims, the two 
societies are very different from the predominantly Muslim states analyzed by Davis 
and Robinson (2006). The two share a recent history of state-imposed atheism and 
exhibit greater religious diversity due to a sizeable presence of Slavs (i.e., Russians, 
Ukrainians and Belarusians). For example, in multiethnic Kazakhstan more than 65 
percent of the population consists of people with a Muslim ethnic background 
(including Kazakhs, Uzbeks, Tatars and Uighurs), while Russians, Ukrainians and 
Germans make up more than 30 percent of the population. In Kyrgyzstan, people 
with a Muslim ethnic background (including Kyrgyz, Uzbeks, Uighurs and 
Dungans) account for more than 85 percent of the population. In addition, people 
in both countries also experienced nearly seven decades of state-imposed atheism, 
and many still identify as non-believers. Nonetheless, based on what is known from 
earlier research, I expect that in Kazakhstan and Kyrgyzstan greater support for the 
imposition of sharia law is associated with egalitarian attitudes toward economic 
justice (the religious traditionalism principle). 

Hypothesis 6a (alternative hypothesis): The egalitarian effect of religious tradi- 
tionalism found in other societies will be negated in Kazakhstan and Kyrgyzstan 
due to their legacy of official atheism and heterogeneous demographic composition. 

Kazakhstan vs. Kyrgyzstan: The relationship between religious orthodoxy and 
egalitarianism has been shown to hold in multiple countries and several faith 
traditions (Davis and Robinson 1999, 2006). Thus, I do not anticipate significant 
differences between Kazakhstan and Kyrgyzstan on this issue. 


Data 


Interviews in Kazakhstan were carried out by the Almaty-based Center for Study 
of Public Opinion (Tcentr Izucheniya Obshestvennogo Mneniya). In Kyrgyzstan, 
the work was conducted by the Bishkek-based El-Pikir Center for Public 
Opinion Research (Tcentr Issledovaniy Obshestvennogo Mneniya El-Pikir). Both 
organizations have strong national reputations and extensive experience in con- 
ducting nationally representative surveys for domestic and foreign clients from 
the academic, government and private sectors. Each organization has cultivated 
a network of trained interviewers for its data collection. Both employ compre- 
hensive quality-control procedures.* 

Any effort to organize nationally representative surveys in Kazakhstan and 
Kyrgyzstan must overcome formidable obstacles. Telephone-based surveys are not 
feasible because telephone coverage in most rural areas is scarce. As a result, face- 
to-face interviews must be conducted. However, the existing lists of residents, such 
as voter lists and address books, are outdated and incomplete. Thus, a sample of 
households, rather than a sample of individuals, is normally used. Yet, a compre- 
hensive national list of households is not available in either country. Due to these 
constraints, the most widely-used method for obtaining nationally representative 
data in both countries is a multistage stratified probability sample of households. 

Selection of settlements: The first stage involves selection of cities, towns and 
villages from the existing list of settlements available from the State Statistical 
Agencies (Goskomstat) of both countries. To make the selection, all settlements 
are first classified into groups (strata) defined by region and population size. 
Both Kyrgyzstan and Kazakhstan are typically divided into five geographic 
regions: north, west, south, east and central. Within each region, all settlements 
are classified into large urban (oblast capitals), other urban (other towns), and 
rural. As a result, each one of the five regions will have three sub-strata. Then, 
within each region, a number of settlements from each sub-stratum are ran- 
domly selected. The number of interviews to be conducted in each settlement 
is determined by its population size. 
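
The first-stage logic described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the sampling frame, the function name `plan_first_stage`, the number of settlements drawn per sub-stratum, and the strictly proportional allocation rule are all hypothetical and are not taken from the survey organizations' actual procedures.

```python
import random
from collections import defaultdict


def plan_first_stage(settlements, per_stratum, total_interviews, seed=0):
    """Pick settlements at random within each (region, type) stratum and
    allocate interviews to them in proportion to population size."""
    rng = random.Random(seed)

    # Group the frame into strata defined by region and settlement type.
    strata = defaultdict(list)
    for s in settlements:
        strata[(s["region"], s["type"])].append(s)

    # Randomly select settlements within every stratum.
    chosen = []
    for key in sorted(strata):
        members = strata[key]
        chosen.extend(rng.sample(members, min(per_stratum, len(members))))

    # Allocate interviews proportionally to population (rounded, so the
    # total can differ from the target by a few interviews).
    total_pop = sum(s["population"] for s in chosen)
    return [
        (s["name"], s["region"], s["type"],
         round(total_interviews * s["population"] / total_pop))
        for s in chosen
    ]


# Hypothetical frame: two regions, three settlement types, two settlements each.
TYPES = {"large urban": 400_000, "other urban": 50_000, "rural": 5_000}
frame = [
    {"name": f"{region}-{kind}-{i}", "region": region, "type": kind,
     "population": base * i}
    for region in ("north", "south")
    for kind, base in TYPES.items()
    for i in (1, 2)
]

plan = plan_first_stage(frame, per_stratum=1, total_interviews=1000)
```

Stratifying before drawing guarantees that every region and settlement type appears in the sample, which a simple random draw of settlements would not.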

Selection of households: After the settlements are selected, households are chosen 
using the random route sample method. First, ZIP codes are selected randomly; 
next, streets within ZIP code areas are selected randomly; and, finally, actual 
households from each street are randomly chosen. 

Selection of respondents and interview eligibility: After a household has been iden- 
tified, respondent selection is made using the most recent/next birthday method. 
All adults ages 18 and older were eligible for participation. 

Response rate and sample size: The response rates in Kyrgyzstan and Kazakhstan 
were 94 percent and 76 percent, respectively. In Kazakhstan, 1,099 interviews 
were completed, while in Kyrgyzstan 1,000 interviews were completed.’ 

The 2007 Kazakhstan and Kyrgyzstan Inequality Survey was the first of its kind 
in post-Soviet Central Asia and produced a unique, internationally comparative 
dataset. The questionnaire combined translated and pre-tested questions that 
originally were asked in the International Social Justice Project in Russia in 1996, 
along with questions from the Social Inequality module of the International 
Social Survey Program and the World Values Survey, both administered in Russia 
in 1999. The use of well-established questions helps ensure reliable and valid 
measurement and enables theoretically intriguing comparisons of data collected 
in Central Asia to existing datasets originating elsewhere. Comparison of demo- 
graphic variables (age, sex, ethnicity, urban vs. rural) with available Kazakhstan 
and Kyrgyzstan government statistics confirms that the sample data are representa- 
tive of their underlying populations (details available on request). 


Measures 
Dependent Variable 


I measure egalitarian economic justice attitudes with a four-item index, based on 
the respondent's degree of agreement with the following statements: 


1. The most important thing is that people get what they 
need, even if this means allocating money from those 
who have earned more than they need. 

2. It is the responsibility of the government to reduce 
the differences in income between people with high 
incomes and those with low incomes. 

3. The fairest way of distributing wealth and income 
would be to give everyone equal shares. 

4. The government should place an upper limit on the 
amount of money any one person can make. 

Answers range from strongly disagree (1) to strongly agree (5). The index is a 
mean of responses to these questions, with a higher score corresponding to more 
egalitarian attitudes. Cronbach's alpha for the index is .65 in Kazakhstan and .60 
in Kyrgyzstan. By combining items dealing with need-based distribution (item 
1), government's role in income re-distribution (items 2 and 4), and general sup- 
port for equal incomes (item 3), the index provides a comprehensive measure of 
support for egalitarian economic justice. 
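
As a rough illustration of how such an index and its reliability coefficient are computed, the sketch below takes the mean of four item scores per respondent and applies the standard formula for Cronbach's alpha. The five response rows are invented for the example and are not survey data; the function names are likewise hypothetical.

```python
from statistics import mean, pvariance


def egalitarian_index(responses):
    """Mean of one respondent's item scores (1 = strongly disagree, 5 = strongly agree)."""
    return mean(responses)


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum of item variances / variance of total score)
    """
    k = len(rows[0])
    item_vars = [pvariance([row[j] for row in rows]) for j in range(k)]
    total_var = pvariance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Invented answers from five respondents to the four items (illustration only).
sample = [
    [5, 4, 5, 3],
    [4, 4, 3, 2],
    [2, 3, 2, 1],
    [3, 3, 4, 2],
    [5, 5, 4, 4],
]

scores = [egalitarian_index(r) for r in sample]
alpha = cronbach_alpha(sample)
```

Alpha rises as the items covary more strongly, so the moderate values reported for the two countries suggest that the four statements tap related but not identical attitudes.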


Independent Variables 


The study employs a combination of variables typically found in analyses of eco- 
nomic justice attitudes. A summary of the independent variables used in the 
model is provided in Appendix Table A1. 

Demographic variables: Individual characteristics included in the model are 
gender, a series of dummy variables for age, urban area residence and being an 
ethnic minority (i.e., non-Kazakhs in Kazakhstan and non-Kyrgyz in Kyrgyzstan). 

Socioeconomic variables: Occupation/employment status is a dummy variable 
series identifying owner/manager (reference category), professional, white col- 
lar, blue collar, student, housewife/helping with family business, retired and 
unemployed. Education is measured as years spent studying full-time. Because 
the educational systems in the two countries were and remain very similar, direct 
comparison of the effects of education is possible. Household income is total 
monthly income of the household after taxes. The two countries have different 
national currencies (Kazakh tenge and Kyrgyz som), and average salaries/pen- 
sions differ substantially. To enable direct comparisons of the effect of income 
between the two countries, the household income variable is standardized to a 
mean of 0 and a standard deviation of 1. 
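
A minimal sketch of this kind of standardization follows. It assumes that incomes are standardized separately within each country and that the population standard deviation is used; the text does not specify either choice, and the income figures are hypothetical.

```python
from statistics import mean, pstdev


def standardize(values):
    """Rescale values to mean 0 and standard deviation 1 (z-scores)."""
    m = mean(values)
    s = pstdev(values)
    return [(v - m) / s for v in values]


# Hypothetical monthly household incomes in each national currency.
incomes_tenge = [40000, 60000, 80000, 120000]  # Kazakhstan
incomes_som = [3000, 5000, 7000, 9000]         # Kyrgyzstan

z_kz = standardize(incomes_tenge)
z_kg = standardize(incomes_som)
```

After rescaling, a one-unit change means one standard deviation of income in either country, so the regression coefficients no longer depend on the currency unit.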

Economic Prospects and Economic System Fairness: Economic prospects are 
captured by asking respondents to predict their household’s financial situation in five 
years’ time as compared with today. 

Perceived fairness of the economic system is an index based on the respondent's 
assessment of the frequency with which the following non-merit based factors 
determine success or failure in their society: 


In your view, how often is each of the following factors a reason why there are rich 
people in Kazakhstan/Kyrgyzstan today? 


1. Dishonesty 

2. Having the right connections 

3. More opportunities to begin with 

4. The economic system which allows [them] to take 
unfair advantage 


In your view, how often is each of the following factors a reason why there are poor 
people in Kazakhstan/Kyrgyzstan today? 


1. Prejudice and discrimination against certain groups in 
Kazakhstan/Kyrgyzstan 

2. Lack of equal opportunity 

3. Failure of the economic system 


Answers range from very often (1) to never (5). The scale is a mean of responses to 
these questions, with a higher score corresponding with the view of society's eco- 
nomic system as fair. Cronbach's alpha for the scale is about .65 in both countries. 

Islamic Orthodoxy: Islamic orthodoxy is measured using a question on support 
for the establishment of Islamic law as the sole basis for jurisprudence (Davis and 
Robinson 2006; see Appendix Table A1). Although orthodox Muslims may differ 
in their interpretations of the sharia, they “share the desire to implement Islamic 
law as the sole legal foundation, rather than allow legal codes to emerge through 
pluralistic political negotiation and compromise between competing interests, 
including secular ones.” (Davis and Robinson 2006:171) The responses are recoded 
so that high values indicate support for implementation of sharia. 

As mentioned earlier, both countries in my analysis have significant non-Muslim 
populations. This reality is reflected in my data. In Kazakhstan, slightly more than 
55 percent of the respondents are of Muslim ethnic background, whereas their 
share reaches 84 percent in Kyrgyzstan. To avoid heavy loss of observations, I assign 
non-Muslims the mean on the sharia item for Muslims. This means that any effect 
of Islamic orthodoxy should be interpreted as pertaining only to ethnic Muslims. 
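
The imputation step can be illustrated with the short sketch below. The record layout (`muslim`, `sharia`) and the function name are hypothetical, and the values are invented; the point is only that respondents of non-Muslim background receive the mean score observed among respondents of Muslim background.

```python
from statistics import mean


def impute_sharia(respondents):
    """Assign non-Muslim respondents the mean sharia score of Muslim respondents."""
    muslim_mean = mean(r["sharia"] for r in respondents if r["muslim"])
    return [r["sharia"] if r["muslim"] else muslim_mean for r in respondents]


# Invented records: the sharia item is missing (None) for the non-Muslim respondent.
records = [
    {"muslim": True, "sharia": 4},
    {"muslim": True, "sharia": 2},
    {"muslim": False, "sharia": None},
    {"muslim": True, "sharia": 3},
]

imputed = impute_sharia(records)
```

Because every imputed case sits exactly at the mean, these respondents add no variation on the sharia item, which is why the estimated effect reflects differences among ethnic Muslims only.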

Finally, I control for attendance of religious services, with answers ranging from 
(1) never or practically never to (7) more than once a week. Because attendance of re- 
ligious services may occur for social rather than religious reasons, non-attendance 
does not always signal a lack of religiosity. In fact, authorities in Central Asia are 
usually suspicious of religious activity outside of state-sanctioned mosques for fear 
of religious fundamentalism (e.g., the banned Hizb-ut-Tahrir movement). Thus, 
I make no prediction regarding this variable’s effect on economic justice attitudes. 


Results 
Economic Justice Attitudes in Kazakhstan and Kyrgyzstan 


Table 1 shows the distribution of respondents in Kazakhstan and Kyrgyzstan 
on the four items comprising the index of egalitarian economic justice attitudes. 
People in both countries display largely similar levels of support for redistribu- 
tion, with Kyrgyzstanis embracing slightly more egalitarian attitudes than their 
counterparts in Kazakhstan. Most notably, in Kyrgyzstan, more than 70 percent 
of respondents agree with the idea that “the fairest way of distributing wealth and 
income would be to give everyone equal shares,” whereas less than 48 percent of 
Kazakhstanis agree with this proposition. 
The statement that generated the most 
‘disagree’ and ‘strongly disagree’ responses 
in both countries refers to government 
placing “an upper limit on the amount of 
money any one person can make.” 

The index ranges from 1 to 5, with 
higher values indicating greater support for 
egalitarian attitudes toward redistribution. 
The respective means of 3.23 and 3.30 in 
Kazakhstan and Kyrgyzstan highlight their 
relative similarity on economic justice at- 
titudes, and suggest that people in both 
countries lean toward the egalitarian pole. 


Perceived Fairness of the Economic System 


Table 2 shows the distribution of responses 
from Kazakhstan and Kyrgyzstan on the 
seven items comprising the economic sys- 
tem fairness scale. I find that respondents 
in the two countries share many beliefs 
about the importance of non-merit based 
factors in determining economic success or 
failure. Non-meritorious factors are often 
thought of as defining a person's economic 
trajectory. Respondents in Kazakhstan and 
Kyrgyzstan identified “having the right 
connections” as the most common factor 
responsible for individual wealth. However, 
“lack of equal opportunities” is identified 
as the most common cause of poverty in 
Kazakhstan, while “failure of the economic 
system” is most often cited in Kyrgyzstan. 
This important difference reflects the diver- 
gence in the two countries’ economic for- 
tunes in the past decade and is consistent 
with my expectations. 

The scale ranges from 1 to 5, with 
higher scores associated with the view of 
the system as fair, and lower scores with 
the view that non-merit based factors are 
important in determining individual eco- 
negatively associated with egalitarian attitudes. 

I use ordinary least squares regression to esti- 
mate the effects of the independent variables on 
egalitarian economic justice attitudes. To better 
ascertain the relative explanatory power of the dif- 
ferent variables, I compute two models for each 
country. Model 1 includes gender, age, urban area 
residence, ethnic minority membership, 
occupation/employment status, education, household 
income and economic prospects. Model 2 adds 


Table 3: Zero-order Correlations for Egalitarian Economic Justice Attitudes 

                                           Kazakhstan   Kyrgyzstan 
Male                                          -.042        -.029 
Age 18-24                                     -.013         .002 
Age 25-34                                      .009         .007 
Age 35-44                                     -.007        -.046 
Age 45-54                                     -.031        -.006 
Age 55-64                                     -.041         .023 
Age 65 or above                                .092         .044 
Ethnic minority                               -.088        -.130 
Urban                                         -.171        -.198 
Education                                      .026        -.184 
Household income                               .039        -.200 
Future economic prospects                     -.102        -.014 
Owner/Manager                                 -.007         .009 
Professional                                   .042        -.101 
White collar                                  -.004        -.037 
Blue collar                                   -.006        -.096 
Student                                       -.081         .020 
Homemaker, helping with family business       -.041         .102 
Retired                                        .043         .064 
Unemployed                                     .034         .010 
Perceived fairness of economic system         -.170        -.003 
Religious services attendance                  .074        -.081 
Support for shari’a                            B22          or 

Kazakhstan N = 839, Kyrgyzstan N = 912 
Source: Kazakhstan and Kyrgyzstan Inequality Survey, 2007 


attitudes regarding the fairness of the economic system, Islamic orthodoxy and re- 
ligious service attendance. To further explore whether the effects of these variables 
are different in the two contexts, I estimate fully interactive models by creating 
interactions between all independent variables and a country dummy variable. 
When added to the models, these interactions identify which relationships differ 
by country (see Table 4). 
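
The logic of a fully interactive model can be illustrated with a minimal Python sketch. It is not the estimation code used for this study: the data are synthetic, only one predictor is interacted with the country dummy (the article interacts all of them), and every name in it (`ols`, `solve`, `urban`, `country`) is a hypothetical choice made for illustration. The point it shows is that, in a pooled regression, the coefficient on the interaction term equals the difference between the two country-specific slopes.

```python
# Illustrative sketch only: synthetic data and hypothetical names,
# not the survey data or the estimation code behind the article.
from typing import List


def solve(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def ols(rows: List[List[float]], y: List[float]) -> List[float]:
    """Ordinary least squares via the normal equations (X'X) b = X'y."""
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    return solve(xtx, xty)


# Synthetic records: (urban, country, attitude score).
# country = 0 and country = 1 stand in for the two national samples.
data = [
    (0, 0, 4.0), (1, 0, 3.5), (0, 0, 4.0), (1, 0, 3.5),
    (0, 1, 4.2), (1, 1, 3.2), (0, 1, 4.2), (1, 1, 3.2),
]

# Design matrix: intercept, urban, country dummy, urban x country.
rows = [[1.0, u, c, u * c] for u, c, _ in data]
y = [score for _, _, score in data]

intercept, b_urban, b_country, b_interaction = ols(rows, y)

slope_country0 = b_urban                  # effect of urban where country = 0
slope_country1 = b_urban + b_interaction  # effect of urban where country = 1
```

In this toy example the interaction coefficient is nonzero because the urban effect was built to differ between the two synthetic samples; with real data, its standard error would also be needed to judge whether the difference is statistically significant.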

Regression analysis reveals similarity and divergence between the two countries,
reflecting both the commonalities in the countries’ history, demography
and religion and their divergent economic trajectories since the collapse of the
Soviet Union.


Similarity 
Ethnicity, Area of Residence and Islamic Orthodoxy 
Three significant causal relationships appear to be largely immune to the growing 
wealth chasm between the two societies. 

Belonging to the advantaged ethnic minority is associated with less egalitar- 
ian economic attitudes in both countries. This result is consistent with the 
expectations derived from the underdog principle (H1) because we know that 
ethnic minorities in Kazakhstan and Kyrgyzstan historically occupied privileged
positions in government and industry. 

In both countries urban residence is associated with significantly less egalitarian 
attitudes (H2). The deterioration in the standard of living experienced by former 
Soviet citizens after the USSR’s collapse hit residents of rural areas especially hard. 
With the elimination of collective farms, jobs disappeared and infrastructure, such 
as schools and hospitals, soon followed. Scarcity of economic prospects, decrepit 
infrastructure, as well as a certain social stigma attached to rural residence, com- 
bine to make people living outside large cities feel like underdogs, even when their 
household incomes are comparable with those of urban residents. 

Finally, expectations regarding the egalitarian effect of Islamic orthodoxy on 
economic justice attitudes (H6) receive strong support. In both countries, Islamic 
orthodoxy has a significant egalitarian effect on individual attitudes about
economic justice. With orthodoxy operationalized as support for the idea that
sharia should serve as the sole basis for law in the country, this effect is
consistent with findings from other majority-Muslim nations (Davis and
Robinson 2006).⁸


Divergence 


Age, Education, Income, Economic Prospects and Fairness of the Economic System 

In Kazakhstan's rapidly growing economy, people in their prime working years 
(35-44; 45-54; 55-64 years old) are significantly less supportive of egalitarianism 
than are older people/retirees (65 or above). Interestingly, before variables measur- 
ing perceived fairness of the economic system, religious services attendance, and 
support for sharia are introduced in Model 2, people in the 25-34 age group 
are also significantly less supportive of egalitarianism than the reference group. 
This, however, does not appear to result from the Soviet egalitarian zeitgeist effect 
because the youngest cohort (18-24 years old) is not significantly different from 
the oldest group in economic justice attitudes in either model. By contrast, in 
Kyrgyzstan, none of the younger cohorts are different from the oldest cohort. In 
the environment of scarce employment opportunities and low pay, economic 
vulnerability permeates all age groups. 

In Kyrgyzstan, higher levels of education and household income are associated 
with less egalitarian attitudes. Those with greater human capital and those from 
wealthier households enjoy more economic security and are less supportive of 
egalitarianism (H1). However, in Kazakhstan neither education nor household 
income dampens egalitarianism. In fact, household income appears to exert a 
statistically significant egalitarian effect on attitudes toward redistribution when 
all other variables are controlled for.⁹ This seemingly counterintuitive outcome
appears to echo a similar result reported by Blouin and Robinson (2007), who find
a positive effect of income on compassion and egalitarianism in a study analyzing
2004 General Social Survey data. They interpret this as a possible noblesse oblige
effect: the rich may feel an obligation to look out for those not as well off.

In Kazakhstan, one’s economic prospects have a significant effect on economic
justice attitudes (H4). Interestingly, more people in Kyrgyzstan (68.3%) than in
Kazakhstan (55.3%) expect to do much better or a little better in the future. This
may be because many people in Kyrgyzstan are doing so poorly economically
that there is nowhere to go but up. Yet, only in more affluent Kazakhstan does 
this optimism translate into a significant effect on attitudes toward redistribution. 
In Kazakhstan optimism regarding one’s economic prospects reduces support for 
egalitarian views, a finding that is consistent with results of studies conducted in 
Eastern Europe and Russia (Kluegel et al. 1999). This result is also consistent with 
Robinson and Bell’s (1978) finding for the United States in the mid-1970s—that 
belief in the American Dream acted as an important depressant on egalitarian at- 
titudes. In Kyrgyzstan, with its stagnant economy and pervasive skepticism about 
the country’s economic prospects, most people hope for future improvement in 
their standard of living, but such hopes have no discernible effect on economic at- 
titudes. Instead, only variables measuring vulnerability here and now—household 
income and education—have significant effects. 

According to Linos and West (2003), those who perceive the economic system 
as unfair are more likely to embrace egalitarian attitudes toward economic justice 
and redistribution (H5). In Kazakhstan, belief that the economic system is fair
(i.e., that factors such as personal connections, dishonesty and discrimination do
not determine individual economic success or failure) significantly diminishes
egalitarianism; conversely, those who perceive the economic system to be unfair
are more likely to espouse egalitarian views. In Kyrgyzstan, however, the effect
of this belief is not
statistically significant. This contrast is best understood through the prism of the
two countries’ divergent economic trajectories. While people in both countries are 
equally cynical about the causes of individual wealth, in Kyrgyzstan the sense of 
hopelessness, lack of economic opportunity and widespread view of government 
as impotent appear to sever the connection between one’s evaluation of fairness 
in society and economic justice attitudes. 


Null Findings 


Gender and Occupation 
While the effects of some variables transcend a country’s economic environment (e.g.,
Islamic orthodoxy) and the effects of others are sensitive to it (e.g., household
income), two characteristics in the model are inconsequential in both contexts. Gender
has no significant effect on economic justice attitudes in either country, as
egalitarian sentiments are evenly distributed among males and females. This result is
consistent with the Mason et al. (2000) analysis of 1996 data from Russia and Linos 
and West’s (2003) examination of 1992 ISSP data from four Western democracies. 
In neither country does one’s occupation affect economic justice attitudes, as 
people in positions of varying levels of skill and authority are not more egali- 
tarian than business owners/managers (reference category). In fact, students in 
Kazakhstan and blue-collar workers in Kyrgyzstan are less egalitarian than
business owners/managers in Model 1. However, once the remaining variables 
are introduced in Model 2, this outcome becomes statistically insignificant 
in Kazakhstan, but persists in Kyrgyzstan. Wald tests reveal that the occupa- 
tional dummies are not jointly significant in either Kazakhstan or Kyrgyzstan 
(F-statistics of 2.01 and 1.48, respectively). This result is consistent with Davis 
and Robinson (2006: 182), who found “no systematic effect of occupation (e.g., 
between owners/managers and blue-collar workers) on economic attitudes” in 
their analysis of majority-Muslim nations.
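
For readers unfamiliar with joint significance tests, the following Python sketch shows one common way such an F-statistic is obtained. It is an assumption about the general technique, not a description of how the statistics reported above were computed: under classical OLS assumptions, a Wald test that a block of coefficients (here, the occupational dummies) is jointly zero can be expressed as an F-test comparing the sum of squared residuals of a restricted model (block omitted) with that of the unrestricted model (block included). The function name and the numbers in the example are hypothetical.

```python
# Illustrative sketch only: the textbook F form of a joint exclusion test.
# The inputs below are made-up numbers, not values from the article.


def joint_f_statistic(
    ssr_restricted: float,
    ssr_unrestricted: float,
    n_restrictions: int,
    df_resid: int,
) -> float:
    """F statistic for H0: all coefficients in a block of regressors are zero.

    ssr_restricted   -- sum of squared residuals with the block omitted
    ssr_unrestricted -- sum of squared residuals with the block included
    n_restrictions   -- number of coefficients set to zero under H0
    df_resid         -- residual degrees of freedom of the unrestricted model
    """
    if n_restrictions <= 0 or df_resid <= 0:
        raise ValueError("n_restrictions and df_resid must be positive")
    if ssr_unrestricted <= 0:
        raise ValueError("ssr_unrestricted must be positive")
    numerator = (ssr_restricted - ssr_unrestricted) / n_restrictions
    denominator = ssr_unrestricted / df_resid
    return numerator / denominator


# Hypothetical example: dropping a block of 4 dummies raises the SSR
# from 100 to 120 in a model with 200 residual degrees of freedom.
f_value = joint_f_statistic(120.0, 100.0, 4, 200)
```

The resulting value is compared against the F distribution with `n_restrictions` and `df_resid` degrees of freedom; a value below the critical threshold means the block cannot be shown to add explanatory power jointly, even if an individual dummy reaches significance on its own.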


Discussion and Conclusion 


Due to many of their characteristics (predominantly Muslim populations, a
tumultuous market transition after decades of Soviet rule, rapid rollback of the
previously expansive welfare state, and mushrooming economic inequality), Kazakhstan
and Kyrgyzstan provide compelling settings for enhancing our understanding of 
economic justice attitudes and their predictors. The virtually identical standards 
of living enjoyed by their inhabitants under the USSR, in combination with the 
two countries’ starkly different economic situations today, help illuminate the 
relationship between a country’s economic performance and popular attitudes 
toward redistribution. Importantly, this study finds that a very different set of 
factors influences egalitarian economic justice attitudes in a crisis environment 
than in an environment characterized by economic growth. 

This analysis had a dual objective: (1) to compare the factors that drive
egalitarian economic justice attitudes in two countries distinguished almost exclusively
by their dissimilar economic circumstances; and (2) to identify which findings,
derived in very different contexts, hold in these two post-Soviet, majority-Muslim 
nations. The effects of three independent variables—ethnic minority membership, 
urban area residence and Islamic orthodoxy—on egalitarian economic justice 
attitudes appear impervious to the two countries’ divergent economic trajecto- 
ries. However, the effects of age, education, household income, future economic 
prospects and perceived fairness of the economic system are much more sensitive 
to the countries’ economic fortunes. 

In both Kazakhstan and Kyrgyzstan, ethnic minorities are less egalitarian than 
members of the titular ethnic groups. A history of Moscow-imposed settlement of 
Slavs and other nationalities in Central Asia is likely the most important reason for 
this finding. Hundreds of thousands of Russians, for example, were brought into 
the region between the 1930s and 1980s. The non-titular newcomers were usually 
more educated and were brought in to work in urban areas in manufacturing and 
administrative jobs. Traditionally, Kazakhs and Kyrgyz lived in rural areas, and 
majorities can still be found working in agriculture. According to 2006 data from 
Kazakhstan, 76.8 percent of Russians live in urban areas, compared with just 49.4 
percent of Kazakhs (Shokomanov 2006).

Since both countries acquired independence in 1991, large numbers of non-titulars
have left. However, the remaining minorities still possess higher levels of
human capital. Although time is not on their side and their relative advantage 
diminishes as the demographic landscape undergoes transformation, as of 2008, 
the relatively more privileged position occupied by minorities in both countries 
during Soviet times still exerts a significant influence. Thus, I would argue that 
despite the new reality of having replaced Slavs in leadership positions in govern- 
ment and industry, at least for now, ordinary Kazakhs in Kazakhstan and Kyrgyz 
in Kyrgyzstan continue to display many of the attributes of underdogs. 

Reflecting the plight of rural and small town residents in both countries, those liv- 
ing in major urban areas are less egalitarian than the rest of the population. According 
to Kyrgyz government data, 75 percent of all those classified as poor live in rural 
areas (National Statistical Committee 2006). In more prosperous Kazakhstan, the 
benefits of economic growth experienced over the past decade have been largely 
confined to urban centers and have yet to trickle down to small towns and villages. 
This imbalance has triggered unprecedented waves of migration from rural to urban 
areas, remaking the demographic landscape of the countries in the process. With 
no relief in sight, there is good reason to expect that this effect will continue to be 
significant in the future. In post-Soviet Central Asia, people living outside large 
cities are the underdogs, and their attitudes toward economic justice reflect this fact. 

The egalitarian effect of religious traditionalism in all of the “religions of 
the Book,” and the Muslim faith in particular, was documented by Davis and 
Robinson (1999, 2006). Kazakhstan and Kyrgyzstan provide perhaps the most 
rigorous test of this relationship. While both are majority-Muslim (of the Sunni
branch), the two countries also have a 70-year history of official atheism. Thus,
Islam in these societies is perhaps among the most secular varieties found in the
world today. If Saudi
Arabia represents one end of the religious traditionalism spectrum, it would be safe 
to say that post-Soviet Central Asia occupies the other end of the same continuum. 
It is all the more remarkable, then, that the effect of orthodoxy in generating 
egalitarian attitudes still stands and is highly significant in both countries, even 
after accounting for the effects of all other variables in the model. 

While the effects of ethnicity, area of residence and religious orthodoxy on egal- 
itarianism are not contingent upon macroeconomic performance, this is not so in 
the case of age, education, income, economic prospects and perceived fairness of 
the economic system. In Kyrgyzstan, due to the government's failure to achieve 
sustained economic growth, a perceived lack of opportunity and hopelessness re- 
garding the country’s economic prospects are widespread. Young and middle-aged 
people are just as likely to support egalitarian economic justice principles as those 
who worked most of their life under the Soviet Union and have already retired. In 
this context, hopes of future improvements in the standard of living and perceived 
fairness of the economic system carry little weight. Only factors gauging present 
vulnerability—education and household income—affect economic justice attitudes. 

However, in the context of Kazakhstan’s growing economy, the opposite
appears to be the case. Higher levels of income and education do not diminish
support for egalitarianism. Instead, it is people’s optimism about their future
financial situations and their perception of the economic system as fair that
reduce enthusiasm for the redistribution of wealth. The Kazakhstan case suggests
that during periods of relative prosperity, subjective perceptions, such as
one’s views about society’s fairness and expectations of future economic
well-being, trump objective characteristics, such as income and education. It appears
that an American Dream effect is operating in Kazakhstan: people’s economic
attitudes are framed by their long-term expectations, rather than by their
current situation.¹⁰

Although this study focuses on a part of the world far removed from the West, 
these findings provide insight that should be useful in understanding shifts in 
public opinion beyond Central Asia. The particular arrangement of factors found 
in contemporary Kazakhstan is strikingly similar to the long-established finding 
that the dominant ideology of the American Dream reduces support for egalitari- 
anism and redistributive economic policies, despite one’s objective socioeconomic 
conditions (e.g., Hochschild 1981). In the face of economic crisis, however, there 
appears to be a transformation of attitudes underway in the United States, wherein 
condemnation of the excesses of Wall Street is—at the moment—no longer con- 
fined to the political left but has become decidedly mainstream, perhaps suggest- 
ing the pliability of the American Dream ideology. 

The unique quasi-experimental design of this study allows us to compare eco- 
nomic justice attitudes in different economic environments without the benefit of 
longitudinal data. The next wave of surveys, planned for the fall of 2012, should 
go a long way toward confirming or disconfirming our findings. Kazakhstan’s 
economic fortunes are intimately connected to the price of oil and gas in the 
world market. With current oil prices far below their record levels, the country is 
already experiencing a degree of economic difficulty not seen in the past several 
years. Should this trend continue, I would expect to see greater convergence in the 
factors driving egalitarian economic justice attitudes in the two societies. 

This is the first article using data from the 2007 Kazakhstan and Kyrgyzstan 
Inequality Survey. Future analyses will focus on other aspects of inequality and 
their causes and consequences in the two countries. Who are the winners and 
losers in this transition, especially considering the unique position of privilege his- 
torically enjoyed by ethnic minorities in the two countries? Do people see the new 
economic systems as legitimate, and which characteristics bolster the perception 
of legitimacy? What is the relationship between individual economic trajectories 
and support for democratic values? What are the main social and economic
cleavages behind the growing influence of Islam? What other economic, social and
political attitudes are affected by Islamic orthodoxy? Careful examination of these
questions will not only enrich our understanding of the countries in question, but 
also add to our general knowledge of these subjects by looking beyond the usual
quarters of the world where most studies have been conducted. 


Notes 


1. “Kazakhstan’s... proven hydrocarbon reserves [are] estimated between 9 and 40 
billion barrels.” (U.S. Energy Information Administration 2008) 


2. Although the underdog principle may appear self-evident from a rational-choice 
perspective, it is worth testing empirically because prior studies have shown that 
beliefs about economic justice are not always based on strictly rational calculations 
(e.g., Robinson 1983; Davis and Robinson 2006; Blouin and Robinson 2007). 


3. The cross-sectional comparative design of this study precludes testing hypotheses
involving change over time. However, I hope that these data will serve as a baseline
for future replications, enabling longitudinal comparisons.


4. Additional information about the two organizations is available upon request. 


5. According to both survey research companies, these response rates are typical in
their practice. The lower response rates in Kazakhstan are commonly attributed to 
growing weariness with survey researchers brought about by the recent proliferation 
of marketing studies. In Kyrgyzstan, few marketing studies take place due to the poor 
economy and, as a consequence, people are “unspoiled” by attention from researchers. 
Both companies reported some difficulty in accessing wealthy households—a problem 
that is ubiquitous to survey research all over the world. 


6. Factor analysis reveals that these four items load on a single factor with an eigenvalue 
of 1.21 in Kazakhstan and .98 in Kyrgyzstan. 


7. “Using the degree of support for implementing the shari’a to measure Islamic orthodoxy/ 
modernism is consistent with the orthodox desire to establish and uphold what they 
see as divinely ordained eternal laws that apply to all members of the community, and 
with the contrary modernist belief that legal codes should reflect the times and draw 
upon multiple sources, including secular ones.” (Davis and Robinson 2006:176) 


8. When tested only among people of Muslim ethnic origin, without imputing a mean
response for all non-Muslims, the effect of the sharia variable on economic attitudes is
unchanged in both countries.

9. In Kazakhstan, household income, although insignificant when introduced into the 
regression on its own or with the first block of variables (male, age, etc.), becomes 
significant after other variables are entered into the model. 


10. I am grateful to Robert Robinson for suggesting this interpretation. 


References 


Andress, Hans-Jürgen, and Thorsten Heien. 2001. “Four Worlds of Welfare State
Attitudes? A Comparison of Germany, Norway, and the United States.” European 
Sociological Review 17(4):337-56. 

Arts, W.A., and John Gelissen. 2001. “Welfare States, Solidarity and Justice Principles: 
Does the Type Really Matter?” Acta Sociologica 44(4):283-99. 

Blekesaune, Morten, and Jill Quadagno. 2003. “Public Attitudes toward Welfare State Policies: 
A Comparative Analysis of 24 Nations.” European Sociological Review 19(5):415-27. 


1700 © Social Forces 88(4) 


Blouin, David, and Robert V. Robinson. 2007. “What Makes People Compassionate? 
The Social Bases of Compassion in the United States.” Paper presented on April 5 at 
the joint annual meetings of the Midwest Sociological Society and the North Central 
Sociological Association, Chicago. 

Brooks, Clem, and Jeff Manza. 2007. Why Welfare States Persist: The Importance of Public
Opinion in Democracies. University of Chicago Press.

Central Intelligence Agency. 2008. World Factbook. Available at: https://www.cia.gov/ 
library/publications/the-world-factbook/geos/kz.html. 

Davis, Nancy J., and Robert V. Robinson. 1999. “Their Brothers’ Keepers? Orthodox 
Religionists, Modernists and Economic Justice in Europe.” American Journal of 
Sociology 104(6):1631-65. 

Davis, Nancy J., and Robert V. Robinson. 2006. “The Egalitarian Face of Islamic Orthodoxy: Support for Islamic Law and
Economic Justice in Seven Muslim-Majority Nations.” American Sociological Review 
71(2):167-90. 

Dudwick, Nora, Elizabeth Gomart and Alexandre Marc, with Kathleen Kuehnast. Editors. 
2003. When Things Fall Apart: Qualitative Studies of Poverty in the Former Soviet 
Union. Washington, DC: World Bank. 

Energy Information Administration. 2008. Country Analysis Briefs: Kazakhstan. Available 
at: www.eia.doe.gov/emeu/cabs/Kazakhstan/Oil.html.

Fong, C. 2001. “Social Preferences, Self-interest, and the Demand for Redistribution.” 
Journal of Public Economics 82(2):225-46. 

Funk, C. 2000. “The Dual Influence of Self-Interest and Societal Interest in Public 
Opinion.” Political Research Quarterly 53(1):37-62. 

Hochschild, Jennifer L. 1981. What’s Fair? American Beliefs about Distributive Justice.
Harvard University Press. 

International Monetary Fund and World Bank. 2006. Kyrgyz Republic: Enhanced Initiative 
for Heavily Indebted Poor Countries— Preliminary Document (IMF Country Report No. 
06/417). Washington, DC: International Monetary Fund. 

Kelley, Jonathan, and M.D.R. Evans. 1993. “The Legitimation of Inequality: Occupational 
Earnings in Nine Nations.” American Journal of Sociology 99(1):75-125. 

Kluegel, James, David S. Mason and Bernd Wegener. Editors. 1995. Social Justice and Political 
Change: Public Opinion in Capitalist and Post-Communist States. Aldine de Gruyter.

Kluegel, James R., David S. Mason and Bernd Wegener. 1999. “The Legitimation of 
Capitalism in the Post-Communist Transition: Public Opinion about Market Justice, 
1991-1996.” European Sociological Review 15(3):251-83. 

Kluegel, James R., and David S. Mason. 2004. “Fairness Matters: Social Justice and
Political Legitimacy in Post-Communist Europe.” Europe-Asia Studies 56(6):813-34. 

Linos, Katerina, and Martin West. 2003. “Self-interest, Social Beliefs, and Attitudes 
toward Redistribution: Re-addressing the Issue of Cross-national Variation.” 
European Sociological Review 19(4):393-409.

Luo, X. 1998. “What Affects Attitudes toward Government’s Role in Solving 
Unemployment? A Comparative Study of Great Britain and the United States.”
International Journal of Public Opinion 10(2):121-44. 

Mason, David S., and James R. Kluegel. 2000. Marketing Democracy: Changing Opinion 
about Inequality and Politics in East Central Europe. Rowman & Littlefield. 


Economic Justice Attitudes in Central Asia ¢ 1701 


Mikhalev, Vladimir. Editor. 2003. Inequality and Social Structure during the Transition. 
Palgrave Macmillan. 

Miller, D. 1992. “Distributive Justice: What the People Think.” Ethics 102(3):555-93. 

National Statistical Committee of the Kyrgyz Republic. 2006. Sotsialnye Tendentsii 
Kyrgyzskoi Respubliki. Bishkek, Kyrgyzstan: Natsionalnyi statisticheskii komitet 
Kyrgyzskoi Respubliki. 

Rashid, Ahmed. 2003. Jihad: The Rise of Militant Islam in Central Asia. Penguin Books. 

Robinson, Robert V. 1983. “Explaining Perceptions of Class and Racial Inequality in 
England and the United States.” British Journal of Sociology 34(3):344-66. 

Robinson, Robert V., and Wendell Bell. 1978. “Equality, Success, and Social Justice in 
England and the United States.” American Sociological Review 43(2):125-43. 

Shokomanov, U. Editor. 2006. Etnodemographichesky Ezhegodnik Kazakhstana:
Statistichesky Sbornik. Almaty, Kazakhstan: Republic of Kazakhstan Statistical Agency. 

Svallfors, Stefan. 1997. “Worlds of Welfare and Attitudes to Redistribution: A Comparison 
of Eight Western Nations.” European Sociological Review 13(3):283-304. 

United Nations Development Program. 2002. Rural Development in Kazakhstan. Available 
at: http://hdr.undp.org/en/reports/nationalreports/europethecis/kazakhstan/ 
name,3208,en.html. 

United Nations Statistics Division. 2008. Statistical Databases. Available at: http:// 
millenniumindicators.un.org/unsd/snaama/selectionbasicFast.asp. 

Zagorski, Krzysztof. 1994. “Hope Factor, Inequality, and Legitimacy of Systemic 
Transformations: The Case of Poland.” Communist and Post-Communist Studies 
27(4):357-76. 

Zaslavsky, Victor. 1995. “Contemporary Russian Society and its Soviet Legacy: the 
Problem of State-dependent Workers in Social Change and Modernization.” Pp. 39- 
56. Lessons from Eastern Europe. B. Grancelli, editor. Aldine de Gruyter. 





Appendix

Table A1. Definitions of Independent Variables, Kazakhstan and Kyrgyzstan Inequality Survey

Independent Variables = Definition

Male = 1, if R is male, otherwise = 0

Age dummies = 18-24, 25-34, 35-44, 45-54, 55-64, over 64 (reference category)

Ethnic minority = 1, if R belongs to a non-titular ethnic group, otherwise = 0

Urban = 1, if R resides in a large city, otherwise = 0

Education = years of education completed

Household income = monthly household income, standardized

Future economic prospects = R response to the following question: “Thinking ahead
about five years, how do you expect your overall household financial situation then
to compare to your situation today: (1) Much better, (2) A little better,
(3) Much the same, (4) A little worse, or (5) A lot worse?”
Note: Values are reversed in analysis.

Owner/Manager (reference category) = 1, if R is an owner or manager, otherwise = 0

Professional = 1, if R is a professional, otherwise = 0

White collar = 1, if R has a white collar job, otherwise = 0

Blue collar = 1, if R has a blue collar job, otherwise = 0

Student = 1, if R is a student, otherwise = 0

Housewife, helping with family business = 1, if R is a homemaker or is helping with
family business, otherwise = 0

Retired = 1, if R is retired due to age or disability, otherwise = 0

Unemployed = 1, if R is unemployed, otherwise = 0

Perceived fairness of economic system = scale computed as mean value of R’s answers
to 7 questions on causes of poverty and wealth.

Religious services attendance = R response to the following question: “Apart from
weddings, funerals and christenings, about how often do you attend religious services
these days?” (1) More than once a week, (2) Once a week, (3) Once a month,
(4) Only on special holy days, (5) Once a year, (6) Less often, (7) Never,
practically never.
Note: Values are reversed in analysis.

Support for shari’a = R response to the following question: “We would like to know
your views about a good government. Which of these traits is (1) Very important,
(2) Important, (3) Somewhat important, (4) Least important, or (5) Not important
for a good government to have? It should implement only the laws of the shari’a.”
Note: Values are reversed in the analysis.
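
The two coding operations mentioned in the appendix, reversing the values of a response scale and computing a scale as the mean of several items, can be sketched in a few lines of Python. This is a hedged illustration rather than the author's procedure: the function names are hypothetical, and the handling of missing answers (averaging over answered items only) is an assumption, since the article does not say how missing responses to individual items were treated.

```python
# Illustrative sketch only: hypothetical helper names; the treatment of
# missing answers is an assumption, not something stated in the article.
from statistics import mean
from typing import Optional, Sequence


def reverse_code(value: int, low: int = 1, high: int = 5) -> int:
    """Flip a response so that the highest category becomes the lowest."""
    if not low <= value <= high:
        raise ValueError("response outside the scale range")
    return low + high - value


def scale_score(answers: Sequence[Optional[int]]) -> Optional[float]:
    """Mean of the answered items; None when every item is missing."""
    answered = [a for a in answers if a is not None]
    return mean(answered) if answered else None


# A five-category item such as future economic prospects:
# the first category on the questionnaire becomes the highest score.
prospects_reversed = reverse_code(1)

# A seven-category item such as religious services attendance.
attendance_reversed = reverse_code(2, low=1, high=7)

# A seven-item scale with one unanswered item (made-up responses).
fairness = scale_score([1, 2, 3, 4, 5, None, 3])
```

Reversing the values simply makes higher numbers correspond to more of the named attribute (more optimism, more frequent attendance, stronger support), which keeps the signs of the regression coefficients easy to read.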




Survivalism and Public Opinion on Criminality: 
A Cross-National Analysis of Prostitution 


Steven Stack, Wayne State University 
Amy Adamczyk, City University of New York 
Liqun Cao, University of Ontario Institute of Technology 


Explanations of variability in public opinion on crime have drawn disproportionately 
from the literature on specific symbolic orientations including religious fundamen- 
talism and racial prejudice. In contrast, this article hypothesizes that public opinion 
is linked to the strength of a general cultural axis of nations: survivalism vs. self- 
expressionism. Data are from the fourth wave of the World Values Survey. Hierarchi- 
cal modeling techniques are used to sort out the bi-level effects of survivalist culture 
on the approval of prostitution. Controlling for all other predictors, the personal 
survivalism index was the most powerful predictor of prostitution acceptability, fol- 
lowed by the country-level survivalism index. Unlike previous investigations, which 
relied on specific symbolic orientations, the present results suggest that attitudes about 
criminality are linked to a generalized cultural axis. 





In democratic societies public opinion is an expression of the will of the people 
and needs to be considered by policy makers in shaping decisions about social 
life. It is frequently cited by political leaders in legitimating their proposals for 
public policy (Baldassarri and Gelman 2008; Beckett 1997; Brooks and Manza 
2006; Browning and Cao 1992; Burstein 1998, 2003; Garland 2000; Jones 1994; 
Shichor 2000; Stack, Cao and Adamczyk 2007). Public opinion is often em- 
pirically linked to social policy. A review of 30 investigations from 1990-2000 
determined that 75 percent of the 52 findings linked public opinion on a variety 
of issues including war, equal opportunity, health care and abortion to policy. In 
general, the more salient the policy issue, the stronger the impact of public opin- 
ion on policy. In addition, the responsiveness of policy makers to public opinion 
over time has not weakened (Burstein 2003). While the majority of the findings 
reviewed by Burstein (2003) were based on a single nation, the United States, re- 
search on other nations has presented results illustrating a synergy between public 
opinion and policy. These include government spending on health in 21 nations 
(Kikuzawa, Olafsdottir and Pescosolido 2008), links between opinion and defense
spending trends in the United Kingdom (Soroka and Wlezien 2005), and public
opinion’s effect on welfare spending in 15 democracies (Brooks and Manza 2006).

Given such reasons as its tie to social policy, there has been much interest in the
predictors of public opinion. Such predictors have included symbolic orientations 
such as political conservatism, authoritarianism and religious fundamentalism, 
as well as the quantity and quality of mass media exposure, and demographics 
(Baldassarri and Gelman 2008; Danigelis, Hardy and Cutler 2008; Finke and 
Adamczyk 2008; Stack and Kposowa 2006, 2008; Tyler and Weber 1982). For 
example, political conservatism is related to public opinion on a host of economic 
issues including health insurance and federal spending to assist poor people, civil 
rights issues including affirmative action and school integration, moral issues 
including abortion, gays in the military and suicide, and foreign policy includ- 
ing defense spending and foreign aid (Baldassarri and Gelman 2008; Stack and 
Kposowa 2008). The literature on the predictors of public opinion has, however,
neglected an assessment of the link between public opinion and broad cultural 
axes of nations such as survivalism vs. self expressionism (Inglehart and Baker 
2000). Such broad cultural dimensions of nations may encompass a number of 
specific symbolic orientations. As such, they might provide a more parsimonious 
explanation of public opinion on specific issues. 

This article focuses on explaining cross-national variation in public opinion on
criminal behavior. In recent decades trends in public opinion about criminality 
have been related to a rise in punitive justice policies. The increase in the use of the 
death penalty in the United States, the fall of indeterminate sentencing, and the 
rise of mandatory minimum sentences are some aspects of the rise in punitiveness. 
For example, prison populations in the United States and abroad have more than 
doubled in the past two decades (Garland 2000; Sutton 2000). Further, work 
that seeks to isolate the causes of public opinion about crime can be used to make 
predictions about such attitudes in the future. Such predictions about opinion 
can be useful for forecasting the associated trends in public policy (Stack, Cao 
and Adamczyk 2007). 

Research on public attitudes towards crime has focused on such specific is- 
sues as opinions regarding the death penalty, law enforcement, confidence in the 
courts, tax fraud, prostitution and factors contributing to a generalized punitive 
orientation (Barkan and Cohn 1994; Cotton 2002; May 1999; Stack 2000, 2003; 
Stack and Cao 1998; Stack, Cao and Adamczyk 2007; Stack and Kposowa 2006; 
Torgler and Schneider 2007; Tyler and Weber 1982; Unnever and Cullen 2006, 
2007; Finke and Adamczyk 2008). As with the more general literature on social
phenomena including economic, moral and foreign policy issues, there are three 
limitations to this body of research. First, previous research has drawn on specific 
demographic factors and symbolic orientations, not generalized cultural systems 
of nations. While there are many focused symbolic orientations such as authori- 
tarianism, religious fundamentalism and political conservatism that are connected 
to public opinion on crime and justice, a unified, parsimonious explanation has 
been largely absent from the literature. Second, due to data limitations, researchers
have been unable to incorporate macro-level factors into their analyses of
how macro factors may affect individual-level attitudes toward crime and justice 
(for an exception see Stack, Cao and Adamczyk 2007). Third, there has been 
relatively little cross-national work. In particular, it is unclear if the results based 
on the particular cultural and institutional framework of American society can be 
generalized to a diverse group of nations. For example, American culture has been 
termed a deviant case given its high ranking in traditionalism, in spite of its high 
level of economic development (Inglehart and Baker 2000). 

The present investigation addresses these three limitations. Specifically, we ap- 
ply a recent development from comparative cultural sociology to the problem of 
public opinion on crime. A parsimonious theory of public opinion is presented, 
which integrates a host of current specific symbolic orientations under one syn- 
thetic or master dimension of societal culture: survivalism vs. self-expressionism 
(Inglehart and Baker 2000). Second, we suggest that residents of nations where the 
survivalist dimension of societal cultural systems is relatively high will be more apt 
to hold negative views towards criminality. Third, unlike most previous investiga- 
tions, our study is cross-national. More than 45,000 persons in 32 nations are 
included. The resulting generalizations on public opinion apply to a diverse set 
of societies from different levels of economic and social development, and from 
different culture zones of the world (Inglehart and Baker 2000). 


An Integrated Theory of Symbolic Orientations and Attitudes towards Crime 
Cultural Axes of Nations: Survivalist Culture 


The research on public opinion about crime encompasses more than a hundred 
books and articles (for reviews see Cullen, Fisher and Applegate 2000; Stack 2004). 
Much of this work is largely descriptive and does not rigorously test hypotheses on 
public opinion. The research that tests explanatory models has drawn principally 
on two broad perspectives: symbolic orientations and instrumentalism (Stack 
2003; Stack and Kposowa 2006; Tyler and Weber 1982; Unnever and Cullen 
2006, 2007). Symbolic orientations are comprised of specific domains includ- 
ing political, social and personality orientations. For example, a “good” political 
conservative would be expected to not only support punitive policies towards 
criminality, but, as a conservative, to favor many other policies including those 
against abortion, a relatively laissez-faire economic organization, and minimal ex- 
penditures for government sponsored health and welfare benefits. In this sense, a 
punitive orientation towards criminality is part of a much larger attitude complex 
in the domain of politics. The second major theme in explanations of public atti- 
tudes towards criminality is instrumentalism, wherein attitudes toward the specific 
problem of crime are largely a function of how important the crime problem is per 
se to the individual. Individuals who have been victims of crime, those who fear
crime and those who see crime as out of control or rising are among those who
might be the most punitive (Tyler and Weber 1982; Stack 2003).

The symbolic orientation approach to explaining variation in public opinion
towards crime has pursued a number of themes including personality, religious 
and political orientations. Personality orientations such as authoritarianism, aggres- 
siveness, empathy and dogmatism have been linked to citizen perceptions of crime 
and punishment (Stack 2000, 2003; Unnever, Cullen and Fisher 2005). Religious 
orientations such as fundamentalism have been the focus of other research (Stack 
2000; Stack and Kposowa 2006; Torgler and Schneider 2007; Unnever and Cullen 
2006; Young 1992). Specialized measures of religiosity including God images, have 
further refined this perspective (Unnever, Cullen and Bartkowski 2006). Racial 
prejudice has also been linked to public opinion on crime and justice (Barkan and 
Cohn 1994; Stack 2000; Unnever and Cullen 2007). Relatively harsh responses to 
crime and justice are often connected to a broad political orientation such as political 
conservatism (Stack 2000; Stack and Cao 1998; Unnever and Cullen 2006). 

These links between various specific symbolic orientations and public opinion 
on crime and justice may be tied together under a master cultural orientation. For 
example, concepts such as racial prejudice, authoritarianism and religious funda- 
mentalism, which have been used extensively in the criminological literature on 
public opinion, may be drawing on an underlying or global concept of intolerance. 
Specifically, we suggest that recent developments in comparative cultural sociology 
provide such a synthetic concept. Ronald Inglehart and his colleagues have collected 
data on the cultural dimensions of nations during four waves of their World Values 
Surveys (Halman, Inglehart et al. 2008; Inglehart 1990, 2003; Inglehart and Baker 
2000; Inglehart and Norris 2003). The surveys conducted in 1980-1983, 1990- 
1993, 1995-1997 and 1999-2001 include 65 nations and represent 75 percent of 
the world’s population (Inglehart and Baker 2000). From this body of work, there 
are several principal findings. First, as nations undergo the industrialization and 
modernization process, their cultural systems tend to shift in a predictable manner 
from a concern with physical and economic security to post-materialist values.
Second, there is secondary variation in such changes. The secondary variation can be 
partially explained by location in a set of eight culture zones of the world including 
Orthodox, Confucian, Protestant Europe and former Communist Europe. 

More than 100 specific values are measured in the World Values Surveys. Using 
factor analysis techniques, these measures have been collapsed into two principal
factors or dimensions of cultural life: survival vs. self-expressionism and traditional
vs. secular rational.


Survivalism vs. Self-Expressionism 


National cultural systems have been classified along a continuum from survival- 
ism to self-expressionism (Inglehart and Baker 2000). Survivalist culture places 
an emphasis on material security. Until recent centuries, world history has over- 
whelmingly been a battle for survival. Famines, disease pandemics, wars, droughts 
and other problems contributed to a widespread sense of insecurity for most 
people. Unfortunately, even today, much of the less-developed world still struggles
for survival against the forces of nature, impoverishment, illiteracy, imperialism,
disease and other adverse circumstances. In response to these material conditions
of economic and political insecurity, there is a relatively high probability that the 
societal cultural system will tend towards the survivalist mode. 

At the root of survivalist culture is economic and physical insecurity. Societies 
that stress survivalist values are marked by a materialistic orientation, which values 
financial security over elements of self-expression. Insecurity contributes to a host 
of values and beliefs. These include a distrust of others including out-groups such 
as racial and ethnic minorities, gays and lesbians, political opponents, political 
minorities, religious minorities, and so on. Insecurity may be further dealt with by
strict obedience to the norms of one’s group and a desire to punish any members
who depart from these norms. These are the breeding grounds for many symbolically
related orientations including prejudice, authoritarianism, intolerance,
dichotomous thinking, religious fundamentalism, political conservatism, less em- 
pathy for others and generally low tolerance of individual differences (Inglehart 
and Baker 2000; Inglehart and Norris 2003). Hence, the concept of survivalism 
integrates many of the specific symbolic orientations that have been used to ex- 
plain a variety of public attitudes towards criminality. 

At the other end of the survivalist vs. self-expressionist cultural continuum rests 
the culture of individualism or self-expression. Self-expressionism represents
post-materialism and taps a syndrome of trust, tolerance, subjective well-being and
political activism. Self-expressionism thrives on the tolerance of individual differences.

The economic and physical security that underlies self-expressionism is associated
with a new set of values such as environmentalism and active citizen involvement
in social and political affairs. The chief concerns of parents in socializing children
shift from teaching the value of hard work to teaching tolerance of individual
differences. Applied to the criminological literature on public opinion, persons high in
self-expressionism would be expected to be low in authoritarianism, low in religious 
fundamentalism, high in political liberalism, high in empathy and so forth. 

The axis of survivalist vs. self-expressive culture is predictive of specific value sets 
and behaviors. For example, the lower a nation’s score on the survivalist cultural
index, the higher its level of gender equality (Inglehart and Norris 2003). However,
this cultural perspective has not been widely applied in the criminological
literature. One limited exception is work on the Netherlands. Observers of the Dutch
legal reaction to crime and deviant behavior (including prostitution, drugs and
euthanasia) have interpreted the leniency of the criminal justice system as part 
of the larger, historically based Dutch culture of tolerance (Buruma 2007). For 
example, today in the Netherlands prostitution is not a crime. Sex workers have a 
union, pay taxes and qualify for unemployment insurance as well as social security. 

It is hypothesized that the tolerance characteristic of cultures high in
self-expression would lower the odds of disapproval of public order crimes such as
prostitution. In contrast, cultural systems high in survivalism would be less
approving of nontraditional behaviors including prostitution.

A related perspective on world cultural systems deals with the concept of cul- 
ture zones (Inglehart and Baker 2000). The relationship between modernization 
and the decline of survivalist culture proceeds at different velocities and intensities. 
Secondary variation in survivalism is encountered as modernization flows through 
the cultural apparatus of the eight world culture zones. These zones are funda- 
mentally distinguished by centuries old religious traditions. The present analysis 
includes controls for indicators of these culture zones. This part of the analysis 
is largely exploratory, constituting a new point of departure for criminological 
analysis of public opinion. 

The present research analyzes the association between survivalist culture and 
individual attitudes concerning criminality, herein prostitution. Some caution 
should be exercised in generalizing the results of this research to public opinion on 
criminality in general. Prostitution is arguably a kind of crime that is more easily 
viewed as “victimless” when compared to violent crime and predatory forms of 
theft. Prostitution, substance abuse, gambling and other crimes without victims 
may be more apt than crimes with clear victims to be seen as alternative
lifestyles and possibly encompassed by “self-expressionism,” indicated herein by a low
survivalism score. The reader is cautioned that while we often refer to prostitution
as a generic form of criminality, our findings may apply best to crimes without 
victims. Future work will be needed to assess the influence of survivalism on the 
acceptability of crimes with victims. It is anticipated that while the acceptability 
of crimes with victims would not be as strongly related to survivalism as crimes 
without victims, the social reaction to such crimes with victims would be. That 
is, survivalism may not be strongly related to the acceptability of crimes such as 
murder, which nearly everyone is against. However, the punishments for murder
(e.g., death penalty) would be strongly predicted by survivalism (e.g., its
intolerance of deviant behavior).


Previous Work on Public Opinion on Prostitution 


There are 10 previous investigations on the predictors of public opinion concern- 
ing prostitution (Abrams and Fave 1976; Basow and Campanile 1990; Cosby et 
al. 1996, 2002; McCaghy and Cernkovich 1991; May 1999; Peracca, Knodel and 
Saengtiencham 1998; Polk and Cowan 1996; Rasanen and Wilska 2007; Sawyer 
et al., 2001). Table 1 presents the authors, dates, sample size and principal find- 
ings of the 10 previous investigations. These have been based overwhelmingly on 
samples within a single nation, principally the United States. It is not clear if their 
findings are generalizable to the world at large. 

Most of these investigations are further limited in their generalizability given their 
disproportional reliance on student and/or local samples. Some were largely qualita- 
tive descriptive studies. For example, a study of 88 adults in Thailand used a focus 
group design to simply demonstrate that while there was a strong negative view of
sex workers, a majority of focus group participants thought that it was acceptable
for sex workers to marry.

In addition, most contain findings on at least one symbolic orientation that is
linked to prostitution acceptability. Educational attainment, often used as a crude
indicator of social liberalism, is a significant predictor of prostitution
acceptability in two investigations. Three investigations found an effect of
religiosity on prostitution attitudes. Political ideology and authoritarianism are
included in one study each and are found to predict prostitution attitudes. Only
one study by May (1999) employed a “tolerance index.” However, its four-item
tolerance index only included questions on the legitimacy of casino gambling,
which limited its scope. None of these investigations draws on a primary axis of
societal culture for its explanatory scheme.


Table 1: Previous Investigations on Public Opinion Concerning Prostitution

(A) First Author             (B) Date
Abrams, et al.               1976
Basow and Campanile          1990
McCaghy and Cernkovich       1991
Peracca, et al.              1998
Rasanen and Wilska           2007
Sawyer, et al.               2001
Cosby, et al.                1996
Cotton, et al.               2002
Polk and Cowan               1996
May                          1999

(C) Sample (row order not preserved): 89 students, Lafayette College (1990);
    161 students, Virginia Polytechnic University; 1,514 adults, USA; 783 students;
    1,491 adults, USA; 413, Toledo, Ohio; 160 students, California;
    1,208 students, Finland; 140 Johns; 88 adults, Thailand (1998)
(D) N Variables (legible entries only): 8; 2; n.a.
(E) Significant Predictors of Opinion Concerning Prostitution (row order not preserved):
    Gender (m), race (white), region (West), religious affiliation, tolerance index,
    marital status and education; Political ideology; n.a.; n.a.;
    Authoritarianism and religiosity; Gender (m), race (white), age, Catholicism
    and region; Gender (m) and offender experience; Gender and major;
    Age and education; Feminism

Notes: (D) N Variables refers to number of independent variables.


Control Variables


A set of control variables from other theoretical and demographic perspectives on
public opinion is incorporated into our present model. First, from control theory,
bonds to conventional social institutions provide individuals with a stake in the
social order. This stake gives them something to lose from adopting deviant beliefs
and/or engaging in deviant behavior (Hirschi 1969; Stack, Cao and Adamczyk 2007).
The present investigation employs measures of ties to marriage, family and religious
institutions as measures of control theory. The degree of involvement in
conventional religious institutions can be assessed by the amount of time spent
with co-religionists. Persons with greater involvement in social support networks 
of co-religionists have more to lose if they are caught holding deviant beliefs or 
engaging in deviant actions. 

Controls are also drawn from social learning theory. Exposure to values and 
norms militating against prostitution and relegating sexual behavior to marriage 
should reduce individuals’ acceptance of prostitution. To the extent that religions
differ in their level of condemnation of prostitution, we would expect that affili- 
ation with more conservative religions would be associated with less approving 
attitudes. The degree of exposure to religious teaching is also expected to predict 
attitudes. Level of education will be employed as a measure of social liberalism. 

The present study also employs controls for demographic variables including 
age and gender, factors which are generally linked to public opinion on crime 
and deviant behavior (Tyler and Weber 1982; Stack, Cao and Adamczyk 2007; 
Stack and Kposowa 2006). Finally, financial strain has been often linked to devi- 
ant behaviors and attitudes (Torgler and Schneider 2007), and a measure will be 
incorporated into the present analysis. 


Methodology 


Data are from the fourth wave of the World Values Survey (Inglehart 2004). 
The analytic sample refers to 32 nations for which complete data were available.
Typically less than 3 percent of the cases were missing data on any given variable.
We plotted the quartiles of missing data by country and found that they followed
a normal distribution. We then used a formal skewness/kurtosis test of normality
and found that we were unable to reject the null hypothesis. Differences in
nonresponse (missing) values appeared to be random across nations. 

In order to make full use of the data, we present our results using multiple impu- 
tation techniques, which rely on other variables in the model (including country- 
level variables) to impute missing data. For this procedure several datasets are created 
(in this case five), and the data are then summarized across the results. Multiple 
imputation techniques are considered one of the best ways to handle missing data
because they take full advantage of all of the available information (e.g.,
adjusting missing responses according to other individual and country characteristics
included in the model), and adjust the standard errors so that they account
for the uncertainty included with simulated data (something that is not done with
deterministically imputed data) (Allison 2001). Missing data were imputed using a
procedure written by Royston (2004) based on a technique outlined in van Buuren
et al. (1999). The final parameter estimates are the averages of the parameter estimates
obtained from each regression produced through the imputation option in the
statistical computing program, Hierarchical Linear and Nonlinear Modeling (Raudenbush,
Bryk and Congdon 2004). Our final individual-level sample size consists of 45,102 
respondents. See Appendix 1 for a list of the nations and number of respondents 
within each that are included in our analysis. Table 2 presents descriptive statistics
for all variables included in the analysis.


Table 2: Descriptive Statistics for Variables Included in the Analysis

                                                  Mean     SD       Min.    Max.
Country-Level Variables
  Country mean of survivalism                     2.75     .58      [?]     3.66
  Logged GDP per person                           3.07     .65      2.39    4.6
  Logged population                               7.42     [?]      5.84    8.98
  Country Cultural/Religious Tradition
    Catholic                                      .28      .46      0       1
    Protestant                                    .09      .3       0       1
    Hindu                                         .03      .18      0       1
    Confucian                                     .09      [?]      0       1
    Orthodox                                      [?]      .34      0       1
    Muslim                                        .38      [?]      0       1
Individual-Level Variables
  Prostitution acceptability                      .28      .45      0       1
  Individual survivalism                          2.74     [?]      0       5
  Social Control Measures
    Married                                       .58      .49      0       1
    Number of children
      1-2 children                                [?]      .48      0       1
      3-4 children                                [?]      .42      0       1
      ≥ 5 children                                [?]      [?]      0       1
    Amount of time spent with people from church  2.47     1.24     1       4
  Social Learning Measures
    Religious attendance                          4.5      2.07     1       7
    Religious importance                          [?]      .92      1       4
    Religious affiliation
      No religion                                 .14      .34      0       1
      Catholic                                    .22      [?]      0       1
      Protestant                                  .14      .34      0       1
      Orthodox                                    .08      .28      0       1
      Jewish                                      0        .06      0       1
      Hindu                                       .04      .2       0       1
      Buddhist                                    .03      .16      0       1
      Muslim                                      .26      [?]      0       1
      Other religion                              .05      .21      0       1
      All else                                    .04      [?]      0       1
    Education                                     5.2      2.44     1       9
  Demographic Controls
    Age                                           39.04    15.36    15      97
    Female                                        [?]      [?]      0       1
    Financial satisfaction                        [?]      2.04     1       10

Note: Individual N = 45,104; Country N = 32
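
The pooling step of the multiple imputation procedure described above can be sketched with Rubin's rules. This is a minimal illustration under stated assumptions, not the authors' HLM routine: the function name `pool_estimates` and the five coefficient and standard-error pairs are hypothetical values invented for the example.

```python
import math


def pool_estimates(estimates, std_errors):
    """Pool one coefficient across m imputed datasets using Rubin's rules."""
    m = len(estimates)
    # Pooled point estimate: the simple average across imputations.
    q_bar = sum(estimates) / m
    # Within-imputation variance: average of the squared standard errors.
    within = sum(se ** 2 for se in std_errors) / m
    # Between-imputation variance: sample variance of the point estimates.
    between = sum((q - q_bar) ** 2 for q in estimates) / (m - 1)
    # Total variance inflates the within part by the imputation uncertainty.
    total = within + (1 + 1 / m) * between
    return q_bar, math.sqrt(total)


# Hypothetical log-odds coefficients and standard errors from five imputations.
coefs = [-0.22, -0.20, -0.21, -0.23, -0.19]
ses = [0.012, 0.011, 0.012, 0.013, 0.012]
pooled_coef, pooled_se = pool_estimates(coefs, ses)
```

The pooled standard error exceeds every single-dataset standard error because it also carries the variation between imputations, which is the adjustment the paragraph above attributes to multiple imputation.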


Dependent Variable 


Our key outcome variable is prostitution acceptability, which is measured using 
a single question that asks whether prostitution can always be justified, never be 
justified or something in between. Responses range from always wrong = 1 to
always right = 10. A strong global disapproval was registered on this measure. 
Fully 71 percent of our sample reported that prostitution was always wrong (never 
justifiable). Given this skewness in the distribution, a dichotomous variable was 
used where 0 = always wrong, and 1 = all others. 


Individual-Level Variables 


Individual-level variables include the degree of survivalism measured on the five-item
index from Inglehart and Baker (2000) and a set of control variables drawn
from previous work on public opinion towards crime and deviant behavior (Stack
and Kposowa 2006; Stack, Cao and Adamczyk 2007). The index of survivalist
values is taken from previous work in comparative cultural sociology (Inglehart
1990, 2000). It is composed of five items, all coded as binary variables. These items
are: homosexuality is never justifiable (0,1), you have to be very careful about
trusting people (0,1), the respondent has not and would not sign a petition (0,1),
the respondent gives priority to economic security over self-expression (0,1), and the
respondent describes himself/herself as not very happy (0,1). The index of survivalist
values ranges from 0 = low survival values (high individual self-expression) to 5 =
high survival values (low individual self-expression). The five-item index is highly
correlated with a larger eleven-item index, r = .96 (Inglehart and Baker 2000).
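
A 0 to 5 index built from five binary items can be sketched as a simple sum. The sketch below assumes the index is additive, which the 0 to 5 range implies but the text does not state outright; the item keys and the example respondent are hypothetical names chosen for illustration, not World Values Survey variable codes.

```python
# Hypothetical keys for the five binary survivalism items described in the text.
SURVIVALISM_ITEMS = (
    "homosexuality_never_justifiable",
    "careful_trusting_people",
    "would_not_sign_petition",
    "priority_economic_security",
    "not_very_happy",
)


def survivalism_index(respondent):
    """Sum five 0/1 items: 0 = low survival values, 5 = high survival values."""
    values = [respondent[item] for item in SURVIVALISM_ITEMS]
    if any(v not in (0, 1) for v in values):
        raise ValueError("each item must be coded 0 or 1")
    return sum(values)


# Invented respondent who endorses three of the five survivalist items.
example = {
    "homosexuality_never_justifiable": 1,
    "careful_trusting_people": 1,
    "would_not_sign_petition": 0,
    "priority_economic_security": 1,
    "not_very_happy": 0,
}
score = survivalism_index(example)
```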


Social Control Theory 


We include three measures of bonds to conventional institutions: marriage, num- 
ber of children and time spent with people who share one’s religion. Marriage is 
measured as a binary variable where 1 = married and 0 = all others. Attachment to
the institution of parenthood is measured with the number of children employing 
a series of binary variables where 1-2 children (0, 1), 3-4 children (0, 1), and five 
or greater (0, 1). The reference category is zero children. Involvement with co- 
religionists is measured with a single question that assesses how much respondents 
participate in activities with people at their churches, mosques or synagogues. 
Question responses were reverse coded so that 1 = not at all and 4 = weekly. 


Social Learning Theory 


The degree to which people learn norms against sexual deviance, including pros- 
titution, is assumed to vary across the world’s religious systems. We, therefore, in- 
clude a series of binary variables that measure religious affiliation: Buddhist (0,1), 
Catholic (0,1), Hindu (0,1), Jewish (0,1), no religious affiliation (0,1), Orthodox 
(0,1), Protestant (0,1), other religion (0,1), all else (0,1), which includes people 
who were not asked the question, don’t know or for whom the question was not 
applicable. Islam is used as the reference category. Given Islam’s especially severe
doctrinal teaching against sexual deviance, it is assumed that Western religions will be
more accepting of prostitution than Islam (Zaehner 1988).
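
The reference-category coding described above can be illustrated with a short sketch. The category labels follow the text, while the function name `affiliation_dummies` and the string values are assumptions made for the example. A respondent in the reference category (Islam) scores 0 on every binary variable.

```python
# The nine binary affiliation variables named in the text; Islam is the omitted reference.
AFFILIATION_DUMMIES = (
    "Buddhist", "Catholic", "Hindu", "Jewish", "No religion",
    "Orthodox", "Protestant", "Other religion", "All else",
)
REFERENCE_CATEGORY = "Muslim"


def affiliation_dummies(affiliation):
    """Return a dict of 0/1 indicators, all zero for the reference category."""
    if affiliation != REFERENCE_CATEGORY and affiliation not in AFFILIATION_DUMMIES:
        raise ValueError(f"unknown affiliation: {affiliation}")
    return {name: int(affiliation == name) for name in AFFILIATION_DUMMIES}


catholic_row = affiliation_dummies("Catholic")
muslim_row = affiliation_dummies("Muslim")
```

With this coding, each affiliation coefficient is read as a contrast with Muslim respondents, which is how the odds ratios for religious affiliation are interpreted.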

The degree of exposure to religious teaching is measured as the frequency of 
attendance at religious services. Question responses were reverse coded so that 
1 = never or practically never attend religious services and 7 = more than once
a week. The importance of religion is measured by a single question that asks, 
“Independently of whether you go to church or not, would you say you are a 
religious person, not a religious person, or a convinced atheist?” Responses were 
reverse coded so that the highest category, 3, indicates “a religious person.” 

Because education is associated with social liberalism, a respondent's level of 
education is taken as a measure of social learning theory. Education is taken from 
a question that asks respondents, “What is the highest educational level that you 
have attained?” Response categories range from 1 = no formal education to 9 = 
university with degree. 

Finally, we control for demographics. Age is measured in years. Gender is 
measured as a binary variable where female = 1 and male = 0. In addition, financial
satisfaction is measured with a single question that asks respondents to rank on a 
10-point scale how satisfied they are with the financial situation of their household. 
Higher numbers indicate greater satisfaction. 


Country Variables 


Our key country-level indicator is the mean level of the survivalist culture index 
in each of the 32 nations. A second set of macro variables that approximate the
culture zones referred to in Inglehart and Baker (2000) is also included. Five
binary variables measure the principal religious tradition of the country where the
respondent is a resident. They are: Catholic (0,1), Confucian (0,1), Hindu (0,1),
Orthodox (0,1) and Protestant (0,1). Islam constitutes the reference category. We
also control for Gross Domestic Product per person and population, both of
which are logged and taken from the CIA’s 2004 World Factbook.


Statistical Methods 


To assess the association of national survivalist culture and individual
survivalist orientation with approval of prostitution, we use generalized mixed
models (Breslow and Clayton 1993). Unlike conventional regression models,
hierarchical modeling correctly estimates the standard errors of the contextual 
variables (Raudenbush and Bryk 2002). This method adjusts for the correlated 
errors among individuals within the same context (e.g., country) and uses the 
appropriate degrees of freedom for country-level units. As a result, coefficients at 
the individual and contextual levels are simultaneously estimated. 

Before examining any explanatory variables, we first estimate a baseline model. 
This model is necessary to assess whether attitudes about prostitution acceptability 
vary significantly across nations. This model will also tell us the amount of varia- 
tion to be explained at the individual and country levels. 
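
One common way to summarize how much variation lies at the country level in a baseline two-level logistic model is the latent-variable intraclass correlation, which fixes the individual-level variance at π²/3. The sketch below assumes that approach; the article does not say which summary it used, and the country-level variance passed in is an illustrative value, not a reported result.

```python
import math


def latent_icc(country_variance):
    """Country-level share of latent-scale variance in a two-level logit model."""
    if country_variance < 0:
        raise ValueError("variance must be non-negative")
    # Variance of the standard logistic distribution, used as the level-1 variance.
    level1_variance = math.pi ** 2 / 3
    return country_variance / (country_variance + level1_variance)


# Illustrative country-level variance of 1.0 on the log-odds scale.
icc = latent_icc(1.0)
```

Under this assumption a country-level variance of 1.0 would place a little under a quarter of the latent variation between countries, with the remainder between individuals within countries.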

After estimating a baseline model, we assess the contribution of individual-level 
variables for attitudes about prostitution acceptability. Formally, the individual- 
level model with all of the individual-level measures is: 
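
\[
\Pr(\text{Prostitution acceptability}_{ij} = 1 \mid \beta_j) = \phi_{ij}
\]

\[
\begin{aligned}
\log\left[\frac{\phi_{ij}}{1 - \phi_{ij}}\right] ={}& \beta_{0j} + \beta_{1j}(\text{Individual self-expression vs. survival values}) \\
&+ \beta_{2j}(\text{Married}) \\
&+ \beta_{3\text{-}5j}(\text{Number of children}) \\
&+ \beta_{6j}(\text{Time spent with people at church}) \\
&+ \beta_{7j}(\text{Religious attendance}) \\
&+ \beta_{8j}(\text{Religious importance}) \\
&+ \beta_{9\text{-}17j}(\text{Religious affiliation}) \\
&+ \beta_{18j}(\text{Education}) \\
&+ \beta_{19j}(\text{Age}) \\
&+ \beta_{20j}(\text{Female}) \\
&+ \beta_{21j}(\text{Financial satisfaction}) + r_{ij}
\end{aligned}
\]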

where i indexes individuals and j indexes country-level influences. The distribution
of $r_{ij}$ is assumed to be normal with a mean of 0 and variance of $\sigma^2$.
To explore the effects of a survivalist national culture on prostitution acceptability,
we model the intercept as a function of GDP, population, country cultural/religious
tradition, and country mean of the national survivalist index. Because our sample contains
only 32 countries, there are limited country degrees of freedom, which can make it 
difficult for the model to converge and produce meaningful results. While we have 
theoretical reasons to think that all of our country-level variables should be included 
in the model, they may not be statistically (p < .05) important. Hence, we first test 
each of the country variables separately. If any of the country variables are significant 
by themselves or when included with the national survivalist index, we include them 
in the final set of models with our key country-level predictor—national survivalist 
culture. If all of the variables are significant, the final intercept model will be: 


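\[
\begin{aligned}
\beta_{0j} ={}& \gamma_{00} + \gamma_{01}(\text{Logged GDP}) \\
&+ \gamma_{02}(\text{Logged population}) \\
&+ \gamma_{03\text{-}08}(\text{Country cultural/religious tradition}) \\
&+ \gamma_{09}(\text{Country mean of self-expression vs. survival culture}) + u_{j}
\end{aligned}
\]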


In this model j indexes the country-level influence, $\beta_{0j}$ is the intercept term from
the individual-level equation (representing attitudes about the acceptability of
prostitution, adjusted for individual attributes), and $u_{j}$ is a country-level random
disturbance assumed to be normally distributed with a mean of 0 and variance of
$\tau$. The intercept will help us estimate the coefficient for the relationship between
a survivalist national culture and attitudes about the acceptability of prostitution.
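
Because the models are written on the log-odds scale while results are reported as odds ratios, it can help to see the two conversions side by side. The sketch below is a generic illustration; the helper names `inverse_logit` and `odds_ratio` and the numbers used are hypothetical and are not taken from the article's tables.

```python
import math


def inverse_logit(log_odds):
    """Convert a log-odds value to a probability."""
    return 1.0 / (1.0 + math.exp(-log_odds))


def odds_ratio(coefficient):
    """Convert a log-odds coefficient to an odds ratio."""
    return math.exp(coefficient)


# A log-odds of 0 corresponds to a probability of one half.
even_chance = inverse_logit(0.0)

# A hypothetical intercept expressed as odds of 0.4 implies this probability.
baseline_probability = inverse_logit(math.log(0.4))

# A negative coefficient yields an odds ratio below 1 (lower odds of acceptance).
example_or = odds_ratio(-0.1)
```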

Aside from dummy variables, all variables in the analysis are centered (mean 
= 0), which means that the intercept term represents the average attitude about 
Table 3: Results of the Effect of Individual Survivalist Culture on Prostitution Acceptability


Model 1 Model 2 Model 3
Odds Ratios t-Ratios Odds Ratios t-Ratios Odds Ratios t-Ratios
Intercept AO 2-538 PAZ 91372 ned? 410178 
Individual survivalism 00*** — -18.06 
Social Control Measures 
Married 94 -142 90° -2.29 
1-2 children 97 -75 1.00 09 
3-4 children (O3—  121/42> 298 -.40 
> 5 children 1.01 108 Rede it .92 
Amount of time spent with people from church oi e2/Ole y.02.)- --0.00 
Social Learning Measures 
Religious attendance OG ee 1A lia) p26 4 52185 
Religious importance Bos eOde soko Eb. 5 
Religious Affiliation 
No religion ISSa a OOO Sennen C00 
Catholic 20P ES AGT 2008s Wi15 
Protestant TSI e 83. 15 cee oF 3.19 
Orthodox IO Sie Si BW Ne WL ak AS Ss he) 
Jewish Sa OOo 200 One 
Hindu VASP SOT 115 2"F 539 
Buddhist 1,03 ee 68.) 1.8072 01 
Other religion TN tie oak hil Castanea 30 
All Else 2A sds 2.40 8 EoOS 
Education 1305") 72:59" 54,01 36 
Demographic Controls 
Age gor* -402 99% -3.01 
Female Of eeO,0h5 CO a) Bnee.co 
Financial satisfaction TO2 Rr? Gos. M00 .00 
Variance Oo, 1.50 .99 1.01 
Variance t,” 1.00" 1.09*** eat 


Outcome: Prostitution, where 0 = not accepting; 1 = accepting
Individual N = 45,104; Country N = 32
*p < .05 **p < .01 ***p < .001
Zero is the reference category for number of children. Muslim is the reference category for religious affiliation.


prostitution for people who are assigned the suppressed category for all dummy 
variables and the average value on all other variables. To appropriately analyze the 
data as nationally representative samples, we use the recommended weight, which 
accounts for the unequal probability of selection within nations. 
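
As a minimal illustration of the centering step, the Python sketch below grand-mean centers a continuous predictor so that a value of 0 corresponds to the sample average. The helper name and the ages are hypothetical; the survey weights are not applied here.

```python
def center(values):
    """Grand-mean center a list of numbers so the centered mean is 0."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


ages = [25, 40, 55, 70]          # hypothetical respondent ages
centered_ages = center(ages)     # deviations from the mean age of 47.5
```

With every non-dummy predictor centered in this way, setting all predictors to 0 describes a respondent at the average of each continuous variable and in the reference category of each dummy, which is the case the intercept describes.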


Results 


Tables 3 and 4 present the multivariate analysis of the relationship between in- 
dividual and country survivalist values and attitudes about the acceptability of 
prostitution. In Model 1, Table 3, the variance component estimate indicates 
that 74 percent of the variation in attitudes about prostitution is at the individual
level and 26 percent of the variation is at the country level. Compared to other 
outcomes modeled with hierarchical models, 26 percent is quite high, suggesting 
that a much higher-than-average share of attitudes about prostitution is shaped 
by the country context. 
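
The text does not state how the 74/26 split was computed. One common convention for two-level logit models, assumed here purely for illustration, treats the individual-level variance as fixed at π²/3 on the latent scale, so that the country share is τ_00 / (τ_00 + π²/3). The function name and the τ_00 value below are hypothetical.

```python
import math


def country_share(tau_00, sigma_sq=math.pi ** 2 / 3):
    """Share of total variance at the country level (intraclass correlation).

    sigma_sq defaults to pi^2 / 3, the latent-scale variance of the standard
    logistic distribution; pass an estimated level-1 variance to override it.
    """
    return tau_00 / (tau_00 + sigma_sq)


# A hypothetical country-level variance of about 1.16 gives a share near .26.
share = country_share(1.156)
```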

Model 2 presents the influence of the individual-level control variables on 
attitudes about prostitution’s acceptability. The only social control measure that 
is significant is the amount of time devoted to activities with people from one’s 
synagogue, mosque or church. A one-unit increase in the amount of time spent 
with co-religionists is associated with a 7 percent decrease (= .07/.93) in the odds
that the respondent will find prostitution acceptable. Three of the four social
learning measures are significant. Religious importance is associated with more 
conservative prostitution attitudes. A one-unit increase in religious importance is 
associated with a 20 percent reduction (= .17/.83) in the odds that the respondent 
will find prostitution acceptable. Not surprisingly, people who identify as Catholic,
Protestant, Orthodox, Jewish, Hindu, Buddhist, another religion or no religion, and
those who did not answer the question or do not know their religion, have more liberal
attitudes about prostitution than people who identify as Muslim. As expected, people with
a higher level of education have more accepting prostitution attitudes. Our three 
demographic variables were significant. Whereas older people and women have 
more conservative attitudes about prostitution, people who are more financially 
satisfied hold more liberal attitudes. 
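
For readers converting odds ratios to percentages, the following Python sketch shows two related quantities. The function names are hypothetical. Under the conventional reading, the percent change in the odds for a one-unit increase is (OR - 1) × 100; a ratio of the form (1 - OR)/OR, such as .17/.83, equals 1/OR - 1 and gives the proportional change in the odds for a one-unit decrease.

```python
def pct_change_in_odds(odds_ratio):
    """Percent change in the odds for a one-unit increase in the predictor."""
    return (odds_ratio - 1.0) * 100.0


def pct_change_for_unit_decrease(odds_ratio):
    """Percent change in the odds for a one-unit decrease in the predictor."""
    return (1.0 / odds_ratio - 1.0) * 100.0


# Odds ratio of .83 (religious importance, as implied by .17/.83 in the text).
increase_effect = pct_change_in_odds(0.83)            # about -17 percent
decrease_effect = pct_change_for_unit_decrease(0.83)  # about +20.5 percent
```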

Model 3 introduces the individual self-expression vs. survival values index. 
As expected, respondents who have a stronger survivalist orientation have more 
conservative attitudes about prostitution. A one-unit increase in the individual 
self-expression vs. survival values index is associated with an 81 percent decrease (= 
.45/.55) in the odds of having accepting attitudes towards prostitution. Together,
the individual-level variables in Model 3 explain 11.52 percent of the total
variance in the model.

Table 4 introduces the macro-level variables. The first two models include 
logged GDP and population in addition to the individual-level variables. Neither 
GDP nor population is significant when included without other country-level 
variables, and in a separate analysis we found that they do not become significant 
when included with the indicator for national survivalist culture. To preserve 
degrees of freedom, they are not included in subsequent models. 

Model 6 includes the five country cultural/religious traditions. Respondents 
residing in Catholic and Hindu countries appear more accepting of prostitution 
than respondents in Muslim countries/territories. 

The national self-expression vs. survival values index is included in Model 7. 
Regardless of their personal survivalist orientation, people in countries/territories 
that have a stronger survivalist orientation have more conservative attitudes about 
prostitution. A one-unit increase in the country survivalist index is associated with 
a 55 percent decrease (= .32/.58) in the odds of having liberal attitudes towards
prostitution. When compared to Model 3 of Table 3, this model explains an
additional 14 percent of the variance across countries.

The final model includes the country cultural/religious tradition. The coef- 
ficient for the country-level survivalist index changes minimally and remains 
significant when these other variables are included. However, there is no longer a 
significant difference in prostitution attitudes between people residing in Catholic 
vs. Muslim countries when the index is included with the country cultural/reli- 
gious traditions. The coefficient for Hindu cultural/religious tradition changes 
minimally, with people residing in Hindu countries/territories having more ac- 
cepting attitudes towards prostitution than people residing in Muslim nations. 
This model explains 62 percent of the variance across countries, and 16 percent of 
the total variance (individual and country). 

To gauge the influence of the country survivalist index relative to other vari-
ables in the model, we can multiply significant (p < .05) log-odds coefficients by
their standard deviations (Pampel 2000). Applying this rule to Model 8, we see that the
personal survival index (-.70 = -.62*1.12) is the most powerful predictor, followed
by the country survival index (-.34 = -.58*.58). These variables are followed by
several of the individual-level religious/denominational categories (in decreasing 
size: Catholic, no religion, N/A and don’t know, Orthodox and other religion). 
Next comes age (-.10 = -.01*15.36), followed by female (-.081 = -.16*.50), reli- 
gious importance (-.076 = -.18*.42), married (-.05 = -.101*.49) and activities with 
people from one’s place of worship (-.027 = -.084*.32). 
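
The ranking rule can be sketched in a few lines of Python. The coefficients and standard deviations below are the rounded values printed in the paragraph above; the dictionary and function names are hypothetical. Because the printed coefficients are rounded, some products differ slightly from the reported ones (age works out to about -.15 here rather than -.10), but the ordering is the same.

```python
# (log-odds coefficient, standard deviation), rounded values from the text
predictors = {
    "personal survival index": (-0.62, 1.12),
    "country survival index": (-0.58, 0.58),
    "age": (-0.01, 15.36),
    "female": (-0.16, 0.50),
    "religious importance": (-0.18, 0.42),
    "married": (-0.101, 0.49),
    "activities with co-religionists": (-0.084, 0.32),
}


def semi_standardized(preds):
    """Scale each coefficient by its predictor's SD and rank by magnitude."""
    scaled = {name: b * sd for name, (b, sd) in preds.items()}
    return sorted(scaled.items(), key=lambda item: abs(item[1]), reverse=True)


ranked = semi_standardized(predictors)
for name, effect in ranked:
    print(f"{name}: {effect:.3f}")
```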


Discussion 


Social scientists have neglected cross-national investigations of public opinion on 
criminality and deviant behavior, especially as they relate to macro level, cultural 
systems. The present study is the first to build a model that uses a general index 
of societal cultural systems as a principal and parsimonious predictor of public 
opinion. This sweeping cultural index encompasses many of the specific symbolic 
orientations such as prejudice, religious fundamentalism, authoritarianism and 
political conservatism, which have dominated most of the work not only on 
public attitudes towards prostitution, but public opinion about the death penalty, 
the criminal justice system, gun ownership, and other crime and justice issues 
(Barkan and Cohn 1994; Stack 2000, 2003; Stack and Cao 1998; Stack, Cao, and
Adamzyck 2007; Torgler and Schneider 2007; Unnever, Cullen and Fisher 2005; 
Unnever and Cullen 2006; Unnever, Cullen and Bartkowski 2006; Young 1992). 
The results demonstrate that globally, survivalism provides the most powerful 
predictor of public opinion about prostitution. 

In contrast to the present investigation, all previous studies on the specific is- 
sue of public opinion regarding prostitution have used rather limited theoretical 
frameworks. Many of these studies simply associated demographic variables (i.e., 
gender) with attitudes, or focused on a very specific, single quasi-symbolic
orientation such as tolerance as measured by attitudes towards casino gambling
(Abrams and Fave 1976; Basow and Campanile 1990; Cosby et al. 1996; Cotton et al.
2002; McCaghy and Cernkovich 1991; May 1999; Peracca, Knodel and Saengtiencham
1998; Rasanen and Wilska 2007; Sawyer et al. 2001). The present investigation,
instead, builds a theoretical framework around a master cultural axis of nations.
The analysis found that this axis (survivalism vs. self-expressionism) is a
significant predictor of attitudes about a specific criminal behavior, prostitution.

The cultural axis of survivalism vs. self-expressionism was the leading predictor
of attitudes toward prostitution. First, at the individual level, controlling for
the other variables in the model, individual-level survivalism was the most
important predictor of prostitution acceptability. The disapproval of prostitution,
then, may be largely a function of a larger cultural nexus. Second, the national
level of survivalist culture was also a significant predictor of individual-level
prostitution acceptability. Indeed, the national level of survivalist culture was
the second most powerful predictor of individual-level attitudes. Hence, both parts
of the bi-level model held up, confirming that the broad cultural axis of
survivalism is a key predictor of public opinion on prostitution. This is the first
article demonstrating the importance of the application of comparative cultural
sociology to the problem of public opinion on criminality. Specifically, public
opinion on prostitution is found to be related to a much larger cultural nexus,
survivalism.

Table 4 (notes)
Outcome: Prostitution, where 0 = not accepting; 1 = accepting
Individual N = 45,104; Country N = 32
*p < .05 **p < .01 ***p < .001
Zero is the reference category for number of children.
Muslim is the reference category for religious affiliation.
For models 6 and 8 the intercept's significance and standard error could not be computed, likely because of the high number of level-2 variables in the model.

A secondary new finding of the present investigation refers to religious affili- 
ation, specifically that of Islam. No previous investigations of public opinion on 
prostitution have included Muslim nations and no previous investigations used 
a measure of religion that included Islam. This was a function of focusing on 
samples restricted to the United States where few Muslims reside. Our study is 
the first to draw attention to the strong norms against prostitution among the 
Muslims of the world. Literally persons of all other religious faiths were more 
approving of prostitution than the world’s Muslims. Future work is needed to 
interpret the religious, historical and other root causes of this strong condemna- 
tion of prostitution (Zaehmner 1988). 

The present study focuses on attitudes toward a single deviant behavior, prostitu-
tion. Its focus on prostitution is justified in terms of prostitution’s relationship to a
series of social problems, which have attracted considerable public concern. These
include the pandemic of AIDS in many underdeveloped nations, human traffick-
ing and the mental health of sex workers (Alegria, Vera, Freeman et al. 1994; El-
Bassel, Witte, Wada et al. 2001; Farr 2005; Gilchrist, Gruer and Atkinson 2005;
Graycar and McCusker 2007; Halli, Ramesh, O’Neil, Moses and Blanchard 2006;
Hong, Li, Fang and Zhao 2007; Nagelkerke, Jha and DeVas 2002; Outshoorn
2004; Surratt, Inciardi, Kurtz et al. 2004). Estimates of the annual number of persons
trafficked into the sex industry range from a few hundred thousand to 1 million
(Farr 2005; Miller, Decker, Silverman and Raj 2007). Research investigations on
the mental health of sex workers have found relatively high levels of depressive
symptomatology, substance abuse and suicide, as well as stressful life events such
as partner violence and child abuse (Alegria, Vera, Freeman et al. 1994; El-Bassel,
Witte, Wada et al. 2001; Gilchrist, Gruer and Atkinson 2005; Hong, Li, Fang and
Zhao 2007; Surratt, Inciardi, Kurtz et al. 2004).

While public opinion tends to predict social policy (Burstein 2003), it is not 
known if this is the case for the issue of prostitution. Further work is needed to 
determine the extent to which public opinion on prostitution is related to social 
policies. For example, where prostitution acceptability is high, is there a relatively 
high probability that it will be legal, in at least some form? In democratic societies 
social policies are expected to take into account the opinions of the people. However, 
many of the world’s nations do not have democratic political institutions. Future 
work is needed to determine if democratic institutions provide better conduits be-
tween public opinion and the status of prostitution than do nondemocratic ones.

Our findings suggest that work on public opinion regarding crime should con- 
sider using a master cultural axis of nations, survivalism, for predicting attitudes. 
Survivalism is at a higher level of abstraction than specific symbolic orientations 
used frequently in the literature on public opinion about crime. For example, racial 
prejudice and authoritarianism can be subsumed under survivalism in the sense
that survivalism connotes low tolerance of group differences. Use of survivalism as 
a master cultural axis, which draws on numerous sub-concepts or symbolic orienta- 
tions used in previous work, would help to produce more parsimonious models 
of public attitudes about crime and deviant behavior, as well as other social issues. 

Caution should be exercised in interpreting the results of the present study 
for other categories of criminal offending. Prostitution is a public order crime in 
many contemporary societies. The link between survivalism and public opinion
on prostitution may be stronger than the link between survivalism and public
opinion on crimes with victims. The normative consensus against
crimes with victims (e.g., homicide) would be assumed to be stronger than any 
consensus against prostitution, a behavior that may be viewed as an alternative 
lifestyle or aspect of self-expressionism. Survivalism may be best linked to the 
degree of acceptability of other public order crimes such as drug offenses. Further 
work is needed to explore the link between survivalism vs. self-expressionism and
attitudes toward other forms of crime and deviant behavior. Further, the types of punishment,
amount of punishment, and/or types of rehabilitative treatment due offenders 
may be linked to a survivalist cultural orientation. 

In any event, the results of the present study would predict that as the less 
developed nations of the world win their battles against physical and economic 
insecurities, their cultural axis will shift in a predictable direction. Economic 
development will tend to be associated with greater tolerance of deviant behavior 
including prostitution. 

However, cultural change proceeds at different velocities according to the na- 
ture of long standing cultural imperatives in different parts of the world (Inglehart 
and Baker 2000). Given the results of the present study, we would expect that this 
cultural shift may not proceed as expected in certain culture zones, mainly that of 
Islam. Given the strong condemnation of prostitution in that culture zone, gains 
in economic security may not produce the expected gains in self-expressionism 
and tolerance of deviant behavior. 


Notes 


1. The following countries could not be included in our analytical sample because their 
country/territory survey did not include the question about prostitution acceptability, 
which is our key dependent variable: Morocco, Iraq, Turkey and Puerto Rico. We 
also had to eliminate China, Saudi Arabia and Iran because their country survey did 
not ask respondents about signing a petition, which is one of the questions included 
in the survivalist cultural index. Finally, we had to exclude Singapore because their 
country survey did not ask about financial satisfaction and Israel is not included 
because they did not ask respondents about their religious attendance. Because it 
would be inappropriate to impute values that are not missing at random (Allison 
2001), we, unfortunately, had to listwise delete these cases. 


2. The largest proportion (10.5%) of cases were missing information only on one or
more items in the five-item individual-level survivalism index. No other variable had
more than 2.8 percent missing. The survivalism index is a composite of five variables.
No single variable accounted for all of the missing data from the index. The variable 
within the index that had the highest proportion missing was whether or not the 
respondent had signed a petition, and 5.8 percent of people were missing information 
only on this measure in the survey. We conducted a sensitivity analysis to see if our 
results would change when countries that had a high proportion (>20%) of missing 
cases on the index were excluded from the analysis. There were four countries (India,
Moldova, Albania and Uganda) that met this criterion. We found that regardless
of whether these countries were included in the analysis, the significance level and 
direction of key findings did not change. 


3. We used Stata's “sktest,” which estimated the probability of skewness to be .08, the
probability of kurtosis to be .71, an adjusted joint chi-squared test of 3.54, and a
joint probability > chi-squared of .17.


4. We also conducted our analysis using mean replacement for the survivalism index,
which had the highest proportion of missing values, and found that key relationships 
did not change in direction or significance based on the technique used. 


5. We considered using the larger index; however, some countries did not ask
respondents about all of the items included in the 11-item index. Because it would 
be inappropriate to impute missing values for data that are systematically missing 
(Allison 2001), we would have to drop the entire country from our analysis if all of 
the questions on the index were not asked. Given the high correlation between the 
indices in Inglehart and Baker's (2000) work, and our interest in maintaining the 
largest sample size possible, we felt that the five-item index was warranted. 


6. Serbia and Montenegro did not have GDP estimates for 2004, so we assigned them 
the same GDP as Bosnia. 
Appendix 1. Nations Included in the Analysis

Nation          Frequency   % of Sample
Albania             1,000          2.22
Algeria             1,282          2.84
Argentina           1,280          2.84
Bangladesh          1,500          3.33
Bosnia              1,200          2.66
Canada              1,931          4.28
Chile               1,200          2.66
Egypt               3,000          6.65
India               2,002          4.44
Indonesia           1,004          2.23
Japan               1,362          3.02
Jordan              1,223          2.71
Kyrgyz              1,043          2.31
Macedonia           1,055          2.34
Mexico              1,535          3.40
Moldova             1,008          2.23
Montenegro          1,060          2.35
Nigeria             2,022          4.48
Pakistan            2,000          4.43
Peru                1,501          3.33
Philippines         1,200          2.66
South Africa        3,000          6.65
South Korea         1,200          2.66
Serbia              1,200          2.66
Singapore           1,512          3.35
Spain               1,209          2.68
Tanzania            1,171          2.60
Uganda              1,002          2.22
USA                 1,200          2.66
Venezuela           1,200          2.66
Vietnam             1,000          2.22
Zimbabwe            1,002          2.22

Total              45,104
Military Spending and Economic Well-Being in the American
States: The Post-Vietnam War Era 


Casey Borch, University of Alabama-Birmingham 
Michael Wallace, University of Connecticut 


Using growth curve modeling techniques, this research investigates whether mili- 
tary spending improved or worsened the economic well-being of citizens within 
the American states during the post-Vietnam War period. We empirically test the 
military Keynesianism claim that military spending improves the economic condi- 
tions of citizens through its use by politicians as a countercyclical tool to reduce the 
negative effects of economic downturns. However, due to deindustrialization and the 
emergence of the “new military,” there are reasons to believe that military spending 
will not effectively improve economic well-being during the post-Vietnam War era. 
Using longitudinal data we find that states with high levels of military spending are 
better equipped to stave off the deleterious effects of economic recession than are 
states with lower levels of military spending. 


Introduction 


Since World War II, the drive to maintain global military superiority has pushed 
the United States toward what Melman (1974) called the “permanent war economy.” 
This fact has not diminished a decade and a half after the end of the Cold War. In 
fiscal year 2006, the United States spent about $419.3 billion on defense; that is 
about 44 percent of all military spending worldwide and almost six and a half times 
that of the next largest spender, Russia (Center for Defense Information 2006).1
While most agree that the permanent war economy benefits U.S. corporations and 
workers directly associated with the defense industry, there is considerable debate 
over whether it improves the economic well-being of the rest of the population. To 
illustrate this point, consider the rise and fall of defense-related spending in Groton, 
Connecticut, the self-proclaimed “Submarine Capital of the World.”2
Connecticut’s New London County, including Groton, at one time had the 
most military-dependent economy in the nation (Judson 1992). While some com- 
munities are heavily dependent on defense procurement and others are reliant on 
defense personnel (i.e., military bases), Groton has both. It is home to the nation’s 
largest submarine manufacturer, the General Dynamics subsidiary Electric Boat 
Company, and the U.S. Navy’s largest submarine base (Ravo 1991). At its peak in 
1987, Electric Boat employed more than 30,000 people, roughly one-quarter of
Groton’s total job base (Biddle 1990). At the same time, another 10,000 military 
and civilian employees worked at the submarine base. Including all support and 
service jobs, about half of the 110,000-person labor force in Groton worked in 
jobs related to the defense industry. In the 1980s, the defense industry either 
directly or indirectly produced about 60 percent of the total economic output of 
New London County and each defense-related job spun off another 2.5 jobs in 
the region (Casey 1993; see also Duchin 1983). Thus, the defense buildup of the 
1980s represented an economic boon for both New London County and Groton. 
With the end of the Cold War and the subsequent scale-down in military 
spending in the early 1990s, the Department of Defense reduced its orders for 
submarines from Electric Boat. The layoffs of more than 20,000 Electric Boat 
employees, which took place over nearly 10 years, had a “ripple effect” on the 
economies of Groton and New London County. Dozens of bars, restaurants and 
street vendors that flourished by supplying meals to Electric Boat’s workers began 
to close, until today there are only a handful left (Driscoll 1992). 
Manufacturing-based support companies were also badly hurt by the loss of 
defense contracts to Electric Boat. Its largest supplier of parts, the New London 
County-based UNC Naval Products, scaled back production leaving another 
1,000 workers jobless (Brisgone 1992). In addition, the Naval Undersea Warfare 
Center moved from New London County to Newport, Rhode Island. This move 
displaced another 3,000 workers. Thousands of other jobs also left the area, as 
dozens of private contractors and other support companies moved to be closer to 
other customers (Hamilton 2004). In total, Connecticut lost 30.5 percent of its 
private sector, defense-related employment from 1989-1994 (Kodrzycki 1995).3 


Background and Theory 


In the Cold War era, politicians rallied public support for military spending in 
response to the growing threat posed by the Soviet Union (Hartley and Russett 
1992). This, in part, led to a surge in the production of material goods used to
support personnel and military infrastructure in the event of a war. Along with 
the end of the Cold War in the late 1980s came a downsizing of the military on 
a scale not seen in U.S. history due to the shift towards technologically advanced 
weapons systems and “smart bombs” that have replaced the infantry soldier. This 
“new military” has a much greater need for technological support than mate- 
rial support of the kind provided by the manufacturing sector. The reduction of 
forces would seemingly be detrimental to economic strategies such as military 
Keynesianism, which are based on the assumption of an ever-increasing (or at 
least stable) defense budget. Further, in today’s era of deindustrialization, the 
manufacturing sector in which most defense industries reside is a much smaller 
component of the total economy. This suggests that increasing the production of 
defense materials in response to threats to economic stability would not have as 
great an effect on the overall economy as it did during the post-WWII era when
manufacturing was clearly the engine that drove the economy. 


Military Keynesianism 


In their seminal work Monopoly Capital, Baran and Sweezy (1966:165) stated, “if 
military spending were reduced once again to pre-Second World War proportions, 
the nation’s economy would return to a state of profound depression, characterized 
by unemployment rates of 12 percent and up.” Baran and Sweezy (1966) argue 
that government officials use military spending as a countercyclical tool to avoid 
recession, reduce unemployment and stimulate economic growth. They contend 
that the U.S. capitalist system is prone to periodic bouts of economic stagnation that 
are caused by surplus productive capacity and insufficient demand for private sector 
goods. The state-sponsored quest for military goods satisfies a virtually endless ap- 
petite for new products and helps absorb the surplus capacity of the capitalist system. 

Further, military spending is unique in that it does not directly compete with 
other products made in the private sector and politicians can easily rally widespread 
public support for military spending (Reich 1972). Politicians from both political 
parties use military spending to stimulate economic growth (or avoid stagnation) in 
their districts, ensure an acceptable level of economic well-being for their citizenry, 
and maintain their incumbency in political office (Nincic and Cusack 1979; Griffin, 
Devine and Wallace 1982). While Baran and Sweezy’s (1966) articulation of military 
Keynesianism originates in a neo-Marxist analysis of U.S. political economy, the 
argument resonates with scholars of different political orientations (e.g., Hitch and 
McKean 1961; Galbraith 1967; Cypher 1974; Smith 1977). 

Griffin and his colleagues (Griffin et al. 1982; Griffin, Wallace and Devine 
1982) refined the military Keynesian argument and used annual time series data 
for the United States (1949-1977) to test its hypotheses. They disagreed with 
Baran and Sweezy that military spending is driven by aggregate economic condi- 
tions such as unemployment and argued instead that it is responsive to powerful 
segments of the capitalist political economy; specifically, monopoly capital and 
organized labor (see also O'Connor 1973). Their findings indicated that the state 
was likely to increase military spending as a percent of GNP following downturns 
in monopoly sector profits (not aggregate profits) or unemployment in the union- 
ized sector (not aggregate unemployment). 

Griffin et al. (1982:S135) also found that military spending was responsive 
to the electoral cycle. However, they found that “Republican and Democratic 
administrations behave similarly for structural economic reasons.” In short, ir- 
respective of party, politicians manipulate the defense budget in order to ensure 
their own incumbency. Mayer's (1992) monthly time series analysis found that 
defense contracts peak in months prior to national elections to provide a boost to 
the economy and ensure the election of incumbent politicians, but he did not test 
whether there is variation based on the political party in power.4
Extending this line of research, Mintz and Hicks (1984) argued for the disag-
gregation of military spending into four components: personnel, organization and
maintenance, procurement, and research and development, measured as propor- 
tions of GNP. They found some support for the military Keynesianism argument, 
specifically, that procurement and personnel spending were used as countercyclical 
tools while spending on the other two components was not. Spending on defense 
procurement and personnel was responsive to aggregate concentration, attesting 
to the influence of powerful corporate actors in the private sector. Similarly, in 
one of the few state-level studies of this kind, Mehay and Solnick (1990) found 
that aggregate military spending positively affected economic growth, but once 
disaggregated, only increases in investment-type outlays increased growth. 

In a recent study, Wallace, Borch and Gauchat (2008) extend the military 
Keynesian argument to an analysis of the American states for the period 1977-2004. 
They disaggregate military spending into two components, defense contracts and 
defense personnel, and develop multivariate models to assess their determinants. 
Their findings show that defense contracts increased in a countercyclical fashion 
in response to downturns in the economy and also in direct proportion to several 
state-level measures of corporate power, but contracts are negatively related to 
the strength of labor unions. Wallace et al. (2008) also find no effect of political 
partisanship on military spending. All these findings, with the exception of the
negative finding for union strength, follow a pattern largely consistent with earlier
accounts of military Keynesianism. On the other hand, Wallace et al.’s results also 
showed that the determinants of defense personnel expenditures were only mildly 
consistent with military Keynesian explanations. 

Most research using the military Keynesian perspective tends to focus more on 
the causes of military spending. Studies have been less consistent in addressing the 
outcomes of military spending, yet this is a complementary component of military
Keynesian theory. If politicians use military spending countercyclically to boost the
economy and maintain their incumbency in office, it is reasonable to ask whether 
these efforts actually work—even in the short run—to improve the economic well- 
being of the population. One of the first studies to look at outcomes of military 
spending is Szymanski’s (1973) comparative study of 18 capitalist countries. He 
found that military spending decreased unemployment, consistent with the mili- 
tary Keynesian hypothesis, but also hindered economic growth, contrary to military 
Keynesianism. Nincic and Cusack (1979) conducted a bivariate time series analysis 
(1949-1976) of the effects of military spending on economic growth (i.e., real GNP 
per capita). This analysis showed that military spending increases economic growth. 
They also showed that, under various assumptions about the multiplier effects of 
defense-related unemployment, military spending generates jobs for millions of 
people who might otherwise go unemployed. While overcoming many of the 
flaws in the cross-national studies, Nincic and Cusack’s study spans the Korean and 
Vietnam wars, which complicates the task of cleanly identifying the role of military 
spending during a peacetime economy. More recently, Dunne and Smith’s (1990) 
analysis of 11 OECD countries for the 1962-1985 period found that military 
spending was unrelated to changes in unemployment. 

One could also argue that military spending hurts productivity as the non- 
competitive military sector crowds out the private sector. As more resources are 
devoted to military spending, the government must either raise taxes or borrow 
from the international capital market to balance its budget. The latter option is 
especially detrimental to economic well-being as it increases interest rates, reduces 
business investment and consumer demand, and slows economic growth (Russett 
1969; Deger and Smith 1983). In fact, Goldstein (1988) estimated that every 1 
percent of GNP allocated to military spending decreases economic growth by .5 
percentage points. Further, increased military spending can consume exportable 
goods from defense industries, thus exacerbating the trade deficit. This contrib- 
utes to a weaker national currency, rising inflation and higher unemployment 
(Chan 1985). In addition, as the military sector expands, salaries increase, luring 
skilled workers away from non-defense companies to work for defense contractors. 
Finally, the defense industry diverts research and development efforts away from 
the private sector and hoards technologies that could be useful to civilians and 
private businesses (see Deger 1986; Ward, Davis and Lofdahl 1995; Heo and Eger 
2005; Abell 1990, 1994; Henderson 1998). 

Other studies have reached conclusions that are generally supportive of the 
military Keynesian argument. For example, Hooker (1996) used state-level data 
from 1963-1994 to examine the effects of military spending on growth of real 
per capita personal income. He found that increases in military spending led to 
increases in personal income, net of other controls. (See also Westing 1978.) 


Data and Method 


In this research, we follow Hooker and Knetter (1997:412) who, after comparing 
national-level and state-level analyses, concluded, “It appears that research using 
state-level data has more power to identify a relationship between military spend- 
ing and economic activity.” States offer a range of economic, social and political 
conditions with which to test the variable effects of military spending while still 
sharing the common geopolitical constraints of the United States’ “permanent war 
economy.” In addition, pooling annual data across states provides more degrees of 
freedom for statistical analyses. 

We constructed a time-series dataset of 49 U.S. states covering the post-Vietnam 
War era, 1977-2004, a design that yielded 1,372 observations. Although our 
series is four years shorter than the state-level series (1963-1994) used by Hooker 
(1996) and Hooker and Knetter (1997), we chose the period on theoretical and 
empirical grounds (Isaac and Griffin 1989). In terms of theory, we purposefully 
excluded the Vietnam War era to concentrate on the role of military spending 
during a peacetime economy. Most observers (e.g., Schulzinger 1997) mark the 
end of the Vietnam War in 1975 when the last of the U.S. Marines left Saigon, 
so 1977 was chosen as the start date for our analyses. The time span also coin- 
cides with a 30-year period of deindustrialization, which potentially accentuates 
the importance of military spending in generating new manufacturing jobs. On 
empirical grounds, the time period of our state-level data extends beyond all other 
time series analyses of military spending to 2004, which includes the beginning of 
another potentially large military buildup post-9/11. Table 1 provides descriptive 
statistics of all variables used in the analyses across states and over time. 


Dependent Variables 


We investigate the determinants of economic well-being using four dependent 
variables. To address one of the main outcome variables common to previous 
analyses we examined the unemployment rate, computed as the proportion of the 
state's labor force that is unemployed. Focusing on income, for each state, we use 
the median family income in constant 2000 dollars. Although the conclusions were 
substantively identical, this measure was chosen as a more realistic indicator of 
income than another common measure, logged per capita income. We also derived 
a measure of income inequality using the Gini index to assess the distributional 
consequences of military spending (Abell 1994). Recent research indicates that the 
Gini index, where 0 represents perfect equality and 1 represents perfect inequality, 
is increasing at both the national (Bernstein, Mishel and Brocht 2000) and state 
levels (Langer 1999). Finally, we focus on the most vulnerable segment of the 
population, those in poverty. Percent poverty is the percentage of the state's residents 
below the federal poverty level. Although there is some disagreement about the 
measurement of poverty (see e.g., Gundersen and Ziliak 2004), the federal poverty 
rate has been widely used in analyses similar to ours (e.g., Henderson 1998). 
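
As a rough illustration of the inequality measure described above, the sketch below computes a Gini index from a list of individual incomes. The function name `gini` and the sample incomes are hypothetical, and the rank-based formula is one standard way to compute the index; it is not necessarily the procedure used to build the state-level series analyzed here.

```python
def gini(incomes):
    """Gini index of a list of non-negative incomes (0 = perfect equality)."""
    values = sorted(incomes)
    n = len(values)
    total = sum(values)
    if n == 0 or total == 0:
        raise ValueError("need at least one positive income")
    # Rank-weighted sum over incomes sorted in ascending order.
    weighted = sum(rank * x for rank, x in enumerate(values, start=1))
    return 2 * weighted / (n * total) - (n + 1) / n


# Hypothetical incomes, for illustration only.
equal_shares = [1, 1, 1, 1]
one_holds_all = [0, 0, 0, 10]
print(gini(equal_shares), gini(one_holds_all))
```

With a finite number of units the index tops out at (n - 1)/n rather than exactly 1, so the four-person example in which one person holds everything yields .75.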


Independent Variables 


For comparability, each of our statistical models includes the same control vari- 
ables, but some of the interaction terms differ by model. To control for region- 
specific factors, we include a dummy variable for South. In addition, our study 
spans three distinct historical periods: the Cold War (1977-1989), the collapse of 
the Soviet Union and end of the Cold War (1990-2000), and the post-9/11 period 
(2001-2004). Hence, we include dummy variables for the latter two periods, end 
of the Cold War and post-9/11, and omit the Cold War period as the reference 
group. To account for macro-economic cycles, we include growth of real GDP 
per capita (t-1) measured at the national level. We decided to use growth of real 
GDP per capita because the U.S. population was also growing over our period 
of study. Next, the percent change in aggregate concentration measures the annual 
change in the percent of assets controlled by the top 100 manufacturing firms 
also measured at the national level. Although manufacturing has become a smaller 
portion of the national economy, the degree of concentration in this sector has 
trended upward throughout the period of study (1977-2004). Also, inflation (t-1) 
is the annual percent change in the cost of living measured at the regional level. 
We experimented with other lag structures, but decided to lag percent change 
in real GDP per capita and inflation one year (t-1) for two reasons: (1) because 
military Keynesianism predicts that politicians will react to short-run changes in 
the economy, and (2) because additional lags (up to 10 years) did not substantively 
alter the findings presented below. 
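
The one-year lag (t-1) can be pictured as pairing each state-year with the same state's value from the previous year. The sketch below is a minimal illustration under assumed names: `lag_one_year`, the dictionary layout and the sample values are all hypothetical and do not reproduce the actual data files.

```python
def lag_one_year(panel):
    """Return the previous year's value for each (state, year) key.

    `panel` maps (state, year) -> value. A state's first year has no
    predecessor, so its lagged value is None.
    """
    return {(state, year): panel.get((state, year - 1))
            for (state, year) in panel}


# Hypothetical values, for illustration only.
contracts = {
    ("VA", 1977): 4.0, ("VA", 1978): 4.5,
    ("OH", 1977): 1.2, ("OH", 1978): 1.1,
}
lagged = lag_one_year(contracts)
print(lagged[("VA", 1978)], lagged[("VA", 1977)])
```

Because the lookup is keyed on the state as well as the year, a value is never carried over from one state to another.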

We also include several variables that control for demographic features of the 
state’s population. Percent college-educated, which proxies human capital levels, is 
the proportion of a state’s population over age 25 with a four-year college degree. 
This variable has been shown to affect economic growth in previous research (Barro 
and Sala-i-Martin 1995). Percent black is the proportion of a state's population 
who are black. Residential population is the total number of people living in the state. 

We also control for the political and business climates of the states. First, to 
tap partisan control of state government, we include the proportion of Democrats 
in state government, computed as the average of the percent of Democrats in the 
upper and lower chambers of the state's legislature. Next, party of the governor is 
coded 1 if the governor is a Democrat and 0 if Republican or Independent. Third, 
we include the percent employed in large establishments, that is, establishments with 
more than 500 employees. This measure reflects the high-employment sectors of 
the economy that are especially vulnerable to economic downturns. Fourth, to 
account for the effects of deindustrialization, we include the percent of GSP from 
manufacturing. As expected, this variable decreases substantially throughout our 
period of study. Fifth, to capture the influence of monopoly sector firms, we 
include the natural logarithm of the number of Fortune 500 firms located in the 
state. Sixth, union density is the percentage of the state’s non-agricultural workforce 
belonging to unions. 

Finally, we include several measures of how much federal spending is flow- 
ing into the state economies. Following Heo and Eger (2005), who found that 
federal, non-military spending has significant effects on economic growth, we 
include non-defense expenditures as a proportion of GSP (t-1). This variable taps 
all non-military federal funds flowing into the state and controls for alternative 
ways in which federal funding might affect economic well-being. Following 
Mintz and Hicks (1984) and Mehay and Solnick (1990), we disaggregated 
state-level military spending into two components. Defense contracts (t-1) is the 
percent of the state's GSP from government contracts through procurement of 
defense-related materials from companies in those states; while defense person- 
nel (t-1) is the percent of the state’s GSP devoted to spending on civilian and 
military defense personnel (including retirement and veterans’ benefits). Several 
scholars have shown that different components of military spending have dis- 
tinct effects on key well-being outcomes (e.g., Mintz and Hicks 1984; Mehay 
and Solnick 1990; Henderson 1998). Finally, we lagged both defense measures 
one year to capture the assumption among military Keynesianists that changes 
in military spending have a delayed effect on economic well-being. 


Model Estimation 


To examine the determinants of military spending in the U.S. states from 1977 
through 2004, we employ the class of multilevel modeling techniques known as 
“growth curve” models (Bryk and Raudenbush 1987). In the dataset analyzed below, 
the time-varying measures can be conceptualized as nested within each of the 
49 states in the study. The analyses rely on estimating two equations. The level 1 
model represents the hypothetical change that each state is expected to experience 
over the observed time period, controlling for specific time varying predictors. The 
level 1 model is normally presented with a linear change function, although it can 
easily handle higher-order polynomials. The general form of the level 1 model is: 


$$Y_{ij} = \pi_{0i} + \pi_{1i}\,TIME_{ij} + \pi_{2i}X_{2ij} + \pi_{3i}X_{3ij} + \cdots + \pi_{ni}X_{nij} + \varepsilon_{ij} \qquad (1)$$


where $Y_{ij}$ is the predicted outcome for the ith state at the jth time point; $TIME_{ij}$ 
is some measure of time for state i at time j; $X_2, X_3, \ldots, X_n$ are additional time-varying 
predictors at the within-state level; $\pi_{0i}$ is the value of Y at the baseline (or “initial 
status”) and when all other time-varying predictors equal zero; $\pi_{1i}$ is the slope (or 
“rate of change”) for the ith state; $\pi_{2i}$ is, net of other predictors, the effect of $X_2$ 
on Y (and similarly for $\pi_{3i}$ and so on); $\varepsilon_{ij}$ is the within-state error term, which is 
assumed to be normally distributed, $N(0, \sigma^2_\varepsilon)$. With all time-varying predictors 
removed except TIME, the model in Equation 1 is commonly referred to as the 
“unconditional growth model” (UGM). In general, the UGM assumes that each state’s 
change trajectory has a common form (in this case, linear), but it does not assume 
that each state’s intercept or slope is identical. 

The level 2 model uses time-invariant predictors to elaborate the relationship 
between within-state measures and between-state measures. There are two main 
features of the level 2 model. First, the outcomes of interest are the initial status 
(intercept) and rate of change (slope) parameters of level 1, $\pi_{0i}$ and $\pi_{1i}$. This indicates 
that the specific distributions of these parameters can vary independently, and 
that the predictors at level 2 relate to the individual growth parameters. Second, 
the level 2 model allows for random variation in the individual growth parameters 
of states that share common values on the level 2 predictors (e.g., region of the 
country). That is, the differences in state-level measures place the change param- 
eters into categories (e.g., South/non-South) within which individual trajectories 
can take different forms. The general form of the level 2 model is: 

$$\begin{aligned}
\pi_{0i} &= \gamma_{00} + \gamma_{01}(Z_{1i}) + \gamma_{02}(Z_{2i}) + \cdots + \gamma_{0n}(Z_{ni}) + \zeta_{0i} \\
\pi_{1i} &= \gamma_{10} + \gamma_{11}(Z_{1i}) + \gamma_{12}(Z_{2i}) + \cdots + \gamma_{1n}(Z_{ni}) + \zeta_{1i} \qquad (2) \\
&\;\;\vdots \\
\pi_{ni} &= \gamma_{n0} + \gamma_{n1}(Z_{1i}) + \gamma_{n2}(Z_{2i}) + \cdots + \gamma_{nn}(Z_{ni}) + \zeta_{ni}
\end{aligned}$$

where $\gamma_{00}$ is the adjusted value of Y when all time-invariant predictors equal zero; 
$Z_1, Z_2, \ldots, Z_n$ are time-invariant predictors; $\gamma_{20}$ is, net of other predictors, the mean 
effect of $X_2$ when $Z_1$ equals zero (and similarly for $X_3$ and $Z_2$, etc.); $\gamma_{01}$ is, net of 
other predictors, the mean effect of $Z_1$ on the mean value of Y (and similarly for $Z_2$, 
etc.); $\gamma_{21}$ is, net of other predictors, the effect of $Z_1$ on the effect of $X_2$ on the mean 
value of Y (and similarly for $Z_2$, etc.). The Z terms represent contextual level variables. 
The error terms, $\zeta_{0i}, \zeta_{1i}, \ldots, \zeta_{ni}$, represent unexplained state differences in the 
intercepts and slopes, respectively. Additionally, each error term has a variance 
and a covariance that is also estimated. In traditional regression, the initial status 
($\pi_{0i}$) and growth rate ($\pi_{1i}$) are treated as fixed effects, and the random effects (error 
terms) of each are not estimated. Thus, by estimating the variance and covariance 
of each error term, the unique estimates for each state can be examined more 
completely than by conducting separate regressions for each state. 
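
To see how the two levels combine, the level 2 equations can be substituted into the level 1 equation. The derivation below is a simplified sketch that assumes a single time-varying predictor $X_2$ and a single time-invariant predictor $Z_1$ (for example, region); it is meant only to show where the cross-level interaction terms come from, not to reproduce the full specification estimated here.

```latex
% Level 1, with one time-varying predictor (illustrative simplification)
Y_{ij} = \pi_{0i} + \pi_{1i}\,TIME_{ij} + \pi_{2i}X_{2ij} + \varepsilon_{ij}

% Level 2, with one time-invariant predictor
\pi_{0i} = \gamma_{00} + \gamma_{01}Z_{1i} + \zeta_{0i}, \qquad
\pi_{1i} = \gamma_{10} + \gamma_{11}Z_{1i} + \zeta_{1i}, \qquad
\pi_{2i} = \gamma_{20} + \gamma_{21}Z_{1i} + \zeta_{2i}

% Composite model after substitution
Y_{ij} = \gamma_{00} + \gamma_{01}Z_{1i}
       + \gamma_{10}\,TIME_{ij} + \gamma_{11}Z_{1i}\,TIME_{ij}
       + \gamma_{20}X_{2ij} + \gamma_{21}Z_{1i}X_{2ij}
       + \bigl[\zeta_{0i} + \zeta_{1i}\,TIME_{ij} + \zeta_{2i}X_{2ij} + \varepsilon_{ij}\bigr]
```

In the composite form, $\gamma_{11}$ appears as the coefficient on the product of $Z_1$ and TIME, which is why a level 2 predictor of the slope shows up as an interaction with time, and the bracketed term collects the random effects and the level 1 error.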

Rather than using OLS estimates of the level 1 intercepts and slopes, which are 
free to vary among states in growth curve modeling, estimates of these coefficients 
are determined using empirical Bayes (EB) procedures. An EB estimate “utilizes not 
only data from that unit but also data from all other similar units” (Raudenbush 
and Bryk 2002:66). This estimation strategy allows for optimal estimates of individual 
intercepts and slopes given a variety of data-related concerns. After these 
estimates have been determined, the differences in average intercepts or slopes 
between categories of specific level 2 variables can be ascertained. For the level 2 
equation, again OLS does not offer the best estimates. Instead, generalized least 
squares estimates are used. These estimates are precision-weighted by taking into 
account the contributions of different units. Finally, the residual variance at level 
1, the variances of the random effects at level 2, and covariances among the level 
2 random effects are determined by maximum likelihood procedures (for more, 
see Bryk and Raudenbush 1992; Snijders and Bosker 1999). 
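
The borrowing of information behind an empirical Bayes estimate can be sketched for the simplest case, a random-intercept model with no predictors. There, each state's estimate is a weighted average of its own mean and the grand mean, with the weight given by the reliability of the state mean. The function names and sample inputs below are hypothetical, and this special case is considerably simpler than the full growth curve models estimated here.

```python
def reliability(between_var, within_var, n_obs):
    """Weight placed on a state's own mean: tau / (tau + sigma2 / n)."""
    return between_var / (between_var + within_var / n_obs)


def eb_intercept(state_mean, grand_mean, between_var, within_var, n_obs):
    """Shrink a state's observed mean toward the grand mean."""
    weight = reliability(between_var, within_var, n_obs)
    return weight * state_mean + (1 - weight) * grand_mean


# Hypothetical inputs, for illustration only.
few_obs = eb_intercept(10.0, 0.0, between_var=1.0, within_var=1.0, n_obs=1)
many_obs = eb_intercept(10.0, 0.0, between_var=1.0, within_var=1.0, n_obs=28)
print(few_obs, many_obs)
```

The more observations a state contributes, the closer the weight is to 1 and the less its estimate is pulled toward the grand mean.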

As evidence that multi-level models are appropriate for our data, we computed 
the intraclass correlation coefficient (ICC) (Singer and Willett 2003:96) by run- 
ning a model with no level 1 or 2 predictors (termed the “unconditional means 
model”) and examining the variance components. If the ICC is not high enough, 
as de Leeuw and Kreft (1995) note, multilevel modeling is not an improvement 
over more traditional models in terms of estimating fixed effects. Briefly, the ICC 
(or $\rho$) is a measure of the proportion of total variation in the dependent variable 
that lies between states. Formally, $\rho = \sigma^2_0 / (\sigma^2_\varepsilon + \sigma^2_0)$, where $\sigma^2_0$ is between-state 
variance, $\sigma^2_\varepsilon$ is within-state variance, and $(\sigma^2_\varepsilon + \sigma^2_0)$ is the total variance. 

For unemployment, the ICC = 1.161/(2.934+1.161) = .284, meaning that 
about 28 percent of the total variation in unemployment is due to differences 
among states. For median family income, the ICC = 44.856/(20.549+44.856) 
= .686, meaning that nearly 69 percent of the total variation in median family 
income is due to differences among states. For income inequality (Gini index), the 
ICC = 2.778/(6.614+2.778) = .296, meaning that nearly 30 percent of the total 
variation in income inequality is due to state differences. For poverty, the ICC 
= 11.891/(4.275+11.891) = .736, indicating that nearly 74 percent of the total 
variation in levels of poverty is due to differences among states. ICCs as large as 
these indicate that multilevel modeling is an appropriate way to analyze our data. 
In growth curve models, the ICC also provides a measure of autocorrelation. Thus, 
for our models, the results suggest that significant autocorrelation is present. 
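
The four ICC values reported above can be recomputed directly from the variance components given in the text. The sketch below does only that arithmetic; the function name `icc` and the dictionary layout are illustrative choices.

```python
def icc(between_var, within_var):
    """Share of total variance that lies between states."""
    return between_var / (between_var + within_var)


# (between-state variance, within-state variance) as reported in the text.
components = {
    "unemployment": (1.161, 2.934),
    "median family income": (44.856, 20.549),
    "income inequality (Gini)": (2.778, 6.614),
    "poverty": (11.891, 4.275),
}

for outcome, (between, within) in components.items():
    print(f"{outcome}: ICC = {icc(between, within):.3f}")
```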


Results 
Predicting Economic Well-being 


In growth curve modeling, the “main” effects represent the “initial status” of the 
variable. That is, the main effect coefficient represents the average effect of the 
independent variable at the first time point. This effect remains constant if there is 
not a significant interaction with time. To check for change over time, it is customary 
to create interaction terms between the predictors and a measure of time where 
significant interactions indicate steeper or shallower slopes modifying the main 
effect of time. For example, assume a positive and significant main effect of X on 
Y and a positive and significant interaction with time. This means that a one-unit 
increase in X increases Y by the main effect coefficient, and that the effect of X on 
Y is growing larger over time; if the interaction with time was negative, this would 
mean that the effect of X on Y is getting smaller over time. 

Unemployment, modeled in the first column of Table 2, is considered by Baran and 
Sweezy (1966) and other proponents of military Keynesianism to be a key barometer 
of economic well-being. Descriptively, the results indicate that the post-9/11 years 
have higher unemployment relative to the Cold War years. Also, states in the South, 
those with growing capital concentration, those with larger populations, those with 
higher unionization rates, and those experiencing increases in non-defense spending 
had higher unemployment compared to states without such characteristics. On the 
other hand, states following national economic growth, those with increased inflation, 
with more Democratic state legislatures, and those with Democratic governors 
had lower unemployment rates relative to states without such traits. 

In terms of our key independent variables, states with higher proportions of 
GSP from military spending on contracts in the previous year did not have sig- 
nificantly different unemployment rates than states with lower levels of spending 
on contracts. However, the results suggest that states with higher proportions of 
GSP from spending on military personnel tended to have lower unemployment 
rates relative to states with lower proportions of GSP from spending on personnel 
(γ = -.190, p < .05). This result suggests that funneling spending on personnel into 
needy states could be an important way in which the federal government could 
reduce unemployment and improve economic well-being in those states. 

In terms of the changes over time, the results imply that the difference in 
unemployment rates between Southern and non-Southern states is declining over 
time (γ = -.040, p < .05). These effects can be interpreted as follows: the effect 
of South on the intercept is positive and significant (γ = 1.835, p < .01) and the 
trend term is negative and significant (γ = -.137, p < .001). This indicates that 
the average unemployment rate is declining across all states, but the slope for 
Southern states is declining at a faster rate than for states outside the South (γ = 
-.040, p < .01). A similar pattern for the effect of time was evident in states with 
greater proportions of GSP in manufacturing (γ = -.002, p < .05) and with higher 
levels of non-defense spending (γ = -.007, p < .001); while the opposite pattern 
was found in states with higher inflation rates (γ = .009, p < .01) and Democratic 
governors (γ = .039, p < .001). 
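
To make the South example concrete, the sketch below combines the reported main effect (1.835) and the reported interaction with time (-.040) into an implied South/non-South gap at a given time point. It assumes that time is coded in years with t = 0 at the first time point and t = 27 at the last, which the text does not state explicitly; the names `south_gap`, `SOUTH_MAIN` and `SOUTH_BY_TIME` are illustrative.

```python
# Coefficients as reported in the text for the unemployment model.
SOUTH_MAIN = 1.835       # effect of South on the intercept
SOUTH_BY_TIME = -0.040   # interaction of South with time


def south_gap(t):
    """Implied South/non-South unemployment gap at time t.

    Assumes t counts years from the first time point (t = 0); this
    coding is an assumption, not something the text confirms.
    """
    return SOUTH_MAIN + SOUTH_BY_TIME * t


for t in (0, 10, 20, 27):
    print(f"t = {t:2d}: implied gap = {south_gap(t):.3f}")
```

Under that coding, the implied gap falls from 1.835 at the start of the series to 1.835 - 27(.040) = .755 at the end, so it narrows over the period without closing.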

In terms of our key independent variables, there was no significant modifica- 
tion to the personnel spending slope, which means that on average the significant, 
negative effect seen above (γ = -.190, p < .05) remained relatively stable across 
the period of study (1977-2004). For spending on contracts, however, the nonsignificant 
slope was augmented by a negative and significant effect of time (γ 
= -.006, p < .05). This result indicates that states with higher levels of military 
spending on contracts experienced lower levels of unemployment over time than 
those with lower levels of spending on contracts. Importantly, these findings con- 
form to predictions from a military Keynesianism perspective, as lower state-level 
unemployment rates could be interpreted as a sign of improved economic well- 
being. The fact that the negative effect is constant for personnel and is growing 
over time for contracts suggests that the beneficial effects of military spending on 
economic well-being persist even in an era of deindustrialization and a decreased 
emphasis on the military. 

The variance components of this model ($\sigma^2_\varepsilon$, $\sigma^2_0$ and $\sigma^2_1$) indicate that, because 
all values differ significantly from zero, additional level 1 and level 2 predictors 
could be added to improve the fit of the model. Note also that the fitted model 
represents a significant improvement over the null model (i.e., the “unconditional 
means model” or a model with no predictors at levels 1 or 2). Given these find- 
ings and that the models are nested, we can simply subtract the fitted model’s 
deviance value from that of the null model, with degrees of freedom equal to the 
number of parameters added to the null model to construct the fitted model. The 
difference is χ² = 5476.1 - 4454.2 = 1021.9 (p < .001). Thus, without a measure 
analogous to R-square in OLS, this figure indicates that: (i) the fitted model fits 
the data significantly better than the null model, and (ii) the predictors explain 
about (1021.9/5476.1)*100 = 18.7% of the variance in the model. 
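
The deviance comparison above is simple arithmetic and can be checked as follows. The helper name `deviance_summary` is hypothetical, and the percentage it returns is a proportional reduction in deviance relative to the null model, which is how the figure in the text is constructed; it is not a literal R-square.

```python
def deviance_summary(null_deviance, fitted_deviance):
    """Return the deviance difference and its share of the null deviance."""
    difference = null_deviance - fitted_deviance
    percent_reduction = 100 * difference / null_deviance
    return difference, percent_reduction


# Deviance values for the unemployment model, as reported in the text.
difference, percent = deviance_summary(5476.1, 4454.2)
print(f"difference = {difference:.1f}, reduction = {percent:.1f}%")
```

The same function can be applied to the null and fitted deviances reported for any of the other models.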

The results of our model predicting median family income (second column of 
Table 2) indicate that both the post-Cold War and post-9/11 periods witnessed 
significantly lower levels of median family income (in real 2000 dollars) than the 
Cold War period. Further, states in the South (especially those with higher pro- 
portions of blacks), those experiencing growing capital concentrations, and those 
following increases in non-defense spending all had lower median family incomes 
compared to states without such characteristics. Conversely, states following national 
economic growth, those experiencing increased inflation, non-Southern 
states with greater proportions of blacks in their populations, those with more 
Democratic state legislatures, and those with higher union density had higher 
median family income values relative to states without such traits. 

Considering the main effects of the military spending variables, neither coeffi- 
cient was significant at the customary .05 level. This suggests that the average level 
of military spending in the state does not affect median family income at the first 
time point. But, the interaction with time indicates that spending on contracts has 
a positive effect on median family income over the period considered in this study 
(γ = .029, p < .01). While encouraging for military Keynesian theory, the findings 
for spending on personnel suggest an opposite interpretation. Specifically, the 
interaction between time and spending on personnel is negative and significant 
(γ = -.045, p < .01), which indicates that states with larger proportions of GSP 
from personnel spending have lower median family income values over time. It 
is possible that states with high levels of both types of military spending will not 
see much change in median family income, as these effects will essentially “wash 
each other out.” This finding lends mixed support to the military Keynesianism 
argument. On the one hand, defense contract spending improves economic well- 
being, as represented by median family income; but, on the other hand, spending 
on defense personnel hinders economic well-being. 

As above, the variance components of this model indicate that the inclusion 
of additional level 1 and level 2 predictors could improve the fit of the model. 
Further, the fitted model is a significantly better fitting model than the null model 
(χ² = 8316.0 - 6281.8 = 2034.2, p < .001) and it explains about 24.5 percent of 
the variance in the model. 

Our third model predicts income inequality using the Gini index (third column 
of Table 2). The results indicate that states in the South (except those with higher 
percentages of workers employed in large establishments), those following national 
economic growth, those with higher percentages of blacks, those with larger popu- 
lations, and those with higher proportions of GSP from non-defense spending all 
had greater income inequality relative to states without such traits. Alternatively, 
states experiencing increased inflation and those with higher percentages of workers 
employed in large establishments had lower levels of income inequality than states 
with lower inflation levels and lower numbers of workers in large establishments. 

Turning to the military spending variables, neither had significant main ef- 
fects on income inequality. However, over time, the findings suggest that income 
inequality tends to decrease in states with higher levels of contract spending compared 
to states with lower levels of contract spending (γ = -.014, p < .05). The 
interaction of personnel spending with time was not significant, which indicates 
that spending on personnel does not significantly increase or reduce income in- 
equality either initially or over time. Once again the findings for spending on 
contracts support the military Keynesianism perspective. 

The variance components of this model indicate that the inclusion of additional 
level 1 and level 2 predictors could improve the fit of the model. The 
fitted model is again a significantly better fitting model than the null model 
(χ² = 6587.3 - 5352.7 = 1234.6, p < .001) and this time it explains about 18.7 
percent of the variance in the model. 

The final model, representing the poverty rate (column 4 of Table 2), indicates 
that, net of other predictors, the post-Cold War and post-9/11 periods experi- 
enced higher levels of poverty than the Cold War period. Also, the main effect of 
South is negative at the beginning of the time series (γ = -4.446, p < .05). When 
only main effects are included, the effect of South is positive and significant. 
However, the inclusion of interaction terms South*percent Democratic legisla- 
tures and South*Fortune 500 (log) turns the main effect of South negative. The 
interaction terms suggest that these two variables operate differently in the South 
than in other regions of the country. 

Turning to the time-varying predictors, states experiencing the growth of capital 
concentration, those with higher percentages of blacks, those with larger populations, 
those with greater proportions of GSP from manufacturing (i.e., less deindustrialization), 
and those with higher rates of non-defense spending all had higher poverty 
rates compared to states without such characteristics. On the other hand, states 
following years of national economic growth, those experiencing higher inflation 
rates, those with more Democratic state legislatures, those with greater percentages 
of workers employed in large establishments, and those with more Fortune 500 firms 
all had lower poverty rates relative to states without such traits. 

As for the linear trend term, it was positive and significant, which indicates that 
the average poverty rate across the states grew larger over our period of study. We 
found, however, that this effect was quadratic and took the shape of an inverted-U, 
which suggests that poverty rose early in the period but then fell later in the period. 
In general, states with greater proportions of GSP from contracts and personnel 
experienced declining rates of poverty over time. However, the decline in states 
with high levels of personnel spending was also found to be non-linear and, in fact, 
reversed course toward the end of the period. These results support the military 
Keynesian approach with regard to contract spending, but offer mixed support 
with regard to personnel spending. 

Finally, the variance components of this model indicate that the inclusion of 
additional level 1 predictors could improve the fit of the model, but the variance 
for the rate of change needs no additional variables. Again, the model presented in 
Table 2 fits the data significantly better than the null model (χ² = 6094.6 - 5379.3 
= 715.3, p < .001) and explains about 11.7 percent of the variance in this model. 

Overall, the results presented above suggest that the two military spending 
measures have distinct effects on the four measures of economic well-being. 
Specifically, states with high levels of spending on contracts were found to have 
lower unemployment rates, higher levels of median family income, lower levels 
of income inequality, and lower poverty rates over time. These results support the 
military Keynesianism hypothesis that increased military spending improves economic 
well-being. When considering personnel spending, the results were more 
ambiguous. That is, states with higher levels of spending on military personnel 
were found to have lower unemployment and poverty rates, but also to have lower 
levels of median family income and about the same levels of income inequality as 
states with less spending on personnel. These results highlight the need to disaggregate 
the components of military spending and suggest that spending on defense 
contracts is more conducive to increased economic well-being. Moreover, the findings 
reported above are consistent with Wallace et al.’s (2008) previous state-level 
findings that, over the same time period, the determinants of defense contracts 
were closely aligned with the countercyclical argument of military Keynesianism, 
while the determinants of defense personnel were not. 


Conclusions and Implications 


This article has shown that military spending improves several indicators of eco- 
nomic well-being at the state level. In broad strokes, the results in Table 2 tend 
to support a military Keynesian interpretation. Our results imply that military 
spending staves off the deleterious effects of deindustrialization on economic 
well-being. Further, we find that improvements in economic well-being are more 
closely linked to spending on defense contracts than defense personnel. 

Our primary analyses focused on the short-term effects of military spend- 
ing. These are typically of concern to elected officials seeking to maintain their 
incumbency according to the military Keynesianism perspective. However, there 
seemed to be some evidence in our growth curve models for longer-term ef- 
fects. The permanent war economy carries tremendous bureaucratic momentum 
(Nincic and Cusack 1979) that mitigates dramatic reform of the military estab- 
lishment or long-term reductions in military spending. The entrenched interests 
of the military-industrial complex including the armed forces, veterans, private 
contractors, military-dependent communities and political lobbies augur for the
continuance of military spending. This powerful constellation of forces makes it 
easier to ratchet up military spending in the aftermath of 9/11, for example, than 
to decrease it in times of peace. In the face of a largely acquiescent and malleable 
citizenry, politicians can easily mobilize patriotic fervor to increase military spend- 
ing, effectively pre-empting other alternatives for stimulating the economy. 

Our results hold important implications for the current era, which is permeated 
by the “global war on terrorism.” Under George W. Bush, the military budget ex- 
panded at a rate reminiscent of the Reagan build-up of the 1980s. Although slowed 
by the Obama administration, the buildup continues. On the surface, the potential 
consequences of this increase in military spending seem similar to those of the 
1980s, but there are some underlying differences. During the Reagan build-up, the 
personnel and contracts components of the defense budget grew proportionately. In
the era of the “new military,” however, the conventional military force is decreasing
as reflected in the personnel budget; yet, the resources devoted to the procurement
of weaponry and equipment are increasing. This shift in defense priorities has been
justified by the unique nature of the American adversaries in the 21st century.

None of our findings should be read as advocating higher levels of military 
spending for the sake of improving economic well-being. Instead, the results 
simply underscore one of the stubborn realities of contemporary American capital- 
ism: 60 years after the end of World War II, the United States is still dominated 
by a “permanent war economy” that requires large amounts of military spending 
to stave off economic stagnation. Importantly, military spending remains crucial 
for mustering higher levels of economic well-being among the citizenry which, in 
turn, is vital for the re-election of political incumbents. 


Notes 


1. These figures do not include funding for the Department of Energy's nuclear weapons
programs or expenses for ongoing combat operations in Afghanistan and Iraq, an
additional $230 billion.


2. The discussion of Groton is included as an example of the impact of military spending
at the local level. For an exhaustive case study of this type, see Hicks and Raney (2003). 


3. In 1989, manufacturing jobs in New London County outnumbered service jobs by
about 2 to 1, but just eight years later the ratio reversed (U.S. Census Bureau 1989;
1997). Today, most new jobs are located in two Native American casinos, which
employ more than 21,000 people. However, these new jobs are far less desirable (i.e.,
lower wages, fewer benefits, and less job security) than those lost in the manufacturing
sector (Greenwald 1994; Benedict 2005).


4. Also, Derouen and Heo (2000, 2001) found that spending on defense contracts was used 
by U.S. presidents to increase approval ratings as well as to improve a weak economy. 


5. Our dataset actually begins in 1976, but the first year is lost because we include lagged 
values of several predictors in each model. We excluded the District of Columbia 
because it is not a state and because it has an inordinately high amount of military 
spending. We also omitted Alaska, because tests for spatial autocorrelation (e.g., Anselin 
1988; Odland 1988) and other regression diagnostics identified it as a problematic case. 


6. Nebraska does not report the party composition of its legislature, so, as a proxy, we
used the proportion of the delegations to the U.S. Senate and House of Representatives
that were Democrats.


7. We understand that the ideology of Independents might differ significantly from 
that of Republicans; however, we chose this coding on purely empirical grounds as 
Independent governors made up only 18 of 1,372 state-years (or 1.3%). Also, when 
partisanship was coded separately, the conclusions were the same. 


8. As an alternative control for deindustrialization, we experimented with the percent
of the state population employed in the service sector. The conclusions of the models
were unchanged.


9. We also tried soldiers/population (Kick, Davis and Kentor 2006) and expenditures/soldier
(Jorgenson 2003) and found these measures to be highly correlated with
personnel spending. Using these alternatives did not change the conclusions of the
models presented below.


10. One alternative to the growth curve model for multilevel models such as ours is the 
“latent” growth curve model from the structural equation model framework (Hox 
and Stoel 2005). The LGC model has several strengths: (1) it can examine multiple
dependent variables simultaneously in a single model, (2) there is a test of overall
model fit, and (3) it produces a graphical representation of the model that makes it
easy to understand the effects. 


11. We used PROC MIXED in SAS to estimate our models (Littell et al. 1996; Singer 1998).


12. It is customary to “trim” models when conducting multilevel modeling, where
“trim” means to remove predictors that are not significant (e.g., Singer and Willett 
2003). However, for consistency, we kept the same controls across models and only 
“trimmed” the non-significant interaction terms. 


13. There are many possible reasons that our findings might differ from those of previous 
studies, including: (1) differences in the unit of analysis (countries vs. states); (2)
single country vs. cross-national analysis; (3) advanced vs. developing countries; (4)
longitudinal vs. cross-sectional designs; (5) different time periods; (6) measurement of
independent and dependent variables; (7) different model specifications; (8) varying
statistical techniques; and (9) emphasis on short-run vs. long-run effects. A thorough
discussion of these differences is beyond the scope of this research. Because of these 
considerations, we are cautious about extrapolating our findings to other countries 
during other historical time periods. Military Keynesian theory was devised and is 
best-suited to address developments in causes and consequences of military spending 
in post-World War II United States; whether this logic extends to other advanced 
countries (possible) or developing countries (unlikely) is an empirical question. 


14. Despite the sophistication of the F-22, its critics claim that it is a Cold War relic and is
not strategically necessary in today’s military configuration. In July 2009, both the U.S.
House of Representatives and the Senate voted to cut funding for the F-22 from their
defense authorization bills. At this writing, the two bills are in conference committee.


Appendix A. The Allocation of Military Spending to the U.S. States


The claim in this article that political incumbents use defense contract spending as 
a countercyclical fiscal tool to forestall economic stagnation does not require that 
they do so with pinpoint accuracy as to where these funds are allocated. The process 
takes place within a well-established institutional structure that severely constrains 
the geographical allocation of resources, but also allows some institutional autonomy. 

The U.S. Congress makes budget allocations for annual defense expenditures 
more or less in line with a budget proposal by the President and the Executive 
Branch. Prior to these allocations, many different interest groups, including 
Department of Defense officials, military leaders, private defense contractors, 
policy experts, as well as national and state political incumbents, weigh in both 
publicly and privately with their defense funding priorities. A complicated bid- 
ding process that tends to favor a few elite private contractors usually precedes 
the development of major defense initiatives and continues over the life of the 
project. Larger projects typically involve multiple defense contractors, even fierce 
rivals in the industry, as production partners, and segments of the production and 
design process are subcontracted to other more specialized companies. Governors 
and state legislatures have a less visible, but still critical, role in maintaining an
economic climate that makes their states attractive locations for defense contractors.
In effect, there is a “war between the states” in which state
governments provide competitive economic incentives, tax breaks and other con- 
siderations for business to locate in their states. Ultimately, the exact allocation of 
defense funds represents the ongoing confluence of interests and struggles among 
a complex network of defense contractors and subcontractors; federal, state and 
local governmental actors; and the defense establishment, i.e., a variegated form 
of Mills’ (1956) power elite. 

Consider the F-22 Raptor fighter jet, a new-generation advanced tactical
fighter with supersonic speed, stealth capability and other features that cannot be 
matched by any known or projected fighter aircraft. The current production contract 
for the F-22 Raptor is owned by the Bethesda, Maryland-based Lockheed Martin 
Corporation. Lockheed Martin’s main production facility for the F-22 is located in 
Marietta, Georgia and key subassembly plants are located in Florida, Mississippi, 
Pennsylvania and West Virginia. Program partner Boeing, with production facilities 
in several states, provides the wings, aft fuselage, avionics integration, and all of the 
pilot and maintenance training systems. The engine is produced in Connecticut by 
Pratt & Whitney, part of defense conglomerate United Technologies. All told, the 
design and production of the F-22 Raptor involves contributions from more than 
1,000 suppliers in 44 states and generates about 95,000 jobs, many of them for 
workers in unions like the International Association of Machinists and Aerospace 
Workers. This made the F-22 politically unassailable for almost three decades and 
cost the taxpayer about $67 billion to maintain.

The role of state and local governments in attracting defense contractors and
the benefits they derive from such relationships can be illustrated by a single pro- 
posal which was intended to expand a small Lockheed Martin facility in Oswego, 
New York from 80 employees to 255 employees (Tioga County Industrial 
Development Agency 2004). This 2004 bid involved a partnership among the
Tioga County Department of Economic Development and Planning, the Tioga 
County Economic Development Agency, County of Tioga, Town of Oswego, 
Empire State Development, and Lockheed Martin. The proposal responded to a 
request by Lockheed Martin to the town of Oswego for “economic development 
assistance” and would be contingent on the Defense Department's final approval. 
The project would involve a $29.1 million investment from Lockheed Martin 
for construction of a new building and would be subsidized by $2.6 million in 
tax savings on construction of the facility and an additional $19.7 million in real 
property tax savings over a 20-year period. 

The new facility would add 175 jobs and $13.3 million in income for Lockheed 
Martin employees in Tioga County and another 1,185 jobs and $47.6 million for 
other employees in the county based on multiplier effects. The new facility would 
also increase property values (and thus property taxes) for other commercial prop- 
erties in the town. The proposal further notes that the presence of the Lockheed 
Martin Oswego site could make Oswego a “state-of-the-art center of excellence” 
and lead to future high technology employment in the area. The proposal also 
warns that without the proposed tax incentives, the Lockheed Martin Oswego 
facility is likely to relocate out of state. 
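
The figures quoted from the proposal can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function and key names are hypothetical, the inputs are the numbers reported above, and the derived ratios (subsidy share of the investment, subsidy per direct job, indirect jobs per direct job) are back-of-the-envelope calculations that do not appear in the proposal itself. The property tax savings are summed without discounting over the 20-year period, which is an assumption.

```python
# Figures as reported in the proposal summary above (dollar amounts in millions).
EMPLOYEES_BEFORE = 80
EMPLOYEES_AFTER = 255
INVESTMENT = 29.1                 # Lockheed Martin's construction investment
CONSTRUCTION_TAX_SAVINGS = 2.6    # tax savings on construction
PROPERTY_TAX_SAVINGS = 19.7       # real property tax savings over 20 years
INDIRECT_JOBS = 1185              # jobs attributed to multiplier effects
DIRECT_INCOME = 13.3              # income for Lockheed Martin employees
INDIRECT_INCOME = 47.6            # income for other employees in the county


def summarize_proposal():
    """Derive simple ratios from the reported figures (hypothetical helper)."""
    direct_jobs = EMPLOYEES_AFTER - EMPLOYEES_BEFORE
    total_subsidy = CONSTRUCTION_TAX_SAVINGS + PROPERTY_TAX_SAVINGS
    return {
        "direct_jobs": direct_jobs,
        "total_subsidy_musd": round(total_subsidy, 1),
        "subsidy_share_of_investment": round(total_subsidy / INVESTMENT, 3),
        "subsidy_per_direct_job_usd": round(total_subsidy * 1_000_000 / direct_jobs),
        "indirect_jobs_per_direct_job": round(INDIRECT_JOBS / direct_jobs, 2),
        "total_income_musd": round(DIRECT_INCOME + INDIRECT_INCOME, 1),
    }


if __name__ == "__main__":
    for name, value in summarize_proposal().items():
        print(f"{name}: {value}")
```

The direct job count derived this way matches the 175 jobs cited in the proposal; the remaining values are rough indicators of how large the public subsidy is relative to the private investment.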


Is the United States Experiencing a “Matrilineal Tilt?”: Gender,
Family Structures and Financial Transfers to Adult Children 


Shelley Clark, McGill University 
Catherine Kenney, Bowling Green State University 


Furstenberg et al. (1995) suggested that one unanticipated consequence of current 
high levels of divorce might be a “matrilineal tilt” in intergenerational wealth flows. 
This research uses six waves of the Health and Retirement Survey (1992 to 2002) to 
investigate this possibility with respect to financial transfers from parents to their adult 
children. We find that although divorced single fathers continue to make transfers to 
their adult biological children, remarriage substantially reduces fathers’ transfers while 
it increases mothers’ transfers to their biological children. Our findings are consistent 
with both socio-evolutionary and exchange theories predicting women’s vs. men’s 
investments in biological vs. stepchildren. 


Introduction 


Since the mid-20th century, American society has witnessed remarkable changes
that could transform the way financial resources are transferred from parents to
their adult children. Among the most notable trends are: (1) a prolonged financial
dependence of adult children on their parents, (2) a dramatic rise in the rate of
divorce and remarriage, and (3) a steady increase in women’s labor force participation
and control over assets and wealth. Much recent scholarly and media attention
has been given to what is sometimes dubbed the emerging stage of “adultolescence.”
It is becoming increasingly common for children older than 18 to remain at home,
supported at least partially by their parents, particularly while they are completing
schooling (Fussell and Furstenberg 2005). After leaving home, children often expect
and receive considerable financial assistance from their parents well into their
20s and 30s to help pay for college and other educational or vocational training,
weddings, and new houses. Parents may also be called on to help out during difficult
circumstances, such as job loss, divorce or illness (Aquilino 2005; Goldscheider
et al. 2001; Ploeg et al. 2004). At the same time, the “divorce revolution” that
occurred between the mid-1960s and the late-1970s, when divorce rates more
than doubled, means that many of the biological parents of today’s adult children
are no longer married to each other and have acquired new spouses (Furstenberg
1994). The introduction of stepparents, who may or may not have previous biological
children of their own, undoubtedly complicates financial transfers to adult
children. Who gets what assistance and who makes these decisions can be a touchy
issue, especially when norms about step-parent and step-child relationships are not
clearly defined (Cherlin and Furstenberg 1994; Cherlin 1978). Added to this mix
is women’s increasing control over individual and household resources, through
both their increased participation in the labor force and their rising levels of wealth
(Raley et al. 2006). The confluence of these three forces has important implications
for intergenerational wealth flows that have yet to be fully understood, but which
are likely to differ radically from the past.

The authors wish to thank Céline LeBourdais and the anonymous Social Forces reviewers for their
valuable comments as well as Michelle Pannor Silver and Daphne Lofquist for their helpful research
assistance. Direct correspondence to Shelley Clark, McGill University, Department of Sociology,
Stephen Leacock Building, Room 713, 855 Sherbrooke St. West, Montreal, Quebec, H3A 2T7.
E-mail: shelley.clark@mcgill.ca.

© The University of North Carolina Press Social Forces 88(4) 1753-1776, June 2010

Historically, financial transfers from parents to adult children occurred primar- 
ily through inheritance. Before 1840, when the practice of coverture under com- 
mon law dictated that all property women brought into marriage or subsequently 
acquired was legally owned by their husbands, women were generally unable 
to transfer wealth to their children either while they lived or upon their deaths 
(Gundersen 1998). By contrast, when a married man died, two-thirds of his estate 
transferred directly to his children, while a third was reserved for the support of his 
widow (Seed 1995). Consequently, the bulk of wealth flowed directly from fathers 
to their children, and mothers were viewed more as “competitors” for wealth 
transfers than as sources of financial assistance to their adult children (Shammas 
1987). Changes in women’s rights to control and manage property and to hold 
jobs outside the home have dramatically shifted mothers’ financial situation vis-à-vis
their children. Equally important today, divorce rivals spousal death as the
most likely cause of marital dissolution (Cherlin 1992). 

Such changes have important implications for how wealth flows from one 
generation to the next. Divorced mothers enjoy far greater contact with, and 
support from, their adult biological children than do divorced fathers, according 
to a large and growing body of literature (see Kalmijn 2007 for a thorough sum- 
mary). A much smaller body of research assesses whether these stronger maternal 
intergenerational ties translate into greater financial transfers from mothers to 
their biological adult children following divorce and remarriage. The potential 
presence of such effects has prompted some researchers to suggest that the United 
States may be experiencing a “matrilineal tilt” in its kinship system (Furstenberg 
et al. 1995). This research investigates the possibility by examining the effects of 
divorce and remarriage on financial transfers to adult children from their biologi- 
cal parents’ households. We pay particular attention to gender differences in the ef- 
fects of divorce and remarriage on parental financial transfers. Specifically, we ask: 
Do single divorced mothers give less than single divorced fathers, after controlling 
for differences in their financial resources? Does remarriage provide women with 
new opportunities to divert household resources toward their biological children, 
while simultaneously hindering men’s ability or willingness to financially support 
their adult biological children? Controlling for available parental resources and 
for characteristics of adult children, are divorced fathers more likely to give cash 
transfers to their biological children if they are single than if they remarry? Does 
the same relationship hold for mothers? To address these questions we use data
from six waves (1992 through 2002) of the Health and Retirement Survey. The 
use of this large, longitudinal data set allows us to employ several different analytic 
strategies to tease out potential reporting and selection biases as well as to mitigate 
the effects of unobserved fixed characteristics of the parents or children. 


Theoretical Perspectives and Previous Empirical Findings 


Although much of the recent sociological research on transfers between genera- 
tions leaves its guiding theory unstated, such research is most often implicitly 
influenced by some form of socio-evolutionary theory, intergenerational exchange 
theory or both. Here, we briefly review the predictions of evolutionary and ex- 
change theories for the association between parental family structure and intergen- 
erational transfers and discuss the findings of previous empirical research. 


Socio-Evolutionary Theories of Parental Investments 


Socio-evolutionary theories predict differences in how much parents invest in 
children by their degree of genetic relatedness and by the sex of the parents. 
Parental investment is generally defined as a contribution toward a particular 
offspring’s survival that entails some cost to the parent (in terms of resources, 
time, energy and the parent’s ability to make investments in other offspring) 
(Daly and Wilson 1987; Nielsen 1994). “Hamilton’s Rule,” which states that 
individuals will exhibit sympathy toward and favor others in proportion to the 
degree to which they are genetically related, suggests that parents will invest in 
their biological children but not in their stepchildren (Bergstrom 1996). This
principle is consistent with men’s supposed reluctance to “raise other men’s children”
and related insecurities about the paternity of the children they are raising
(Nielsen 1994). Given that women, as the bearers of children, are certain of their 
genetic relationship to the children they raise, less has been written about their 
relative willingness to “raise other women’s children.” Yet, gender differences 
in reproductive strategies suggest that women may have stronger incentives to 
follow Hamilton’s Rule than do men. To maximize the number of surviving 
offspring, individuals may adopt either a parental investment strategy (directing 
resources toward existing offspring) or a mating strategy (attempting to produce 
more offspring). Because the costs of producing more offspring are lower for 
men than for women, men tend to favor the mating strategy, while women tend 
to favor the parental investment strategy (Trivers 1972). The key insight provided
by Anderson, Kaplan and Lancaster (1999b) is that these two strategies are not 
always in opposition to one another. Instead, they argue that, as part of their 
mating strategy, men may be willing to provide resources to a woman's previous 
children (to whom they are not genetically related) in order to improve their 
chances of securing her as a mate and having children with her (Anderson et al. 
2001). This arrangement also complements women’s preferred strategy to invest 
in existing biological children. From a theoretical perspective, we might expect
that, on average, men may be relatively more willing than women to support 
non-biological children. 
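
For reference, Hamilton's Rule as invoked above is conventionally written as an inequality. The symbols r, B and C are the standard notation from the evolutionary literature rather than notation used in this article, and the relatedness values shown are the textbook ones for a biological child and an unrelated stepchild.

```latex
% r: coefficient of genetic relatedness between the investor and the recipient
% B: benefit to the recipient; C: cost to the investor
\[
  rB > C
\]
% Textbook values of r for the two cases discussed in the text:
\[
  r_{\text{biological child}} = \tfrac{1}{2}, \qquad
  r_{\text{stepchild}} = 0
  \;\Longrightarrow\;
  0 \cdot B = 0 \not> C \quad \text{for any } C > 0 .
\]
```

Read literally, the inequality can never be satisfied for a stepchild, which is why the rule alone predicts no investment in stepchildren and why the mating-effort argument is needed to explain men's support of a partner's previous children.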

Empirical studies suggest that evolutionary theories provide substantial ex- 
planatory power. In their study of parental investment in young children, Biblarz 
and Raftery (1999) find that the evolutionary theoretical framework offers a 
more consistent explanation of their empirical findings than the other six theories
they consider. However, they argue that any stepparent will act as competitor for 
resources and therefore diminish parental investments, whereas Anderson and his 
colleagues (1999b) contend that the addition of a stepfather may increase total 
investments in a woman’s previous biological children. Anderson and colleagues 
(1999a, 1999b) found that men invest equally in their co-resident stepchildren 
(mating effort) and in their non-co-resident biological children (parenting effort). 
Perhaps because relatively few biological mothers live apart from their children, 
there are no comparable studies involving mothers and stepmothers. However, 
Case, Lin and McLanahan (1999) found evidence of lower investments in chil- 
dren (in the form of spending on food) in households with a stepmother and 
biological father than in those with biological mothers, but there were no differences
in spending on food for children living with their biological father vs.
their stepfather. Studies examining blended families in which both biological and 
stepchildren co-exist in the same household also tend to find that the biological 
children of the mother are favored relative to her stepchildren (i.e., the biological 
children of the father). Case, Lin and McLanahan (2001), for example, found 
that children raised by a stepmother received, on average, a year less schooling 
than did the woman's biological children. These results are consistent with evo- 
lutionary theory's prediction that women invest more in parenting effort relative 
to men, leading women to direct resources toward existing biological children 
over stepchildren, while men direct resources to both. Whether these differential 
investment strategies persist after the child leaves the household is difficult to 
determine. Zvoch (1999), for example, finds that parents of high school seniors 
plan to devote significantly fewer economic resources to the college educations of 
stepchildren vs. biological children, but the study does not differentiate between men’s
and women’s stepchildren. 


Exchange Theories of Intergenerational Transfers 


While socio-evolutionary theories focus on understanding reproductive strategies 
and, hence, parental investments in young children, theories of intergenerational 
exchange are often applied to parent-child relationships later in the life course. In 
their summary of the intergenerational exchange literature, Ganong and Coleman 
(2006) distinguish between intergenerational exchange theories based on reciproc- 
ity (i.e., parents invest in children because they expect those children to care for 
them in their old age) and those that are based on a looser sense of intergenerational
solidarity, suggesting that transfers serve as both a symbol of closeness
between family members and a means for binding the generations together. Levels 
of intergenerational exchange are most often measured in terms of provision of 
time, emotional support and financial transfers between parents and their adult 
children. These transfers can flow in both directions, but in general adult children 
tend to provide care and support to their older parents, while the overwhelming 
majority of financial transfers flow from parents to children, at least until their 
parents reach the age of 70 (Cooney and Uhlenberg 1992). 

The bulk of the literature focuses on the impact of divorce and remarriage on
upward transfers of emotional support, social contact and care from adult children 
to their older parents. Both the reciprocity and the solidarity perspective would 
predict that divorce and remarriage would weaken upward transfers, particularly 
to fathers. Specifically, if the divorce occurred before the child left the home, there 
will be reduced contact with, and support from, non-custodial parents (primar- 
ily fathers) resulting in a diminished sense of obligation (reciprocity) as well as 
reduced closeness (solidarity). In his thorough review of prior studies of the effects 
of divorce on upward transfers from adult children, Kalmijn (2007) concluded 
that all studies found that divorce reduced contact between fathers and children,
but there was considerable variation across studies in whether divorce was associ- 
ated with reduced contact between mothers and their biological children (more 
recently, see also Lin 2008). In addition, Kalmijn finds that remarriage further 
weakens ties between parents and their adult children, with this effect being stron- 
ger for fathers than mothers. Kalmijn interprets these findings through the lens 
of intergenerational solidarity and argues that women in their role as “kinkeepers” 
are primarily responsible for maintaining family ties. 

If women act as “kinkeepers,” do they also serve as “holders of the purse strings,” 
moderating when and to whom downward financial transfers are made? Although 
some evidence suggests that gift-giving may be a household spending domain over 
which women have more control than men (Woolley and Marshall 1994), overall, 
the empirical evidence addressing this question is rather thin and the theoretical 
implications of intergenerational exchange theory are ambiguous. On the one hand, 
theories of reciprocity might indicate that mothers are more likely to have already
“paid their dues” and, thus, not be expected to provide their adult children with
additional financial support. On the other hand, mothers may wish to secure their
investments in their children through continued financial support, so that in years to
come they will be assured of their children’s support. The theory of intergenerational solidarity
unambiguously suggests that to the extent that women are closer to their adult bio- 
logical children, they will also be more likely to provide them with financial transfers. 

A handful of articles have addressed the effects of divorce (and, to a lesser 
extent, remarriage) on financial transfers to adult children. Many of these articles 
either face substantial data limitations or fail to examine whether these effects 
differ for mothers vs. fathers. Studies generally agreed that divorced parents give 
or are willing to give fewer financial transfers to their adult children than parents
who remain married to each other, although at least part of this difference may be 
attributable to lower levels of wealth following divorce (Cooney and Uhlenberg 
1992; Eggebeen 1992; Furstenberg et al. 1995; Marks 1995; Pezzin and Schone 
1999; White 1992). Unfortunately, many of these studies rely on data from the 
National Survey of Families and Households, which asks about transfers to any 
child and does not indicate whether or not this child is genetically related to the 
parent (Cooney and Uhlenberg 1992; Eggebeen 1992; White 1992). The one 
study that draws on data from the Health and Retirement Survey, which does ask 
about transfers to specific children, analyzes transfers at the household level rather 
than the adult child level, limiting its ability to make direct comparisons between 
step- and biological children (Killian 2004). Other studies examine parents’ at- 
titudes toward giving financial support or children’s perceptions of their parents as 
a potential source of help rather than actual transfers (Aquilino 2005; Cooney and 
Uhlenberg 1992; Marks 1995). Using data from AHEAD, Pezzin and Schone 
(1999) found that divorce reduced cash transfers from single elderly parents (over 
the age of 70) to their adult children, particularly for fathers. They also found that 
having ever been remarried further reduced cash transfers, but given their sample 
of single parents, they could not examine whether being currently remarried
affected transfers. Using supplementary data from the 1988 PSID, Furstenberg, 
Hoffman and Shrestha (1995) offer the most thorough investigation of the effects
of divorce by gender. They find that about 22 percent of divorced mothers, but 
only 11 percent of divorced fathers, gave their children financial assistance. They
also found that remarriage reduced fathers’ propensity to give assistance more 
than mothers’, but that none of the effects of remarriage were statistically sig- 
nificant (Furstenberg et al. 1995). In comparison, in his household-level analysis 
of the HRS, Killian (2004) found that 31 percent of households with paternal 
stepchildren made transfers to their adult children compared to only 21 percent
of households with maternal stepchildren.


Other Predictors of Financial Transfers to Adult Children 


Beyond divorce and remarriage, several other characteristics of parents and adult 
children are likely to be associated with financial transfers. Perhaps most impor- 
tantly, parents’ ability to give financial assistance affects their actual transfers. 
Divorce tends to reduce household wealth, especially for women, while remarriage 
increases it. Race and ethnicity have also been found to be important, with blacks 
consistently engaging in fewer intergenerational transactions than whites (Hogan 
et al. 1993; Lee and Aytac 1998). Parents’ educational status, age and overall 
health may affect whether and how much they give to their adult children. Parents 
with more children and stepchildren tend to give less to each child (Killian 2004). 
The adult child’s characteristics and needs also influence whether they receive
financial help from their parents (Hao 1996). Parental financial support typically
declines with the age of the child (Cooney and Uhlenberg 1992), while whether
sons or daughters receive more help depends on the type of help (Cooney and 
Uhlenberg 1992). Finally, economic need appears to be important, but current 
marital status of the child is not highly predictive of transfers (Eggebeen 1992). 


Data and Methods 
Sample 


This study uses longitudinal data from six waves of the Health and Retirement 
Survey's main cohort from 1992 to 2002. The HRS main cohort is drawn from 
a stratified random sample designed to be representative of households with at 
least one member born in 1931-1941 (approximately 51 to 61 years old in 1992 
and approximately 61 to 71 years old in 2002). The survey oversampled blacks, 
Hispanics and Florida residents. Extensive information was gathered on the 
individual and household characteristics of the main HRS cohort respondents, 
including income and assets, health, and marital status and histories. Because we 
are interested in the effects of divorce and remarriage on transfers to adult chil- 
dren, all never-married parents in the HRS were dropped from our sample. The 
survey includes detailed information on intergenerational transfers. Particularly 
important for this study is the fact that, unlike the NSFH, the HRS asks specifically
about transfers to each adult child, including step- and biological children
of the family respondent in the parental household.¹ Thus, we are able to identify
transfers to specific adult children and determine their relationship to the adults 
in the parental households. In addition, the HRS collected information about 
each child’s own socio-demographic characteristics. Although the information in
the HRS was collected from parents, our analysis is conducted at the level of adult 
child (over the age of 18). We pool data from the six survey years, resulting in a 
sample of 80,007 observations on 22,517 adult children associated with 5,295 
parental households. Thus, our unit of analysis is adult-child survey years. In
addition to attrition through the loss of HRS respondents across survey waves 
(through death or inability of HRS to contact the household), our focus on the 
adult children complicates who is included in different years. For example, be- 
cause we are interested in transfers to adult children, we include only children over 
age 18 in our sample. Thus, a child who was under 18 in 1992 but reached age 18 
some time before 2002 will first appear in our sample in a later wave. The largest
number of adult children, representing 21 percent of the sample, is represented in 
all six survey years, while 17 percent are represented in only one year. 
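
The inclusion rule described above can be sketched as follows. This is a minimal illustration, not the authors' code: the record layout (`child_id`, `birth_year`, `waves_observed`) and the function names are assumptions, and age is approximated as survey year minus birth year.

```python
from collections import Counter

# The six HRS main-cohort survey years pooled in the analysis.
WAVES = (1992, 1994, 1996, 1998, 2000, 2002)


def child_year_observations(children, min_age=18):
    """Return one row per adult child per survey year.

    `children` is a list of dicts with hypothetical keys:
    child_id, birth_year, and waves_observed (the set of survey
    years in which the parental household was interviewed).
    """
    rows = []
    for child in children:
        for wave in WAVES:
            if wave not in child["waves_observed"]:
                continue  # household not interviewed in this wave (attrition)
            age = wave - child["birth_year"]  # approximate age at interview
            if age > min_age:  # only children over age 18 enter the sample
                rows.append({"child_id": child["child_id"], "wave": wave, "age": age})
    return rows


def waves_per_child(rows):
    """Count how many survey years each child contributes."""
    return Counter(row["child_id"] for row in rows)


# Hypothetical children: A is an adult throughout, B ages into the
# sample, and C's household is observed only once.
children = [
    {"child_id": "A", "birth_year": 1960, "waves_observed": set(WAVES)},
    {"child_id": "B", "birth_year": 1978, "waves_observed": set(WAVES)},
    {"child_id": "C", "birth_year": 1965, "waves_observed": {1992}},
]
rows = child_year_observations(children)
counts = waves_per_child(rows)
```

In this toy input, child B is under 18 in the early waves and therefore first contributes an observation in a later wave, which is the pattern the text describes.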


Analytic Strategy 


In our basic model, we draw on this pooled cross-sectional sample to examine how 
the adult child’s relationship to the parental household is related to the probability
that he or she was given a financial transfer of over $500. In our initial logistic
regression, we include dummy variables for each survey year. In all our models,
we also use robust standard errors, which are clustered at the parental household 
level because there are multiple observations on individuals across the waves and 
potentially more than one adult child per parental household.² We then add
controls for parental household characteristics and adult-child characteristics to 
examine how the inclusion of these variables modifies the effect of parent-adult 
child relationship on financial transfers. 
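
As a rough illustration of how the basic model's predictors might be coded, the sketch below builds one row of dummy variables for the parent-child relationship and the survey year. The variable names are hypothetical; the choice of reference categories (two biological parents, 1992) follows the table of odds ratios. Estimation and the clustering of standard errors by parental household are not shown.

```python
# Relationship categories; the first entry is the reference category.
RELATIONSHIPS = (
    "two_biological_parents",
    "bio_mother_stepfather",
    "bio_father_stepmother",
    "single_bio_mother",
    "single_bio_father",
)

# Survey years; the first entry (1992) is the reference year.
WAVES = (1992, 1994, 1996, 1998, 2000, 2002)


def design_row(relationship, wave):
    """Return intercept plus 0/1 indicators for one adult-child survey year."""
    if relationship not in RELATIONSHIPS:
        raise ValueError(f"unknown relationship: {relationship!r}")
    if wave not in WAVES:
        raise ValueError(f"unknown survey year: {wave!r}")
    row = {"intercept": 1}
    # One indicator per non-reference relationship type.
    for name in RELATIONSHIPS[1:]:
        row[name] = int(relationship == name)
    # One indicator per non-reference survey year.
    for year in WAVES[1:]:
        row[f"year_{year}"] = int(wave == year)
    return row


reference_row = design_row("two_biological_parents", 1992)
stepmother_row = design_row("bio_father_stepmother", 1998)
```

A child of two biological parents observed in 1992 has every indicator set to zero, so each estimated odds ratio is read relative to that group.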

Despite the inclusion of these control variables, however, our results remain 
vulnerable to biases from both the selectivity into different parental households 
as well as the study design. We therefore conduct additional analyses to explore 
the extent of three potential sources of bias. First, divorce and remarriage reflect 
a highly selective process and thus, for example, women with only their own 
biological children may differ in unmeasured or even unobservable ways from 
women who have stepchildren. In our second set of analyses, we rely only on data 
from “blended families” to minimize these selection effects. We define blended 
families as households containing both a biological child of the mother as well 
as a stepchild of the mother, and we use reports given solely by the mother.³ The
biological child of the mother could be from either a previous marriage or her 
current marriage. Because the same woman is thus reporting transfers to different 
kinds of children within her household, this analysis avoids bias due to unobserved 
characteristics of stepmothers. In our pooled sample of blended families, we are 
able to identify 5,131 adult children in blended families, resulting in a total 
sample of 13,377 observations of these children across all six waves. 
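
The blended-family restriction can be sketched as a simple filter. This is an illustrative assumption about the data layout, not the authors' procedure: each report is a hypothetical tuple of household, child and the child's relation to the reporting mother, and the input is assumed to contain mothers' reports only.

```python
from collections import defaultdict


def blended_households(reports):
    """Return ids of households where the mother reports on both a
    biological child and a stepchild.

    `reports` is an iterable of (household_id, child_id, relation)
    tuples, with relation equal to "biological" or "step".
    """
    relations = defaultdict(set)
    for household_id, _child_id, relation in reports:
        relations[household_id].add(relation)
    return {
        household_id
        for household_id, seen in relations.items()
        if {"biological", "step"} <= seen
    }


def blended_sample(reports):
    """Keep only reports from blended households."""
    keep = blended_households(reports)
    return [report for report in reports if report[0] in keep]


# Hypothetical reports: only household 1 has both kinds of children.
reports = [
    (1, "a", "biological"),
    (1, "b", "step"),
    (2, "c", "biological"),
    (2, "d", "biological"),
    (3, "e", "step"),
]
sample = blended_sample(reports)
```

Because every retained household contributes children of more than one type, comparisons across child types are made within the same reporting mother.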

The second potential source of bias stems from how the HRS was administered.
Specifically, within all couple households, in an effort to minimize the burden 
imposed on any one respondent, HRS designated one adult in the couple as the 
“family respondent” (who reported all transfers to and from parents and children) 
and the other adult in the couple as the “financial respondent” (who reported 
income, assets, etc.). Based on a belief that women are likely to be more informed 
than men about family relationships, the HRS (particularly prior to 1998) gave 
preference to the female in a couple as the family respondent. As a result, the large 
majority of HRS households with remarried parents of adult children are based 
on women’s reports about financial transfers to her adult biological children (his 
stepchildren) and her stepchildren (his biological children). This preference for 
female family respondents — and the resulting over-representation of stepmoth- 
ers — as reporters of financial transfers could generate two potential sources of 
bias. First, stepparents may be generally less knowledgeable about transfers to 
their stepchildren and underreport these transfers. This bias could be quite large
for relatively small financial transfers, but we expect it would diminish as the size 
of the transfer increases. A second form of bias may arise from selectivity among 
remarried fathers who are selected as the family respondent. The relatively small 
percentage of remarried men who act as the family respondents (only .6% of our
sample) are likely to represent households in which the adult female was unavailable,
unable, or unwilling to be interviewed. These may also be households in
which men may have particularly close relationships with their children. 

A third potential source of bias is with respect to divorced, single fathers in 
the HRS. About a fifth of non-co-resident fathers do not remain in any kind of 
contact with their children (de Graaf and Fokkema 2007), and such men may 
not even report those children when surveyed (Cherlin et al. 1983). As a result, 
those single divorced fathers in the HRS who report having children are likely 
to be a select group, more likely to be engaged in exchanges with those children. 
To partially address both the second and third potential sources of bias, we create
a third sample which is limited only to respondents’ reports of their biological 
children. We then run fixed-effects models on this sample. Removing stepparent 
reports reduces our total number of observations across all waves from 80,007 
to 69,791. This analysis avoids potential problems with stepparents’ knowledge 
about transfers as well as addressing concerns about potential unobserved fixed 
characteristics of fathers or mothers in our sample. 
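
One practical property of a fixed-effects (conditional) logit is that units whose outcome never changes across waves contribute no information to the estimates. The sketch below shows that filtering step under an assumption the text does not state explicitly, namely that the fixed effect is defined per adult child; the tuple layout is hypothetical.

```python
from collections import defaultdict


def informative_observations(observations):
    """Keep child-year rows for children whose outcome varies over time.

    `observations` is a list of (child_id, wave, received_transfer)
    tuples, where received_transfer is 0 or 1.
    """
    outcomes = defaultdict(set)
    for child_id, _wave, received in observations:
        outcomes[child_id].add(received)
    # Only children observed with both outcomes identify the model.
    varying = {child_id for child_id, seen in outcomes.items() if seen == {0, 1}}
    return [obs for obs in observations if obs[0] in varying]


# Hypothetical data: only child A both receives and does not receive
# a transfer at different waves.
observations = [
    ("A", 1992, 0), ("A", 1994, 1), ("A", 1996, 0),
    ("B", 1992, 0), ("B", 1994, 0),
    ("C", 1992, 1), ("C", 1994, 1), ("C", 1996, 1),
]
kept = informative_observations(observations)
```

This is likely one reason a fixed-effects model is estimated on fewer observations than the sample it starts from.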


Measures 


Dependent Variable 

Transfer to Child. Table 1 provides descriptive statistics on the variables included
in our models.⁴ The dependent variable is a dichotomous variable based on a
question in the HRS that asked the family respondent whether he or she or his
or her spouse or partner⁵ gave financial help to any of their children totaling
$500 or more during the intervening two years since the prior HRS interview.⁶
If the answer was “yes,” the respondent was asked to specify which of the chil- 
dren associated with the household received such transfers. Our dependent 
variable equals one if the adult child received a transfer and equals zero if he or 
she did not. Financial help specifically includes giving money, paying bills, or 
covering other expenses relating to medical care, schooling, insurance, down 
payment on houses, and rent. Approximately a third of transfers are for tuition 
or schooling. In the pooled analysis sample, 18.1 percent of adult children had 
received a transfer in the preceding interval, which is roughly comparable to the 
16 percent reported to have received transfers over $200 in the NSFH collected 
in 1988 (Hogan, Eggebeen and Clogg 1993). 


Independent Variables 
Parent-Child Relationship Type 


The key independent variable in our analyses indicates the adult child’s relation- 
ship to the parental household. This categorical variable is defined as follows: with 
respect to the adult child, the parental household consists of (1) two biological
parents (reference category), (2) biological mother and stepfather, (3) biological
father and stepmother, (4) single biological mother only, and (5) single biological


1762 © Social Forces 88(4) 


Table 1: Descriptive Statistics on Financial Transfers and Parent and
Adult-Child Characteristics

                                          Full        Blended     Biological
                                          Sample      Families    Respondents
                                          N=80,007a   N=13,377b   N=69,791c
Dependent Variables
  Adult child received help > $500 (%)    18.1        14.7        19.1
Independent Variables
Parent-Child Relationship (%)
  Two biological parents                  63.9        48.2        73.0
  Biological mother with stepfather       4.1         7.0         4.3
  Biological father with stepmother       13.0        44.8        0.7
  Biological single mother                15h         na          15.8
  Biological single father                oa          na          6.0
Parent's Characteristics
  Age                                     57.8        55.6        58.3
  Race (%)
    White                                 78.3        77.2        78.1
    Black                                 17.7        19.4        17.8
    Other                                 4.0         3.4         4.1
  Hispanic (%)                            11.5        9.1         tee
  Education (%)
    Less than high school                 32.0        30.7        33.6
    High school diploma                   37.4        36.9        37.5
    Some college                          18.1        21.0        17.4
    College and above                     12.0        11.3        11.5
  Total housing assets ($)                73,439      65,091      74,179
  Total financial assets ($)              62,563      61,931      61,601
  Total income ($)                        48,342      53,744      46,696
Sibling Composition of Adult Child
  Number of biological siblings           3.0         2.4         3.0
  Number of step-siblings                 2           2           4
  Number of half-siblings                 A           25          an
  Any sibling living at home (%)          35.6        36.6        36.6
Adult Child's Characteristics
  Child is female (%)                     49.2        49.0        49.2
  Child's age                             33.1        Bo.0        32.9
  Child is married (%)                    Sort.       92.9        99.8

a Number of individual adult children = 22,517
b Number of individual adult children = 5,131
c Number of individual adult children = 19,536


father only. In the analysis of blended families, these indicators are limited to the 
first three categories. The distribution of adult children by their relationship to 
their parents in our three samples is presented in Table 1. 


Parental Household Characteristics 


Prior evidence suggests that a variety of socio-demographic and economic char- 
acteristics of parental households influences transfers to children. We use the 
race variable available in the HRS, which distinguishes among three categories:
white (78%), black (18%) and other (4%). In addition, we include an indicator
that is equal to one if the family respondent is Hispanic (12%) and equal to zero
otherwise. We also include a continuous measure of the respondent's age and a
categorical education variable, which is coded as (1) less than high school, (2) a
high school degree, (3) some college, and (4) a college degree or more. To take
into account the older parent household’s ability to give, we include measures of
the natural log of the value of the household’s housing assets, the household’s
non-housing financial assets, and the household’s annual income. In addition, because
parental household resources may be distributed across all children affiliated with
each household, we include measures of the number of biological siblings,
step-siblings and half-siblings with respect to each adult child. Finally, because having
siblings still residing at home may also influence whether transfers are made to
non-residential adult children, all our regression models contain a dummy variable
indicating the presence of any siblings living in their parents’ home. A third of our
sample has one or more siblings residing with their parents.

Not surprisingly, parental household characteristics of blended families differ 
from our full pooled cross-sectional sample. On average, blended families have
lower levels of housing and financial assets, but higher current income. Mothers 
in blended families are also younger. 


Adult Child Characteristics 

Because the adult child’s own characteristics may also influence transfers to him 
or her, we include the following measures: an indicator equal to one if the adult 
child is female, zero otherwise; an indicator equal to one if the child is married, 
zero otherwise; and a continuous variable measuring the child’s age. Unfortunately, 
information on adult children’s wealth and income status is poorly reported and, 
thus, is not included in our models. 


Results 


Table 2 reports the odds ratios from logistic regressions of transfers to adult children 
on the parent-child relationship alone (Model 1), the parent-child relationship and 
the parent’s characteristics (Model 2), and the Model 2 variables plus the adult child’s 
characteristics (Model 3). All models include indicators for each survey year. The first 
model confirms our expectation that, in general, transfers from biological mothers’ 
and fathers’ households are lower following divorce and/or remarriage than from 
two biological parents who remain married to one another. Indeed, in the case of 
a remarried biological father (i.e., a parental household with a biological father and 
stepmother), the odds that the adult child receives a transfer are reduced by half. All 
of the reductions in the odds of transfers are significantly different relative to married 
biological parents, except for single divorced fathers. This result likely reflects the 
selectivity of single divorced fathers who maintain relationships with adult children. 

Table 2: Effects of Socio-Demographic Characteristics on Odds of Adult Children
Receiving Financial Support

                                        Model 1      Model 2      Model 3
                                        Odds Ratio   Odds Ratio   Odds Ratio
Parent-Child Relationship (two biological = reference) 


Biological mother with stepfather 1978 Toe os 
Biological father with stepmother Mors 36° SOT 
Biological single mother 613 Oe Seta 16*** 
Biological single father 89 1.08 85+ 
Parent's characteristics 
Parent's age Soe 1.00 
Race (white = reference) 

Black .99 .95 
Other 1.12 1.05 
Hispanic (non-Hispanic = reference) a 68 

Education (less than high school = reference) 
High school diploma 1.54*** 140 
Some college 214° TGF <a 
College and above 3.44*** 24255 
Total housing assets (log) 1.01** 1.025 
Total financial assets (log) 1.04*** 1.0575 
Total income (log) remy Te 
Sibling Composition of Adult Child . 
Number of biological siblings Ea 18 
Number of step-siblings .96 .94* 
Number of half-siblings 84" .84*** 
Any sibling living at home (none = reference) too i a ae 
Adult Child's Characteristics 
Female (male = reference) 1406 
Child's age Q3-* 
Married (not married = reference) ‘Oe s 
Year Indicators (1992 = reference) 
1994 1.06* 13m 1.0659) 
1996 1:59=* Wie. 2.06"; 
1998 1.10* lon 1.64*** 
2000 1.00 1.48*** 1:66" 
2002 .88** 7.40" 162" 
Pseudo R? 01 11 15 


a difference between biological mother/stepfather and biological father/stepmother
significant at p < .05
b difference between biological mother/stepfather and biological single mothers
significant at p < .05
c difference between biological father/stepmother and biological single father
significant at p < .05
d difference between biological single mother and biological single father significant at p < .05
+p < .10 *p < .05 **p < .01 ***p < .001

While our findings are consistent with the literature on the impact of divorce,
we are mainly interested in whether remarriage following divorce differentially 
affects transfers to adult children according to the sex of the biological parent. 
For biological fathers, Model 1 shows that, without any income or other controls, 
divorced fathers are only moderately less likely to make transfers to their adult 
children while they remain single. If divorced biological fathers are remarried, 
however, the odds of providing financial support to their adult children drop
dramatically (from .89 to .49). For biological mothers, being single is associated 
with providing far less financial support to their adult children, but getting remar- 
ried actually increases the odds that they provide financial support (from .61 to 
.79). Thus, we find that divorced single fathers are more likely than divorced single 
mothers to make transfers to their adult biological children, most likely because 
men fare better financially than women after divorce, as well as because of the se- 
lectivity of the single fathers. However, remarried fathers are significantly less likely 
than remarried mothers to help financially support their biological adult children. 

To help take into account the ability of parental households to make transfers as 
well as the total number of children to whom the household might make transfers, 
Model 2 adds controls for the characteristics of the family respondent and parental 
household, including all children associated with the parental household. In the case 
of single divorced mothers — those whose financial circumstances are most likely to 
limit their ability to make transfers — there is no longer a significant difference in 
the odds of a transfer relative to remarried mothers. By contrast, the inclusion of 
variables controlling for the parental household's ability to pay actually reduces the 
odds that an adult child will receive a transfer from a biological father-stepmother 
household (from OR = .49 without controls to OR = .36 with controls), suggesting 
that these households give little relative to their ability to pay. In Model 3, which 
includes characteristics of the adult child as well as characteristics of the parental 
household, the basic pattern established in the bivariate analysis remains. Single di- 
vorced fathers, however, are now marginally significantly less likely to make transfers 
relative to households with two married biological parents (p < .10). In addition, 
key gender differences remain important: remarried biological fathers are less likely 
to give than those who remain single, and remarried biological mothers are signifi- 
cantly more likely than remarried biological fathers to help their biological children. 
Indeed, across all three models, biological father/ stepmother families stand out for 
being least likely to make contributions to adult children. 

For ease of interpretation, Figure 1 displays the probability that an adult child 
receives any transfer over $500 according to the parent-adult child relationship 
type in 1998. The first set of percentages refers to the probability of receiving a 
transfer without any additional controls (Model 1), while the second set of per- 
centages are calculated holding parental and adult-child characteristics constant at 
their mean or modal values (Model 3).⁷ Our results, without any controls, reveal
very different implications of remarriage for mothers and fathers. While remarried
mothers are more likely to give to their biological children than single divorced
mothers (21% vs. 17%), remarried fathers are 10 percentage points less likely to
make a transfer to their biological children than single divorced fathers (14% vs.
24%). Holding parental wealth, number of other siblings and adult-child characteristics
constant, we find that, on average, divorce is associated with a decrease in
the probability of receiving any transfers from both single biological mothers (a 4
percentage point drop) and single biological fathers (a 2 percentage point drop). 
However, controlling for other factors we find that divorced single mothers are 
now equally likely to make a transfer as remarried mothers. In comparison, while 
the gap between single divorced fathers and remarried fathers shrinks somewhat 
(from 10 percentage points to 7 percentage points), it remains large and significant. 
Moreover, this change is largely attributable to a reduction in the probability that
single fathers provide transfers after accounting for the characteristics of the adult 
child such as their age and marital status. 
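
The link between an odds ratio and a predicted probability can be made concrete with a short calculation. The sketch below converts the unadjusted odds ratios quoted in the text (.79, .49, .61, .89) into probabilities. The 25 percent probability for children of two married biological parents is an assumption chosen for illustration; it is not a figure reported in the text, and the function name is hypothetical.

```python
def probability_from_odds_ratio(odds_ratio, reference_probability):
    """Convert an odds ratio into a probability, given the reference
    group's probability of the outcome."""
    if not 0 < reference_probability < 1:
        raise ValueError("reference_probability must be strictly between 0 and 1")
    reference_odds = reference_probability / (1 - reference_probability)
    odds = odds_ratio * reference_odds
    return odds / (1 + odds)


# Unadjusted odds ratios quoted in the text, relative to two
# married biological parents.
MODEL_1_ODDS_RATIOS = {
    "bio_mother_stepfather": 0.79,
    "bio_father_stepmother": 0.49,
    "single_bio_mother": 0.61,
    "single_bio_father": 0.89,
}

# Assumed for illustration only; not reported in the text.
ASSUMED_REFERENCE_PROBABILITY = 0.25

implied_percent = {
    name: round(100 * probability_from_odds_ratio(ratio, ASSUMED_REFERENCE_PROBABILITY))
    for name, ratio in MODEL_1_ODDS_RATIOS.items()
}
```

Under this assumed reference probability the implied values are roughly 21, 14, 17 and 23 percent, close to the unadjusted percentages quoted above. Exact agreement is not expected, because the reported figures refer to a specific survey year.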

Model 3 also reveals important associations between several other explanatory
variables and transfers to adult children. Like Furstenberg, Hoffman and Shrestha
(1995), we find that once income and other family characteristics are taken into 
account, transfers are no less likely from black parental households than from 
white households, but the odds of transfers are significantly lower from Hispanic 
than from non-Hispanic households. After taking into account the age of the 
adult child, parental age is no longer associated with transfers. We note, however, 
that our sample consists of relatively young parents with few over the age of 70. 
Because previous research finds that parental transfers generally do not taper off 
until after 70 (Cooney and Uhlenberg 1992), it is likely that parental age would be 
negatively associated with transfers if we had an older sample of parents. Parents’ 
education and financial resources both play a major role: the odds of a transfer are 
almost three times higher from a college-educated parent than from a parent with 
less than a high school degree, and transfers increase with the parents’ housing 
wealth, non-housing financial wealth and prior year’s income. Contrary to previ- 
ous studies which find no difference by the sex of the adult child (Cooney and 
Uhlenberg 1992; Furstenberg et al. 1995), we find that women are significantly 
more likely than men to receive transfers. At any given age, women are less likely 
to live in their parents’ home than men. Consequently, parents may be more
willing to provide some financial support to non-residential daughters to partially 
compensate them for living independently. Transfers also decrease with children’s 
age and are lower to married children than to unmarried children. Finally, adult 
siblings appear to compete with each other for resources from older parents, with
full biological siblings having the strongest negative effect. Parents who have one 
or more children currently residing at home appear to be somewhat more likely 
to provide transfers to their non-residential children. 
Does the sex of the family respondent or whether he or she is a biological or 
stepparent influence whether he or she reports transfers to adult children? Are 

Table 3: Percentage of Adult Children Receiving Help by Parent-Child Relationship
and Gender of Family Respondent

                                  Female Respondent   Male Respondent   Significance
Two biological parents            20.1 (50,353)       24.2 (764)        ‘
Biological mother & stepfather    16.6 (2,968)        21.6 (342)        a
Biological father & stepmother    11.1 (9,883)        18.1 (492)
Single biological parent          12.5 (10,990)       19.1 (4,210)      =

Note: Numbers in bold in the original (biological mother & stepfather, male
respondent; biological father & stepmother, female respondent) indicate reported by stepparent
Significance tests for gender differences within each family type.
+p < .10 *p < .05 **p < .01 ***p < .001


stepmothers systematically different from other women in ways that can account 
for the differences in transfers reported in Table 2? Are there other unobserved 
individual characteristics of parents in different relationship types that are driving 
these findings? Taking these questions in order, Table 3 reports the percentage of 
adult children receiving help by relationship to parental household and sex of family
respondent. The key difference shown, within every category of family type, is
that male respondents are more likely to report transfers than female respondents, 
in some cases by fairly large margins. Perhaps providing financial help to adult 
children is a household spending domain controlled or managed primarily by men, 
and women are less likely to be aware that children have received help. Alternatively, 
given the overall preference for female family respondents, the men who end up as 
family respondents in the HRS may be an unusually involved or generous group. 
Surprisingly, we find no evidence that stepparents (indicated in bold in Table 3)
systematically underreport transfers. For example, in biological mother and stepfather
families, we find that stepfathers report a higher probability of giving adult
children financial help than do biological mothers in these households. 

In Table 4, we limit our sample to blended families reported by female family 
respondents. These female family respondents are reporting on adult children who 
are their own biological child with their current partner (two biological parents), 
their own biological child but not that of their current partner (biological mother 
with stepfather), or their current partner's biological child (biological father with 
stepmother). Because these are blended families, all of these women are reporting 
on children in more than one of these categories. Thus, differences cannot be 
attributed to differences in the selection of the family respondent. Consistent 
with our full sample in Table 2, the odds of a transfer to an adult child who is
biologically related to the father but not to the mother are less than half those of
an adult child who is biologically related to both parents (OR = .45). Being
biologically related to the mother, but not the father, also diminishes the odds 
that an adult child receives a transfer relative to adult children biologically related 
to both parents, but the negative effect of having a stepfather is much less than 
that of a stepmother. Specifically, holding all other variables at their mean and 

Table 4: Effects of Parent-Child Relationship and Other Socio-Demographic
Characteristics on the Odds of Adult Children Receiving Any Financial Support
(Blended Families 1992-2002)

                                                        Odds Ratio
Parent-Child Relationship (two biological = reference) 
Biological mother with stepfather FT 
Biological father with stepmother Ab 
Parent's Characteristics 
Parent's age 1.02+ 
Race (white = reference) 
Black Wee 
Other 84 
Hispanic (non-Hispanic = reference) .61* 
Education (less than high school = reference) 
High school diploma 1.45* 
Some college 1.874 
College and above Pocus 
Total housing assets (log) 1.01 
Total financial assets (log) 103% 
Total income (log) 1.06+ 
Sibling Composition of Adult Child 
Number of biological siblings ‘Bo. 8 
Number of step-siblings 1.03 
Number of half-siblings OTe 
Any sibling living at home (none = reference) 1.02 
Adult Child's Characteristics 
Female (male = reference) 1.05 
Child's age lag 
Married (not married = reference) shoe 
Year Indicators (1992 = reference) 
1994 1.40** 
1996 2G 
1998 2.00*** 
2000 5 Oe 
2002 ee 
area eeeenerremee 
N = 13,377


a difference between biological mother/stepfather and biological father/stepmother
households significant at p < .05
+p < .10 *p < .05 **p < .01 ***p < .001


modal values in this sample, we find that the probability of receiving a transfer is
20.1 percent for children who are biologically related to both parents, 15.1 percent
if the child is biologically related only to the mother and 10.2 percent if the child 
is biologically related only to the father. These results are particularly important 
because they show that it is not differences across stepmothers and biological 
mothers that account for the findings in Table 2. Instead, the same woman reports 
that significantly higher levels of transfers are directed toward her biological children
than toward her stepchildren.

Table 5: Effects of Parent-Child Relationship and Other Socio-Demographic
Characteristics on the Odds of Adult Children Receiving Any Financial Support
(Biological Parent Respondents 1992-2002)

                                                        Odds Ratio

Parent-Child Relationship (two biological = reference) 
Biological mother with stepfather 


Biological father with stepmother on 

Biological single mother Do" 

Biological single father 89 
Parent's Characteristics 

Total housing assets (log) .99 

Total financial assets (log) 1.01* 

Total income (log) ws 
Sibling Composition of Adult Child 

Number of biological siblings oo 

Number of step-siblings OL 

Number of half-siblings or 

Any sibling living at home (none = reference) .99 
Adult Child's Characteristics 

Married (not married = reference) 03” 
LR Chi-2 162.2***
N = 24,282e


a difference between biological mother/stepfather & biological father/stepmother
significant at p < .05
b difference between biological mother/stepfather & biological single mothers
significant at p < .05
c difference between biological father/stepmother & biological single father
significant at p < .05
d difference between biological single mother & biological single father
significant at p < .05
e 24,282 observations on 5,480 individuals
+p < .10 *p < .05 **p < .01 ***p < .001


In our last set of analyses presented in Table 5, we limit our sample to family 
respondents who are biologically related to the adult child to avoid potential 
biases in the reporting of transfers to stepchildren. Because we are now assured 
that the same family respondent is interviewed across the different waves, we 
can employ fixed-effects logistic regressions. Fixed-effects regression techniques 
can help minimize concerns of selectivity bias as a result of unobserved, constant 
characteristics. This technique also helps to redress the selectivity bias inherent in 
the selection of male family respondents. In this analysis, the overall pattern of 
the effect of divorce and remarriage on transfers to adult children is similar to that 
found in Model 3 of Table 2. In particular, we find that remarried mothers tend to 
give more transfers to their biological children than do remarried fathers. In addi- 
tion, remarried fathers give significantly less than single fathers to their biological 
children, whereas after controlling for parents’ and children’s characteristics, single
mothers give less — but not significantly less — than remarried mothers. The major 
difference is that neither remarried mothers nor single fathers are significantly 
less likely to give financial support to their biological children relative to parental 
households with two biological parents. 


Discussion and Conclusions 


We are all familiar with the Brothers Grimm and Disney versions of Cinderella 
which depict the archetype of a “wicked stepmother.” Yet it is interesting to note 
that in the 1817 opera of Cinderella by Gioacchino Rossini, it is a “cruel stepfa- 
ther” rather than a “wicked stepmother” who forbids Cinderella to attend the ball. 
There is consistent empirical evidence supporting both scenarios, and in general
both mothers and fathers tend to favor their biological children over their stepchildren.
Our findings indicate, however, that the more commercially popular interpretation of
this tale is the more common one, but the full implications of these findings are
important to consider. While it appears that women discriminate more strongly
against their stepchildren, conversely, they also favor and support their biological 
children. Furthermore, while men are sometimes accused of “abandoning” their 
biological children from a previous relationship, they provide greater support for 
stepchildren. In the absence of any agreed-upon norms about appropriate levels 
of support and obligation between stepparents and stepchildren, it is difficult to 
assign labels “wicked” and “cruel” to their different behaviors. Instead, it appears 
as if both mothers and fathers are acting broadly in accordance with what would 
be expected from both socio-evolutionary and exchange theories. 

In particular, with respect to socio-evolutionary theory, we find that men’s be- 
haviors are consistent with a “mating strategy,” while women are more likely to 
follow a “parental investment strategy.” Our findings suggest that men’s “mating 
strategy” (i.e., supporting a woman's previous biological children in the hopes 
of having more children together with her) may create a set of expectations and 
behaviors that continue—even after the woman is no longer of reproductive age. 
Consistent with Andersen et al. (2001), we find that while stepfathers may to 
some extent compete for mothers’ attention and resources, compared to chil- 
dren with single divorced mothers, stepfathers appear to augment the amount 
of money mothers give to their biological children. Much less is known about 
stepmothers’ willingness to support the biological children of their husbands. It 
appears, however, that women's dominant parental investment strategy provides 
relatively little motivation for investing in stepchildren either at young or at 
older ages. Our results on blended families are highly complementary to the 
findings by Case et al. (2001) with respect to stepmothers’ treatment toward 
younger stepchildren living at home. Stepmothers with both adult biological
children and stepchildren report being only half as likely to give to their stepchildren
as to their and their current husbands’ biological children. Even if stepmothers
were simply unaware of transfers to their stepchildren, it would nonetheless be a
meaningful finding that they believe and willingly report such large inequalities.

Considerable research, drawing on theories of exchange, finds that moth- 
ers’ stronger ties to their adult biological children facilitate far greater upward 
transmissions of emotional support and contact with mothers than with fathers 
(Kalmijn 2007). Building on these studies, we find that women may not only 
influence upward transfers, but also direct downward cash flows. As the period of 
“adultolescence” creeps upward and as expectations about financial support from 
older parents grow, the total magnitude of transfers, particularly from mothers, 
may increase over time. The extent to which mothers will be able to bear these
increased financial demands, and the strain such demands may place on their
relationships with their adult children, are beyond the scope of this article but
remain important questions to consider.


Implications


If there has been a “matrilineal shift” in intergenerational transfers of money, (1) what
is the magnitude of these shifts, (2) is this tilt likely to become steeper in the future,
and (3) what are the implications for intergenerational transmission of inequality?

In general, we find that single divorced fathers are more likely to make transfers 
than single divorced mothers, while remarried fathers give much less than remarried
mothers. To examine the overall magnitude of these differences on intergen-
erational transfers, it is important to take into account the different distributions 
of men and women in different marital states. In particular, men are more likely 
to remarry than women and as a result there are fewer single fathers than single 
mothers. Using a sample of ever-married individuals with children from the 1992 
HRS cohort, we find that following a divorce 76.9 percent of fathers are currently 
remarried compared to 62.5 percent of mothers. Conversely, 23.1 percent of ever-divorced
fathers are currently single, while 37.5 percent of ever-divorced mothers are single.
Combining these numbers with our estimated probabilities of giving a financial 
transfer (see Figure 1, Model 1), we can make a “back of the envelope calcula- 
tion” about the expected probabilities that money flows along fathers’ or mothers’ 
biological lines following divorce. We estimate that overall 21.4 percent of ever- 
divorced mothers gave a financial transfer to their biological children over the past 
two years, while only 16.4 percent of ever-divorced fathers made similar transfers. 
These estimates reflect the differences in the probability of transfers without adjust- 
ing for differences in mothers’ and fathers’ ability to provide assistance. 
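
The back-of-the-envelope calculation described above amounts to a weighted average: the probability of a transfer in each marital state, weighted by the share of ever-divorced parents in that state. The sketch below illustrates only the arithmetic. The remarriage shares are the ones reported in the text; the conditional transfer probabilities are hypothetical placeholders (the Figure 1 estimates are not reproduced here), so the printed values will not match the 21.4 and 16.4 percent figures.

```python
def overall_transfer_probability(share_remarried, p_if_remarried, p_if_single):
    """Weight the two conditional transfer probabilities by marital-state shares."""
    share_single = 1.0 - share_remarried
    return share_remarried * p_if_remarried + share_single * p_if_single


# Shares of ever-divorced parents who are currently remarried (reported in the text).
SHARE_REMARRIED = {"fathers": 0.769, "mothers": 0.625}

# HYPOTHETICAL probabilities of giving a transfer, by marital state.
# These are placeholders for illustration, not the Figure 1 estimates.
P_TRANSFER = {
    "fathers": {"remarried": 0.15, "single": 0.25},
    "mothers": {"remarried": 0.24, "single": 0.18},
}

overall = {
    parent: overall_transfer_probability(
        SHARE_REMARRIED[parent],
        P_TRANSFER[parent]["remarried"],
        P_TRANSFER[parent]["single"],
    )
    for parent in SHARE_REMARRIED
}

for parent, probability in overall.items():
    print(f"{parent}: {probability:.1%}")
```

Because fathers are concentrated in the remarried state, where their assumed probability of giving is lowest, their weighted average falls below that of mothers even though single fathers are assumed to give more often than single mothers.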

Over the long term, it is unclear whether the “matrilineal tilt” is likely to get 
stronger or weaker as norms about stepparent and stepchild obligations and rela- 
tionships become more established. On the one hand, by taking a more active role 
in their young children’s lives, fathers may establish closer and stronger ties that 
last into their children’s adulthood, thereby potentially reducing the matrilineal 
tilt. On the other hand, over recent decades women have become more likely to 
participate in the labor force and exert greater influence over household financial
resources. Thus, we would expect that the financial resources available to the next 
cohort of single divorced mothers would be closer to those of single divorced 
fathers. In our analyses, when we hold the financial resources constant, we find 
that the gap between the probability that single fathers make transfers relative to 
single mothers falls from 7 percentage points to 2 percentage points. 

Finally, if the matrilineal tilt is a consequence of divorce and remarriage, then 
we would expect a stronger tilt among populations with relatively high rates of 
divorce and remarriage. In the United States, approximately 70 percent of mar- 
riages among blacks are expected to end in divorce within 30 years of marriage, 
compared to 47 percent of marriages among whites (Raley and Bumpass 2003). 
In addition, only 49 percent of non-Hispanic black women will remarry within 
10 years of divorcing, compared to 79 percent of non-Hispanic white women 
(Bramlett and Mosher 2002). More generally, among all racial and ethnic groups, 
divorce is higher at lower levels of socioeconomic status. Overall, children with 
divorced parents receive less money from both their biological mothers and fathers 
than children without divorced parents. Moreover, if their biological mothers do 
not remarry, they are unlikely to receive any compensating increase in the amount 
of support they receive. As a result, adult children in population subgroups ex- 
hibiting high divorce and low remarriage are likely to lose some financial support 
from their biological fathers without receiving a boost in support from a new 
stepfather. Thus, our current pattern of highly gendered intergenerational wealth 
flows may further exacerbate the transmission of intergenerational inequality. 

The full implications of these findings are difficult to estimate given some of our 
data limitations. Specifically, our data are limited to inter vivos transfers of as little as
$500. While such transfers are important, in order to fully measure the extent of 
the “matrilineal tilt,” future research on the magnitude of inheritance from divorced 
and remarried biological mothers and fathers would be particularly interesting. 
Moreover, while researchers have begun to pay more attention to divorced fathers, 
we are still lacking a representative picture of the relationships between divorced 
fathers and their adult biological children, rendering predictions about future trends 
difficult. Nonetheless, our findings draw attention to a relatively unnoticed, but 
potentially important, trend: the change in intergenerational wealth flows along 
biological gender lines. While additional studies are needed both to confirm our 
findings and to estimate the extent of these wealth flows through men and women, 
these initial findings are provocative and warrant further investigation. 


Notes 
1. The HRS does not distinguish between biological and adopted children. Therefore,
some of the children we refer to as biological may have been adopted.


2. Additional analyses (available at http://www.mcgill.ca/sociology/faculty/clark/) that 
restrict our sample to one adult child per parental household, thereby avoiding 
potential biases associated with clustering within parental households, yield similar
results with the exception that stepfather families are even (slightly) more likely to 
make transfers to their stepchildren. 


3. Similar analyses of male respondents reporting both biological and stepchildren were 
not conducted because the sample size was too small and potentially biased. 


4. Our sample consists of the adult children of HRS respondents, rather than respondents 
themselves. Weights are used in Table 2 to account for the over-representation of large 
families. 


5. The HRS asked about children of both spouses and partners residing in the household. 
Thus, cohabiting step-relationships are included. 


6. In 1992, the question asked whether parents had made any transfers in the past year. 
Consequently, the percentage of households who have transfers is lower in 1992 than 
in any other year. 


7. All estimates are for white, non-Hispanic parents aged 57.8 with high school diplomas 
earning average incomes and holding average levels of assets. Adult children’s characteristics 
are fixed to married males age 33.1, who have two non-residential full siblings. 


The Skinny on Success: Body Mass, Gender and
Occupational Standing Across the Life Course 


Christy M. Glass, Utah State University 
Steven A. Haas, Arizona State University 
Eric N. Reither, Utah State University 


Several studies have analyzed the impact of obesity on occupational standing. This study 
extends previous research by estimating the influence of body mass on occupational at- 
tainment over three decades of the career using data from the Wisconsin Longitudinal 
Study. In a series of covariance structure analyses, we considered three mechanisms that 
may alter the career trajectories of heavy individuals: (1) employment-based discrimination,
(2) educational attainment, and (3) marriage market processes. Unlike previous
studies, we found limited evidence that employment-based discrimination impaired 
the career trajectories of either men or women. Instead, we found that heavy women 
received less post-secondary schooling than their thinner peers, which in turn adversely 
affected their occupational standing at each point in their careers. 





Introduction 


In recent years, social scientists have devoted much effort to documenting the 
effects of body mass on socioeconomic status and career outcomes (Averett and 
Korenman 1996; Baum and Ford 2004; Cawley 2004; Cawley, Grabka and Lillard 
2005; Conley and Glauber 2005; Gortmaker et al. 1993; Haskins and Ransford 
1999; Mitra 2001; Pagan and Davila 1997). Most of these studies have relied on 
cross-sectional data to examine the effects of elevated body mass on wages, rather 
than tracing the cumulative effects of weight on occupational trajectories over 
the life course. Furthermore, most cross-sectional studies lack a baseline measure 
of body mass that precedes career entry. As a result, many such investigations are 
unable to disentangle the precise nature of the relationship between body mass 
and labor market outcomes. 

Several longitudinal analyses that trace the effects of adolescent body mass on 
career outcomes rely on the National Longitudinal Survey of Youth (NLSY; Averett and
Korenman 1996; Baum and Ford 2004; Cawley 2004; Gortmaker et al. 1993; 
Register and Williams 1990). Unfortunately, NLSY data only permit analyses of 
individuals during the earliest career stages, tracing careers up to seven years
following career entry. One important exception is the recent analysis by Conley and
Glauber (2005) that uses the Panel Study of Income Dynamics to track the influence
of obesity on career outcomes up to 15 years following career entry. Consistent with 
previous findings (Averett and Korenman 1996; Baum and Ford 2004; Cawley 
2004; Pagan and Davila 1997), Conley and Glauber find that elevated body mass 
is associated with reductions in women’s wages, family income and the probability 
of marriage. Although Conley and Glauber provide a useful extension of previous 
work, we argue that the first 15 years of an individual's career is an inadequate time 
frame in which to trace the effects of body mass on occupational trajectories. 


Body Mass & Occupational Standing across the Life Course 


Understanding the full impact of body mass on occupational standing requires a 
life course analysis that follows individuals from career entry through later career 
stages. We know a great deal about the effects of adolescent body mass on adult 
health and mortality (Bjorge et al. 2008; Must et al. 1992). By comparison, we 
know very little about how body mass affects career trajectories over the life course. 
Consistent with a life course perspective, we define careers as age variations in 
occupational status that individuals experience during their work lives (Brueckner 
2004; Sorensen 1974). A life course perspective permits a comprehensive view of 
how inequalities accumulate over time and a better understanding of the mech- 
anisms by which inequalities are produced and/or reduced as people progress 
through their careers. 

Why might individual career trajectories vary according to adolescent body 
mass and why might body mass impact those trajectories differently in the early-, 
mid- and late-career stages? At least three distinct mechanisms may influence 
career trajectories over the life course: (1) employment-based discrimination, (2)
educational attainment, and (3) marriage market processes.¹ In the United States,
body mass is strongly and inversely associated with socioeconomic status (Averett 
and Korenman 1996; Baum and Ford 2004; Cawley 2004; Cawley, Grabka and 
Lillard 2005), particularly among women (Register and Williams 1990). To ex- 
plain how elevated body mass might result in lower SES, recent scholarship has 
sought to document the labor market and marriage market penalties faced by 
heavy individuals and how these penalties vary considerably by gender. 


Employment-Based Discrimination 


Previous research suggests that heavy women tend to earn less than non-heavy 
women in both professional and blue collar occupations (Conley and Glauber 
2005; Mitra 2001; Pagan and Davila 1997; Register and Williams 1990). 
Compared to their non-heavy peers, heavy women also obtain less education 
(Gortmaker et al. 1993) and experience more exclusion from managerial positions 
(Haskins and Ransford 1999; Pagan and Davila 1997). More recent work shows
that young white women face especially harsh penalties for elevated body mass
(Cawley 2000, 2004; Conley and Glauber 2005).

These findings suggest that heavy women may face employer discrimination,
particularly early in their careers. A large body of scholarship suggests that physical 
attractiveness is strongly associated with higher levels of career success, particu- 
larly among young women (Biddle and Hamermesh 1998; Bosman, Pfann and 
Hamermesh 2006; French 2002; Frieze et al. 1991; Hamermesh and Biddle 1994; 
Hatfield and Sprecher 1986; Umberson and Hughes 1987). Employers tend to 
attribute a variety of positive social characteristics to attractive female employees 
including cooperativeness, intelligence and competence (Feingold 1992; Jackson 
1992; Jackson et al. 1995). 

Research on the positive effects of physical attractiveness on occupational attainment
has found that the so-called beauty myth² applies primarily to younger
women due to gender-specific socio-cultural preferences and expectations of em- 
ployers as well as co-workers (Jackson 1992; Wolf 1991). Indeed, unlike attractive 
women, attractive men are only slightly (if at all) advantaged in terms of career 
attainment (French 2002; Frieze et al. 1991). 

Body mass is one of the primary determinants of evaluations of female at- 
tractiveness (Swami and Tovée 2005a; Tovée and Cornelissen 1999; Tovée et al. 
1999). Therefore, employers who reward physical attractiveness may be more likely 
to discriminate against heavy women due to inferences about ability. Research 
suggests that employers tend to evaluate larger women as less productive and 
desirable employees. Specifically, employers tend to see heavy women as lacking 
in self-discipline and professionalism and as having lower supervisory potential 
(Puhl and Brownell 2001). Additional studies suggest that employers attribute 
a variety of negative characteristics to heavier women including laziness, mental 
incompetence and emotional instability (see Puhl and Brownell 2001 for a review 
of this literature). While the current analysis does not directly measure employer 
discrimination, we expect to observe a residual direct effect of body mass on oc- 
cupational standing after controlling for other attainment-related factors. 

Most investigations of body mass and men’s labor market outcomes have 
concluded that body mass only minimally or even positively affects men’s labor 
market status. For example, some studies have concluded that body mass has 
little or no effect on men’s wages (Gortmaker et al. 1993; Pagan and Davila 1997; 
Register and Williams 1990; Sargent and Blanchflower 1994), while others have 
found a significant positive effect of body mass on men’s earnings (Morris 2006). 
Furthermore, recent studies have found that heavy men are actually more likely 
than non-heavy men to be in the labor force (Sousa 2005), and that body mass 
has little or no effect on men’s household earnings and poverty rates (Gortmaker et 
al. 1993). Indeed, body mass is a relatively weak predictor of assessments of men's 
physical attractiveness (Swami and Tovée 2005b), thus reducing the probability 
that body mass will have a negative impact on men’s career standing over the life 
course. Therefore, we predict the following: 

Hypothesis 1a: Heavy women will experience lower
occupational attainment than non-heavy women, 
particularly during the early career stages. 


Hypothesis 1b: Body mass will not directly affect occupational 
attainment among men at any point in the career. 


Educational Attainment 


In addition to discrimination, adolescent body mass may affect occupational 
attainment through educational attainment and human capital accumulation. 
Characteristics that affect academic and occupational achievement, such as socio- 
economic background, ability and aspirations, may also be associated with weight- 
related differences in adolescence. Thus, observed weight-related occupational 
differences may be due to prior causes. 

Conversely, net of these factors, overweight adolescents may face greater social 
stigma and isolation from peers and educators and, as a result, feel marginalized 
from educational institutions (Crosnoe and Muller 2004). In fact, some research 
suggests that the social stigma associated with high body mass is stronger dur- 
ing adolescence than at other times during the life course (Halpern et al. 1998; 
Quinn and Crocker 1999). In their study of body mass and academic achievement, 
Crosnoe and Muller (2004) found that in contexts where obesity is likely to lead 
to negative peer assessments, body mass is strongly and negatively associated 
with achievement. This finding suggests that the stigma associated with being 
overweight can affect academic functioning and, therefore, achievement over the 
life course. If overweight adolescents are less likely to excel in high school or to 
obtain post-secondary schooling, their subsequent occupational attainment might 
suffer. Presuming that elevated body mass reduces years of completed schooling, 
heavy individuals could actually see early career advantages associated with expe- 
rience gained via early labor market entry, while their thinner peers forgo labor 
market entry for additional schooling. However, over the long term, overweight 
adolescents are likely to face substantial occupational deficits as their more highly 
educated peers enter the labor market and ultimately surpass them. 

Evidence for the association between body mass and academic achievement is 
mixed (Chandola, Deary, Blane and Batty 2006). Analyses of the Early Childhood
Longitudinal Study suggest little difference in academic performance between normal 
weight and overweight children (Datar, Sturm and Magnabosco 2004). However, 
other research suggests that overweight adolescents are less likely than their thinner 
peers to earn a college degree, completing approximately one year less of schooling, on
average (Gortmaker et al. 1993). Given these findings we predict the following:


Hypothesis 2a: Heavy adolescents will complete fewer years 
of formal schooling than non-heavy adolescents. 

Hypothesis 2b: Heavy adolescents will have lower
occupational attainment than non-heavy adolescents at 
each stage of the career, via educational disadvantages. 


Marriage Market Processes 


Scholars have documented ways that body mass affects the likelihood of success 
in the marriage market (Averett and Korenman 1999; Conley and Glauber 2005; 
Gortmaker et al. 1993). The impact of body mass on marriage market processes 
also varies by gender. While previous research has described the influence of body 
mass on marriage and labor market outcomes for men and women, it has tended 
to treat these processes as discrete. Most research in this area has sought to docu- 
ment the additive negative effects of marriage and labor market failures for heavy 
women by specifying differentials in individual earnings and household income 
between heavy and non-heavy individuals. Few studies have analyzed the mecha- 
nisms by which marriage market processes directly or indirectly affect occupa- 
tional standing. This is puzzling given the abundance of research that documents 
the wage penalties women pay as wives and mothers (Budig and England 2001; 
Hersch and Stratton 1997; Korenman and Neumark 1992; Lundberg and Rose 
2000; Neumark and Korenman 1994; Waldfogel 1997) as compared to the wage 
premiums men enjoy as husbands (Allegretto and Arthur 2001; Daniel 1995; 
Gray and Vanderhart 2001; Korenman and Neumark 1991; Loh 1996). Thus, we 
argue that marriage market processes are likely to affect men’s and women’s careers 
in important yet dissimilar ways. 

Previous research suggests that heavy women are less likely to marry, more 
likely to delay marriage and more likely to divorce than non-heavy women 
(Averett and Korenman 1996; Conley and Glauber 2005; Gortmaker et al. 
1993). Elevated body mass also reduces men’s likelihood of marriage, though 
the impact is less than for women. 

Most marriages are characterized by a gender-specific division of labor in which 
women perform a significantly greater proportion of unpaid household labor 
(Hersch and Stratton 1997, 2002; Presser 1994; South and Spitze 1993). Hersch 
and Stratton (1997, 2002) documented the direct negative effects of married 
women’s household labor on earnings. Conversely, men earn significant and robust 
earnings premiums as a result of the same division of labor (Allegretto and Arthur 
2001; Grossbard-Shechtman 2003; Loh 1996). Indeed, Waite and Gallagher 
(2001) argue that according to some estimates, marriage increases men’s earnings 
even more than a college education. Grossbard-Shechtman and Neuman (2003) 
further documented that the earning premium men enjoy grows over the life 
course, increasing every year for the duration of the marriage. 

These findings make clear that marriage market “success” —defined as getting 
married early and staying married —is detrimental to women’s but not men’s la- 
bor market attachment and occupational standing. Taniguchi (1999) found that 
women who delay family formation pay significantly lower wage penalties than
women who bear children earlier. Thus, rather than expecting heavy women to be
doubly penalized, we might expect that “failures” in the marriage market may lead
to greater successes in the labor market, particularly upon career entry. Because
they are more likely to delay family formation, heavy women may experience a
more continuous labor market attachment early in the career, and subsequently invest
more time accumulating job or career-specific training or credentials. Because
marriage market “failures” may advantage heavy women in the early stages of their
careers, we predict the following:


Hypothesis 3a: Heavy women will achieve greater 
occupational attainment than non-heavy women in their 
early career stages as a result of delays in family formation. 


We expect delayed family formation to have the opposite effect for men. If 
marriage (and childbearing) leads to earning premiums for men, then delayed 
family formation could hinder occupational attainment. By delaying or avoiding 
marriage, heavy men may tend to sacrifice the earnings premiums and potential 
promotion possibilities enjoyed by married men. Although body mass may not
affect men’s human capital investments or labor market attachment, lesser success 
in the marriage market could nevertheless have a negative influence on the career 
trajectories of heavy men. We predict the following: 


Hypothesis 3b: Heavy men will experience lower occupational 
attainment than non-heavy men in their early career stages 
as a result of delays in family formation. 


Data and Methods 
The Wisconsin Longitudinal Study 


We test our hypotheses using data from the Wisconsin Longitudinal Study (WLS). The
WLS has unique advantages for testing these hypotheses because it includes a mea- 
sure of adolescent body mass prior to labor market entry, as well as occupational 
data covering over three decades of respondents’ careers. 

The WLS follows a random sample of 10,317 individuals who graduated from 
a public, private or parochial high school in Wisconsin in 1957 (Sewell et al. 
2004). In the initial wave, the WLS collected information on academic abil- 
ity, socioeconomic background, attitudes toward higher education, educational 
and occupational aspirations, and a number of contextual factors (Hauser 2005). 
Subsequent waves in 1964, 1975, 1993 and 2004 collected data on a wide range 
of issues essential to studies of the life course, including educational and oc- 
cupational histories, indicators of socioeconomic status, military service, marital 
status, family characteristics, social participation, psychological well-being, health 
behaviors and health outcomes (Hauser 2005; Sewell et al. 2004). Although the
WLS is not nationally representative, respondents resemble more than two-thirds
of Americans who are now entering retirement age in terms of academic achievement
and ethnic background (Hauser 2005).


Variables and Measurement 


Occupational Standing 

Because women tend to have higher levels of education than men employed in 
the same occupation but lower levels of earnings, analyses of gender differences 
in occupational stratification require indices of specific occupational features or 
direct measures of occupational characteristics (Hauser and Warren 1997; Warren, 
Sheridan and Hauser 1998). Occupational standing has traditionally been mea- 
sured using composite indices of occupational status (e.g., Duncan 1961) that are 
based on a combination of occupational characteristics. Recent work by Hauser 
and Warren (1997) and Warren et al. (1998), however, argues that such global 
measures have important limitations, particularly in analyses of gender differences 
in occupational attainment. Instead, they advocate operationalizing attainment 
in a way that captures gender differences in earnings and education separately. In 
empirical tests of these measures using both the 1994 General Social Survey and 
the 1986-1988 Survey of Income and Program Participation, Warren et al. (1998) 
conclude that composite indices are inadequate when measuring and comparing 
occupational status by gender. 

We estimate the level of occupational attainment in three ways. Occupational 
education is the proportion of incumbents in an occupation that completed one 
or more years of post-secondary education. Occupational income is the pro- 
portion of incumbents in an occupation that earned $25,000 or more per year. 
Occupational SES is measured by the Duncan Socioeconomic Index. We use 
these measures at three time points: respondents’ first job after the completion of 
schooling, and respondents’ current/last jobs in 1975 and in 1993. This analytic 
strategy allows us to trace the effects of gender and body mass, beginning at labor 
market entry and continuing over several decades of the work career.³
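
As a rough illustration of the first two measures, the sketch below computes occupational education and occupational income as simple within-occupation proportions, following the definitions in the preceding paragraph. The record layout, the function name and the toy incumbents are assumptions made for illustration; they are not drawn from the WLS or from the census-based sources behind the published indices.

```python
from collections import defaultdict


def occupational_measures(incumbents, income_threshold=25_000):
    """Return per-occupation proportions for education and income.

    incumbents: iterable of (occupation, years_postsecondary, annual_earnings).
    This tuple layout is a hypothetical one chosen for the example.
    """
    counts = defaultdict(lambda: {"n": 0, "postsecondary": 0, "high_income": 0})
    for occupation, years_postsecondary, annual_earnings in incumbents:
        tally = counts[occupation]
        tally["n"] += 1
        # One or more years of post-secondary education.
        tally["postsecondary"] += int(years_postsecondary >= 1)
        # Earnings of $25,000 or more per year.
        tally["high_income"] += int(annual_earnings >= income_threshold)
    return {
        occupation: {
            "occupational_education": tally["postsecondary"] / tally["n"],
            "occupational_income": tally["high_income"] / tally["n"],
        }
        for occupation, tally in counts.items()
    }


# Toy, made-up incumbents for two occupations.
toy_incumbents = [
    ("teacher", 4, 22_000),
    ("teacher", 5, 27_000),
    ("machinist", 0, 26_000),
    ("machinist", 1, 24_000),
    ("machinist", 0, 30_000),
    ("machinist", 0, 18_000),
]
measures = occupational_measures(toy_incumbents)
```

In this toy example the two occupations have the same occupational income but very different occupational education, which is the kind of divergence a single composite index would obscure.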


Main Predictor Variables 

The two predictor variables of primary interest are gender and adolescent body 
mass. Until very recently, the WLS was limited by the lack of a baseline measure 
of body mass, disallowing the kind of life course analysis pursued here. To account 
for this shortcoming, an 11-point scale was recently developed to code the high 
school senior yearbook photographs of 3,027 randomly selected WLS participants 
for relative body mass (RBM; Reither, Hauser and Swallen 2009). Using separate scales
for boys and girls, a team of six coders assigned RBM scores to WLS participants 
at baseline in 1957. For each of the 3,027 yearbook photographs, each of the six 
coders recorded an RBM scale score ranging from 1 to 11. RBM scores were combined
to form the standardized relative body mass index (SRBMI). SRBMI was calculated
separately for male and female photos by (1) generating coder-specific z scores,
(2) summing z scores across coders, and (3) dividing the sum of z scores by the
number of coders in the study. That is, for an individual WLS participant,


$$\mathrm{SRBMI}_{jk} = \frac{1}{n} \sum_{i=1}^{n} \frac{x_{ijk} - \bar{x}_{ik}}{s_{ik}}$$

where $i$ is an individual coder, $n$ is the number of coders in the study, $j$ is one of the
3,027 WLS participants, $k$ is the participant's gender and $x_{ijk}$ is the RBM
scale score assigned by coder $i$ to participant $j$ of gender $k$, with mean $\bar{x}_{ik}$ and standard
deviation $s_{ik}$. The RBM scores were standardized in this way to eliminate differences
among coders in the location (mean) and variance of the scores that they assigned.
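
The three steps above can be sketched as follows. This is a minimal illustration, not the authors' code: the function name and the toy scores are hypothetical, and the use of the population standard deviation (`pstdev`) is an assumption, since the text does not say which variant was used.

```python
from statistics import mean, pstdev


def srbmi(scores):
    """Average coder-specific z scores for participants of one gender.

    scores maps coder -> {participant: RBM score on the 1-11 scale}.
    """
    coders = list(scores)
    participants = list(scores[coders[0]])
    z_by_coder = {}
    for coder in coders:
        values = [scores[coder][p] for p in participants]
        m, s = mean(values), pstdev(values)  # assumption: population SD
        # Step 1: coder-specific z scores.
        z_by_coder[coder] = {p: (scores[coder][p] - m) / s for p in participants}
    # Steps 2 and 3: sum z scores across coders, divide by the number of coders.
    return {
        p: sum(z_by_coder[c][p] for c in coders) / len(coders)
        for p in participants
    }


# Toy data: three hypothetical coders rating four same-gender photographs.
toy_scores = {
    "coder_a": {"p1": 3, "p2": 5, "p3": 7, "p4": 9},
    "coder_b": {"p1": 4, "p2": 5, "p3": 8, "p4": 10},
    "coder_c": {"p1": 2, "p2": 6, "p3": 6, "p4": 9},
}
result = srbmi(toy_scores)
```

Because each coder's z scores sum to zero, the averaged index is centered on zero within gender, which is what removes coder differences in location and spread.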

Analyses have demonstrated that the RBM scale is reliable (α = .91) and meets
several criteria of validity as a measure of body mass, including the prediction of
midlife BMI, obesity, health symptoms, chronic health conditions and premature
mortality (Reither et al. 2009). Reliability of the RBM scale was unaffected by
gender, as evidenced by equivalent alpha scores (α = .91 for male and female
photos). Furthermore, the RBM scale is comparable to traditional measures of body
mass (e.g., BMI in adolescence) in terms of its ability to predict body mass and 
health events in mid-life. For example, the correlation between SRBMI (measured 
at ages 17-18) and BMI (measured at ages 53-54) is .31, similar to a British cohort 
study that found a correlation of .39 between measures of BMI taken at ages 13 
and 50 (Wright, Parker, Lamont and Craft 2001). Among a subsample of 224 
WLS respondents who achieved maximum BMI between the ages 16-30, the cor- 
relation between SRBMI and this more proximal measure of BMI is substantially 
stronger, .48 for men and .55 for women. 

For our analyses, we divide the WLS sample into four categories based on the
SRBMI distribution. Underweight WLS participants have SRBMI scores that are
one or more standard deviations below the mean (SRBMI ≤ -1); low-normal weight
participants have SRBMI scores between the mean and one standard deviation below
the mean (-1 < SRBMI < 0) (referent group); high-normal weight participants have
SRBMI scores between the mean and one standard deviation above the mean (0 <
SRBMI < 1); and overweight participants have SRBMI scores that are one or more
standard deviations above the mean (SRBMI ≥ 1). We use these categories to examine
possible non-linear associations between SRBMI and occupational outcomes, and
to isolate the thinnest and heaviest groups of WLS respondents while still preserving
sufficient statistical power.4 However, we recognize that these terms (particularly
underweight and overweight) are heuristic devices, not precise clinical markers.
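
The four-way split, with low-normal weight as the referent, could be coded along these lines. The function names are hypothetical, and assigning a score of exactly 0 to the high-normal group is an assumption, since the text leaves that boundary open.

```python
def srbmi_category(srbmi):
    """Map a standardized relative body mass score to one of four groups."""
    if srbmi <= -1:
        return "underweight"
    if srbmi < 0:
        return "low-normal"  # referent group
    if srbmi < 1:
        return "high-normal"  # assumption: a score of exactly 0 falls here
    return "overweight"


def dummy_code(srbmi):
    """Return indicator variables, omitting the low-normal referent group."""
    category = srbmi_category(srbmi)
    return {
        "underweight": int(category == "underweight"),
        "high_normal": int(category == "high-normal"),
        "overweight": int(category == "overweight"),
    }
```

A participant in the referent group gets zeros on all three indicators, so each estimated coefficient is a contrast with low-normal weight.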


Potential Mediators and Confounders 


Our analyses include social background variables that could confound the relation- 
ship between SRBMI and occupational attainment. These variables include mother’s 

Table 1: Descriptive Statistics by Gender

                                                  Men (N = 1406)                      Women (N = 1621)
                                                  %           Mean        SD          %           Mean        SD
Occupational education first job                              30.2        31.4                    32.4        27.5
Occupational income first job                                 [illegible] 23.1                    8.6         10.7
Duncan SEI first job                                          38.3        26.9                    46.0        18.8
Occupational education current/last job 1975                  36.6        29.7                    34.9        27.4
Occupational income current/last job 1975                     [illegible] [illegible]             [illegible] [illegible]
Duncan SEI current/last job 1975                              48.9        24.7                    45.3        20.6
Occupational education current/last job 1993                  38.8        29.4                    31.7        26.9
Occupational income current/last job 1993                     [illegible] [illegible]             18.7        19.8
Duncan SEI current/last job 1993                              50.8        24.8                    48.8        20.7
Adolescent Standardized Relative Body Mass                    0           .8                      0           .8
  Underweight (SRBMI ≤ -1)                        9.5                                 [illegible]
  Low Normal Weight (-1 < SRBMI < 0) (Referent)   43.8                                38.9
  High Normal Weight (0 < SRBMI < 1)              33.4                                36.7
  Overweight (SRBMI ≥ 1)                          13.4                                12.8
Father's education                                            [illegible] 2.4                     [illegible] [illegible]
Mother's education                                            10.4        [illegible]             10.2        2.9
Father's occupational education                               19.2        20.7                    20.9        22.1
Average family income 1957-1960 ($100s)                       58.9        62.8                    61.3        53.4
Henmon-Nelson IQ                                              [illegible] [illegible]             100.2       14.6
High school class rank                                        42.8        27.5                    57.0        28.3
Educational attainment                                        13.9        2.4                     [illegible] [illegible]
Age at first marriage                                         24.0        4.3                     21.9        [illegible]
Age at first birth                                            25.6        4.3                     [illegible] [illegible]


and father’s years of completed schooling, father’s occupational attainment (oc- 
cupational education), and a three-year average of family income from 1954-1956, 
drawn from state tax records. We include two measures of family formation (age at 
first marriage and age at first birth) and educational attainment of the respondent 
(years of completed schooling), as potential mediators of the relationship between 
SRBMI and occupational attainment. Finally, we include baseline measures of ado- 
lescent cognitive ability (Henmon-Nelson IQ) and academic performance (high 
school class rank) to control for the potentially common causal pathway linking 
ability/achievement to both body mass and occupational attainment. 


Analyses 

First, we estimate a baseline multigroup (by gender) covariance structure model 
of occupational attainment over the life course. This covariance structure model 
is estimated with full information maximum likelihood and is represented by the 
path diagram shown in figure 1. The model may also be expressed formally via
the following equation:

$$\eta = B\eta + \Gamma\xi + \zeta$$

where $\eta$ is a vector of endogenous variables, $B$ is a matrix of effects between
endogenous variables, $\Gamma$ is a matrix of effects between exogenous variables (i.e., $\xi$)
and endogenous variables, $\xi$ is a vector of exogenous variables,
and $\zeta$ is a vector of disturbances for endogenous variables
(Jöreskog and Sörbom 1996). Measurement components of
covariance structure models are also estimated via the following
set of equations:

$$y = \Lambda_y \eta + \varepsilon$$
$$x = \Lambda_x \xi + \delta$$

where $y$ is a vector of endogenous observed variables, $\Lambda_y$ is a
matrix of the effects of endogenous latent variables ($\eta$) on endogenous
observed variables ($y$), $\varepsilon$ is a vector of measurement
errors for $y$ variables, $x$ is a vector of exogenous observed
variables, $\Lambda_x$ is a matrix of the effects of exogenous latent
variables ($\xi$) on exogenous observed variables ($x$) and $\delta$ is a
vector of measurement errors for $x$ variables (Jöreskog and
Sörbom 1996).
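
As a worked step, the structural equation can be solved for the endogenous variables, which separates direct from indirect effects. This sketch assumes that $I - B$ is invertible (true for a recursive model, though not stated explicitly here), and $T_{\eta\xi}$ is a label introduced only for illustration.

```latex
\begin{aligned}
(I - B)\,\eta &= \Gamma\xi + \zeta \\
\eta &= (I - B)^{-1}(\Gamma\xi + \zeta) \\
T_{\eta\xi} &= (I - B)^{-1}\Gamma
  && \text{total effects of } \xi \text{ on } \eta \\
T_{\eta\xi} - \Gamma &= \bigl[(I - B)^{-1} - I\bigr]\Gamma
  && \text{indirect effects of } \xi \text{ on } \eta
\end{aligned}
```

In this notation the direct effects are the entries of $\Gamma$, and everything that travels through other endogenous variables, such as schooling or family formation, is collected in the indirect term.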

Our model of life course occupational attainment is 
based on that proposed by Hauser and Warren (1997). In 
this model we postulate that occupational education plays 
a central role in inter- and intragenerational stratification
processes. Occupational SES at each stage in the life course 
is a function of the concurrent level of occupational educa- 
tion and occupational income, and occupational income is 
affected by the level of educational attainment within oc- 
cupations. Occupational attainment at each age is a function 
of prior occupational attainment, educational attainment, 
the timing of family formation, academic/cognitive ability, 
and adolescent SRBMI. We constrain father’s occupational 
attainment to act exclusively through initial occupational 
and educational attainment. We also postulate that family 
income in adolescence and parental education affect oc- 
cupational attainment only indirectly, through educational 
attainment. None of these restrictions significantly compromises
the fit of the model. Moreover, none of the paths
we constrained to zero (e.g., family income → occupational
education of first job) was statistically significant when left
unconstrained. In addition to the direct effect of SRBMI
on occupational attainment, we also estimate indirect ef- 
fects via educational attainment, family formation and 
recursive effects through prior occupational attainment for 
both men and women. This allows us to test our hypotheses 
concerning the impact of schooling and marriage markets 
in mediating the impact of adolescent body mass. Again following Hauser and
Warren (1997), we allow correlations between the disturbances of occupational
SES across the life course and also between the disturbances in occupational
income. Formally, this results in six additional free parameters, the disturbance
covariances $\mathrm{Cov}(\zeta_i, \zeta_j) = \psi_{ij}$ for each pair of career stages: three for
occupational SES and three for occupational income. The model also allows for
free inter-correlations among the exogenous variables.

In the second step of the analysis, we formally test whether the gender dif- 
ferences we observe in the baseline model are statistically significant by imposing 
equality constraints on the effects of relative body mass on educational attainment, 
family formation, first occupational attainment, 1975 occupational attainment 
and 1993 occupational attainment. Finally, we present estimates from a final 
model incorporating those equality constraints that do not cause significant de- 
terioration in model fit. For each model, we use the MLR estimator in Mplus 5.1 
(Muthén and Muthén 2008). 


Results 


Maximum likelihood estimates from the baseline models of occupational at- 
tainment over the life course are presented for females and males in tables 2 
and 3, respectively. We find that relative to women in the low-normal weight 
category (-1 < SRBMI < 0), women who were underweight in adolescence
(SRBMI ≤ -1) held mid-career jobs in which 3.4 percent more incumbents had
been to college. Additionally, women with high-normal weight in adolescence 
(0 < SRBMI < 1) had significantly reduced occupational attainment late in their 
careers. These differences existed net of academic ability and performance in 
high school as well as parental socioeconomic status. We also find countervail- 
ing indirect effects of adolescent SRBMI via educational attainment and family 
formation processes. Heavier adolescent girls (SRBMI scores above the mean) 
went on to complete about .3 fewer years of schooling than their thinner peers, 
which had a negative impact on their occupational attainment over the work 
career. However, overweight adolescent girls (SRBMI ≥ 1) also delayed family
formation by 1.18 years on average, which had beneficial effects on initial and 
mid-career attainment, and partially ameliorated the negative impact of reduced 
schooling. For women, the negative indirect impact of being overweight on 
initial occupational attainment via reduced schooling was four times larger than 
the beneficial indirect effect via delayed family formation. Similarly, the relative 
impact of these two indirect pathways was approximately 3 to 1 at the mid- and 
late-career stages. Overall, the effects of adolescent SRBMI on occupational 
attainment over the career were modest. For women, the total impact of being 
in the high-normal weight category reduced occupational education by .05 standard
deviations at labor market entry, .07 standard deviations at mid-career, and
.10 standard deviations at late-career relative to the low-normal weight category. 

Table 2a: Model of Occupational Attainment over the Life Course (Females)

                                                              First Job
Independent Variable              Educational   Family        Occupational  Occupational  Duncan
                                  Attainment    Formation     Education     Income        SEI
Father's occupational education   .02***                      .08**
                                  (7.69)                      (2.91)
Family income                     .00***
                                  (4.17)
Parental education                -.02
                                  (-1.50)
Academic ability                  [illegible]                 [illegible]
                                  (12.65)                     (3.71)
SRBMI underweight                 .16           -.25          -.34
                                  (.98)         (-.85)        (-.19)
SRBMI high normal weight          -.32**        -.11          .62
                                  (-3.14)       (-.55)        (.57)
SRBMI overweight                  -.31*         [illegible]   -1.32
                                  (-2.13)       (2.77)        (-.98)
Educational attainment                          .61***        10.06***
                                                (7.85)        (24.61)
Family formation                                              [illegible]
                                                              (5.47)
First occupational education                                                [illegible]   [illegible]
                                                                            (23.67)       (34.87)
First occupational income                                                                 .01
                                                                                          (.30)
1975 occupational education
1975 occupational income
1993 occupational education
1993 occupational income

R²                                .29           .09           .62           [illegible]   [illegible]

Notes: Numbers in parentheses are t-values.
† p < .10  * p < .05  ** p < .01  *** p < .001 (two-tailed tests)


Being overweight reduced occupational education by .04 standard deviations at 
labor market entry and mid-career, and by .06 standard deviations at late-career. 
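
The indirect pathways discussed above follow the usual product-of-coefficients logic for path models. The sketch below illustrates the schooling pathway for overweight women at labor market entry, using two unstandardized estimates as they can be read from the baseline table for females (-.31 years of schooling; 10.06 points of occupational education per year of schooling). The function name is hypothetical, and the result is a single-path illustration, not the total effects reported in the text, which combine several paths and are expressed in standard deviation units.

```python
def indirect_effect(x_to_mediator, mediator_to_outcome):
    """Product-of-coefficients indirect effect along one mediated path."""
    return x_to_mediator * mediator_to_outcome


# Readings from the baseline model for females (see lead-in for caveats).
overweight_to_schooling = -0.31          # years of completed schooling
schooling_to_occ_education = 10.06       # occupational education per year

via_schooling = indirect_effect(overweight_to_schooling, schooling_to_occ_education)
print(round(via_schooling, 2))  # -3.12
```

On this reading, overweight women entered occupations in which roughly three percentage points fewer incumbents had attended college, through reduced schooling alone.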

For men, we find that underweight adolescents had significantly reduced oc- 
cupational attainment at labor market entry compared to their peers with low- 
normal weight (Table 3). Those whose SRBMI scores were one standard deviation 
or more below the mean held first jobs in which 5.18 percent fewer incumbents 
had been to college. This is roughly equivalent to the effect of a half year less 
schooling. We also find that by the mid-career, men who were overweight as 
adolescents had significantly reduced occupational attainment relative to their 
peers who fell into the low-normal weight category. Overweight men held jobs 
in which the proportion of their peers who had been to college was lower by 
approximately 3 percentage points. Contrary to our findings for women in the
WLS, among men we find no significant indirect effects of adolescent SRBMI on 
occupational attainment via educational attainment or family formation processes. 
But similar to our findings for women, the net impact of adolescent SRBMI on 
occupational attainment was modest for men. Relative to those in the low-normal 
weight category, being underweight reduced occupational education by .07 stan- 
dard deviations at labor market entry, and .06 standard deviations at mid- and 
late-career. Being overweight reduced men's mid-career occupational attainment 
by .05 standard deviations. 

Next, we test for significant gender differences in the impact of adolescent relative
body mass by estimating models under various constraints. We first impose
gender invariance constraints on the impacts of the adolescent SRBMI categories
on initial occupational attainment. Imposing these constraints does not have
a significant deleterious effect on model fit using conventional measures. The
Satorra-Bentler scaled chi-square increases by 5.4 with 3 additional degrees of
freedom. In addition, there is substantial improvement in the BIC statistic (χ² − df
× ln(N)) (decrease in BIC > 10), suggesting gains in model parsimony without a
significant loss of fit. Next we impose gender invariance constraints on the effects 
of the adolescent SRBMI categories on mid-career attainment (1975). 
As before, the imposition of these constraints does not significantly
alter the fit of the model according to conventional criteria (Δχ² =
4.8 with 3 additional degrees of freedom) and improves parsimony
(ΔBIC = -18.9). Similarly, imposing gender invariance constraints
on the direct effects of the adolescent SRBMI categories on late- 
career occupational attainment (1993) does not cause model fit to 
deteriorate. These results show that despite the appearance of gender 
differences in tables 2 and 3, the direct effects of adolescent SRBMI 
on occupational attainment do not vary significantly for men and 
women. We then impose gender invariance constraints on the indi- 
rect effects of adolescent SRBMI, first via educational attainment 
and then through family formation. Constraining the effects of the 
SRBMI categories on educational attainment to be the same for men 
and women causes significant deterioration in the fit of the model 
using conventional standards (Δχ² = 9.7 with 3 additional degrees of
freedom), though BIC once again is improved. However, imposing
similar equality constraints on the delayed marriage pathway does not 
appreciably alter model fit. This suggests that SRBMI has the same 
effect on family formation timing for men and women. However, 
delayed family formation has no impact on the career attainment 
of men but positively affects women’s career standing. Overall this
suggests that the indirect impact of adolescent SRBMI, working
through educational attainment and, to a lesser degree, through the
timing of marriage/fertility, varies by gender. Finally, we estimate a
model that incorporates all of the previous constraints that did not 
result in a significant decline in model fit. This model fits the data 
much better than the baseline models presented in tables 2 and 3. 
The Satorra-Bentler scaled chi-square increases by a non-statistically
significant 18.8 with 12 additional degrees of freedom, which
substantially improves model parsimony (ΔBIC = -76.0).
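
The nested-model comparisons reported above can be checked approximately with a short script. Two caveats apply: a proper Satorra-Bentler scaled difference test needs the scaling correction factors, which are not reported, so the sketch treats each reported increase as an ordinary chi-square variate; and the sample size of 3,027 used for the BIC change is an assumption (the pooled number of men and women). The function names are hypothetical.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variate with integer df."""
    if x <= 0:
        return 1.0
    half = x / 2.0
    if df % 2 == 0:
        # Even df: exp(-x/2) * sum_{k=0}^{df/2 - 1} (x/2)^k / k!
        term, total = 1.0, 1.0
        for k in range(1, df // 2):
            term *= half / k
            total += term
        return math.exp(-half) * total
    # Odd df: erfc term plus a finite series in odd half-powers of x.
    term, total = math.sqrt(x), 0.0
    for j in range(1, (df - 1) // 2 + 1):
        if j > 1:
            term *= x / (2 * j - 1)
        total += term
    return math.erfc(math.sqrt(half)) + math.sqrt(2.0 / math.pi) * math.exp(-half) * total


def bic_change(delta_chi2, delta_df, n):
    """Change in BIC (chi-square minus df * ln N) when constraints are added."""
    return delta_chi2 - delta_df * math.log(n)


N = 3027  # assumption: pooled sample of men and women
for label, d_chi2, d_df in [
    ("first-job constraints", 5.4, 3),
    ("schooling constraints", 9.7, 3),
    ("final model", 18.8, 12),
]:
    print(label, round(chi2_sf(d_chi2, d_df), 3), round(bic_change(d_chi2, d_df, N), 1))
```

Under these assumptions the first and last comparisons fall short of significance at the .05 level while the schooling constraints do not, and every BIC change is negative, in line with the conclusions in the text; the exact BIC figures differ slightly from those reported, as expected given the unscaled approximation.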

In tables 4 and 5 we present parameter estimates from the final model.
This model takes the baseline model presented in tables 2 and 3 and

Table 3a: Model of Occupational Attainment over the Life Course (Males)

                                                              First Job
Independent Variable              Educational   Family        Occupational  Occupational  Duncan
                                  Attainment    Formation     Education     Income        SEI
Father's occupational education   .02***                      -.06†
                                  (6.15)                      (-1.86)
Family income                     .00
                                  (1.16)
Parental education                .01
                                  (.64)
Academic ability                  [illegible]                 .03
                                  (16.58)                     (.70)
SRBMI underweight                 -.26          .59           -5.18**
                                  (-1.26)       (1.33)        (-2.91)
SRBMI high normal weight          .02           .02           -1.26
                                  (.13)         (.09)         (-1.12)
SRBMI overweight                  -.04          .06           -1.12
                                  (-.25)        (1.39)        (-.76)
Educational attainment                          .20***        11.38***
                                                (5.21)        (42.36)
Family formation                                              .10
                                                              (.72)
First occupational education                                                [illegible]   .52***
                                                                            (33.34)       (51.56)
First occupational income                                                                 .46***
                                                                                          (32.02)
1975 occupational education
1975 occupational income
1993 occupational education
1993 occupational income

R²                                .20           .03           .14           .60           .89

Notes: Numbers in parentheses are t-values.
† p < .10  * p < .05  ** p < .01  *** p < .001 (two-tailed tests)





imposes equality constraints from the previous analysis that do not cause significant 
deterioration in model fit. Specifically, we impose gender invariance constraints on 
the direct effects of the adolescent SRBMI categories on occupational attainment at 
each wave, and also between SRBMI and the latent family formation construct. In 
tables 4 and 5 these constrained parameters are presented in boldface type. The im- 
position of these constraints has an important influence on the estimated impact of 
adolescent SRBMI on occupational attainment. Imposing gender invariance on the 
direct effects of SRBMI on occupational attainment yields no significant direct effects
at conventional levels of statistical inference (p < .05). We find a small, negative
and marginally significant (p < .10) direct effect of being underweight in adolescence
on initial occupational attainment. Similarly, at mid-career we find a marginally
significant deleterious impact of being overweight. However, elevated SRBMI con- 
tinues to adversely affect the occupational attainment of women indirectly through 
its inverse association with educational attainment. Again, this is partially offset by 
the beneficial effects of delayed family formation among heavy women. Overall, 
the results suggest a statistically significant, though substantively modest negative 
total effect of elevated adolescent SRBMI on life course occupational attainment for 
women. For overweight men, we find an even weaker and only marginally signifi- 
cant negative effect of adolescent SRBMI on occupational attainment. 


Discussion


Our first set of hypotheses (H1a and H1b) predicted that heavier women would 
experience lower occupational attainment than normal weight women due to 
employment-based discrimination, while body mass would not affect men’s oc- 
cupational standing at any point in the career. Findings from baseline models 
suggest that heavier women experienced lower levels of occupational success 
in later career stages, while heavier men experienced reduced attainment at 
the mid-career point. However, when we imposed gender equality constraints 
we found that adolescent SRBMI does not directly affect career trajectories in 
gender-specific ways, and only modestly affects the 
attainment of both men and women. 

Despite the lack of substantial direct effects of 
body mass on occupational attainment, we find gender
differences in the indirect impact of adolescent
body mass on career trajectories. Heavier women
in the WLS obtained less post-secondary schooling,
which hindered their careers. This is partially
consistent with our second set of hypotheses (H2a 
and H2b), which predicted that heavier adolescents 
would complete fewer years of schooling and experi- 
ence lower occupational attainment at each career 
stage. We do not find similar indirect effects for men. 

Our final set of hypotheses predicted that heavier 
men and women would experience delays in family 
formation, and that such delays would improve the 
early career attainment of women (H3a), but hinder 
the early career attainment of men (H3b). We do not 
find gender differences in terms of the effect of SRBMI 
on family formation timing. However, we do find 
differences in terms of the indirect effect of SRBMI 
on occupational attainment through family forma- 
tion processes. Whereas delayed family formation had 
no effect on men’s occupational attainment, delayed 
family formation improved women’s career standing. 
Although subsequent analyses with the full sample of 
WLS respondents might provide sufficient statistical 
power to detect significant gender differences sug- 
gested in the baseline models, it is important to bear 
in mind that the gender differences we observed were 
substantively modest compared to prior studies.

Our findings have several implications. First, scant 
evidence that body mass directly affects the career 

Table 4a: Model of Occupational Attainment over the Life Course (Females)

                                                              First Job
Independent Variable              Educational   Family        Occupational  Occupational  Duncan
                                  Attainment    Formation     Education     Income        SEI
Father's occupational education   [illegible]                 .08**
                                  (7.69)                      (2.88)
Family income                     .00***
                                  (4.17)
Parental education                -.02
                                  (-1.50)
Academic ability                  [illegible]                 [illegible]
                                  (12.66)                     (3.72)
SRBMI underweight                 [illegible]   .05           -2.45†
                                  (1.03)        (.21)         (-1.93)
SRBMI high normal weight          [illegible]   -.06          -.29
                                  (-3.12)       (-.38)        (-.37)
SRBMI overweight                  -.31*         [illegible]   -1.24
                                  (-2.11)       (3.10)        (-1.24)
Educational attainment                          [illegible]   10.06***
                                                (7.88)        (24.73)
Family formation                                              [illegible]
                                                              (5.47)
First occupational education                                                [illegible]   .50***
                                                                            (23.66)       (34.87)
First occupational income                                                                 .01
1975 occupational education
1975 occupational income
1993 occupational education
1993 occupational income

R²                                .29           .09           .62           [illegible]   [illegible]

Notes: Numbers in parentheses are t-values.
† p < .10  * p < .05  ** p < .01  *** p < .001 (two-tailed tests)





trajectories of men or women at any point in the career is surprising given the 
conclusions of previous research as well as the characteristics of the cohort under 
study. Wisconsin graduates in 1957 entered a labor market that had substantially 
lower obesity rates compared to more recent decades. Social psychologists have 
shown that the degree to which obese individuals are stigmatized depends in 
part on existing normative structures (Crosnoe and Muller 2004). Furthermore, 
negative social consequences and self-assessments associated with obesity are more 
common among populations where overweight/obesity is rare (Pinhey et al. 1997; 
Ross 1994). Consequently, overweight adolescents in the WLS may have been 
more heavily stigmatized relative to overweight individuals in more recent cohorts
(for a review of rising obesity rates since 1960, see Flegal et al. 2002). Under
these conditions, we expected to observe a greater impact of adolescent body mass
on the career trajectories of WLS participants than analyses of more recent cohorts have found.

Given the lack of residual direct effects of SRBMI on occupational attain- 
ment, we can point to no evidence of employer discrimination against heavy 
women (or men) at any time point during the career. Furthermore, we find no 
evidence that women are subject to gender-specific socio-cultural preferences 
and expectations of employers that affect occupational standing. These findings 
do not support previous research, which has suggested that the careers of heavy 
women are hindered by employer attributions about cooperativeness, intelligence
and competence.

Finally, our results suggest that the primary way that adolescent body mass 
affects careers is indirectly through differential educational investments among 
women. Females with elevated SRBMI obtained lower levels of education com- 
pared to their thinner peers, even after controlling for high school academic 
achievement and ability. While these negative effects on women’s careers were 
partially offset by delayed family formation, early differentials in educational 
investment limited women’s occupational advancement at each point in the career
trajectory. This suggests that heavy women may be less encouraged to 
seek higher levels of education or may self-select out of post-secondary 
education due to psychosocial complications, such as depressive symp- 
toms or poor self-esteem. 




Conclusion 


This research challenges previous studies that relied on cross-sectional 
or shorter-term longitudinal data and found direct effects of body 
mass on occupational attainment. By controlling for a variety of pre- 
viously unexplored (or underexplored) confounders and mediators, 
including social background, family formation processes, and aca- 
demic ability and achievement, this study isolated several pathways 
by which body mass may affect career standing over the life course. 
This study also included a baseline measure of body mass, lacking in 
some prior studies, and analyzed effects over the course of the career 
rather than being limited to 7 or 15 years following career entry. 

While a long tradition of research has examined the impact of gender, 
family background and education on occupational attainment (DiPrete 
and Grusky 1990; Featherman and Hauser 1978; Warren et al. 2002), 
much less work has analyzed the influence of physical characteristics on 
occupational outcomes. We find that relative body mass does not have 
strong direct effects on the career trajectories of either women or men 
but does affect women’s careers indirectly. Overweight women invest less 
in education compared to non-overweight women, and this negatively 
affects their career trajectories; delays in family formation only partially 
offset these negative impacts. We find limited adverse effects of body 
mass on men’s occupational attainment, and these appear to operate 
independently of educational and family formation processes. 

There are at least four limitations of our study that can be addressed 
by future research. First, ascriptive statuses such as race and ethnicity 
shape career outcomes in important ways (Browne et al. 2001; Browne 
and Kennelly 1999; Kaufman 2002; Moss and Tilly 2001; Reskin, 
McBrier and Kmec 1999) and also influence the likelihood of obesity 






Table 5a: Model of Occupational Attainment over the Life Course (Males)

                                                              First Job
Independent Variable              Educational   Family        Occupational  Occupational  Duncan
                                  Attainment    Formation     Education     Income        SEI
Father's occupational education   .02***                      -.06†
                                  (6.15)                      (-1.85)
Family income                     .00
                                  (1.15)
Parental education                .01
                                  (.62)
Academic ability                  [illegible]                 .02
                                  (16.59)                     (.63)
SRBMI underweight                 -.28          .05           -2.45†
                                  (-1.36)       (.21)         (-1.93)
SRBMI high normal weight          .01           -.06          -.29
                                  (.10)         (-.38)        (-.37)
SRBMI overweight                  -.04          [illegible]   -1.24
                                  (-.26)        (3.10)        (-1.24)
Educational attainment                          [illegible]   11.39***
                                                (5.21)        (42.40)
Family formation                                              .10
                                                              (.70)
First occupational education                                                [illegible]   .52***
                                                                            (33.34)       (51.56)
First occupational income                                                                 [illegible]
                                                                                          (32.02)
1975 occupational education
1975 occupational income
1993 occupational education
1993 occupational income

R²                                .25           .03           .13           .60           .89

Notes: Numbers in parentheses are t-values.
† p < .10  * p < .05  ** p < .01  *** p < .001 (two-tailed tests)


(Denney et al. 2004; Flegal et al. 2002). Because members of the 1957 cohort
of Wisconsin high school graduates are predominantly non-Hispanic white, the 
WLS does not allow for analyses of race and ethnicity—or the interactions between 
race/ethnicity and adolescent body mass—on career trajectories. Additional data 
sources, including subsequent panels of the NLSY and PSID, must be analyzed 
to determine whether body mass differentially affects the career trajectories of 
non-Hispanic whites, blacks, Latinos and Asians. 

Second, while overweight individuals in the WLS did delay family formation,
they did not differ in the probability of ever marrying. WLS respondents
married early and tended to remain married irrespective of body mass. However,
delayed marriage has a positive effect on occupational attainment for women. In
more recent cohorts, in which a greater proportion of women are delaying or 
forgoing marriage, weight-based discrimination in the marriage market may be 
more salient, positively affecting career trajectories as predicted. Future analyses of 
nationally representative samples should examine whether marriage delays grant 
additional years to invest in career-related capital, thus potentially limiting the 
negative effects of obesity. Analysis of subsequent panels of the NLSY and PSID 
could extend our study by identifying the ways in which delayed marriage among 
obese individuals in recent cohorts affects career attainment. 

Third, our estimates of the impact of SRBMI on occupational trajectories may
be the lower bounds of the true effect. Because the WLS is composed entirely
of high school graduates, by definition we are not capturing the bottom of the 
educational and occupational distributions. Given observed negative effects of 
SRBMI on educational attainment after high school, body mass may also influ- 
ence the probability of high school completion. If so, the total effects of SRBMI 
on occupational trajectories presented here are likely underestimates. Conversely, 
body mass and physical attractiveness may matter more to the success of women 
in higher status occupations than those at different points in the occupational
hierarchy. Future research must also untangle the relationship
between body mass and educational attainment
in order to specify the mechanisms that lead overweight
individuals, particularly girls and women, to invest less
in education.

Finally, the WLS did not collect extensive health 
information until 1993. Thus, our study cannot complete
a thorough investigation of the potential effects
of health status on occupational attainment among
overweight individuals. Previous research identifies 
strong linkages between body mass and health outcomes 
(Quesenberry et al. 1998; Thompson et al. 1998; Wolf 
and Colditz 1998), and between certain obesity-related 
health outcomes and work-related absenteeism (Burton 
et al. 1998; Narbro 1996; Thompson et al. 1998; Tucker 
and Freidman 1998, cited in Finkelstein et al. 2005). 
Incorporating health measures may specify whether dis- 
advantages overweight workers face are primarily due 
to employer discrimination, health status or a combina- 
tion of these. Such mechanisms are likely to be most 
important in the later stages of the career, when the 
incidence of chronic disease rises, and individuals begin 
to bear the burden of multiple accumulated co-morbid 
conditions.5


[Table 5b (continued): Current/Last Job 1975 and Current/Last Job 1993 panels, with columns for Occupational Education, Occupational Income and Duncan SEI. Cell values are not recoverable from the source.]

Notes


1. For this research we focus on social rather than health-related mechanisms. A large 
body of scholarship has analyzed the impact of obesity on health-related outcomes 
over the life course (see Visscher and Seidell 2001 for a review). Given the ages 
at which we are observing attainment, health-related mechanisms are most likely 
relevant for the 1993 job, and much less so for the first and 1975 jobs.


2. The “beauty myth” refers to the standards of physical attractiveness that women 
are held to in public and private spheres (Wolf 1991). We test the hypothesis that 
standards of feminine beauty affect women earlier in the career and are most likely 
to negatively affect larger women. 


3. We also estimated the impact of SRBMI on earnings in 1975 and 1993. We did not 
find statistically significant differences in earnings across SRBMI groups for men 
or women at either wave. When SRBMI was treated as a continuous measure, we 
found a marginally significant (p < .10) negative impact of elevated SRBMI on men’s 
earnings in 1993. This difference was largely mediated by current health status in 
1993, suggesting that health selection is important to consider at later stages of the 
career. Given that SRBMI only modestly affects occupational attainment in 1993, 
the impact on earnings must largely operate through reduced labor supply and not 
through reduced quality of job or wage rates. 


4. We tested other categorical specifications of SRBMI. The parameter estimates do 
not change dramatically, and the substantive conclusions are robust to the use of 
alternative cut points. 


5. In separate analyses (not shown), we examined whether BMI measured in 1993 mediated 
the relationship between adolescent SRBMI and occupational attainment in 1993. 
We find only slight evidence for such mediation. Among women, those who were 
heavy both in adolescence and in 1993 had the worst occupational attainment in 
1993, followed by women who were heavy in adolescence but were no longer heavy in 
1993. Women who were thin at both waves had the highest occupational attainment, 
followed by women who were thin as adolescents but who had become heavy by 
1993. This suggests that the influence of adolescent SRBMI is stronger than more 
contemporaneous measures of body mass, and that it is unlikely that the occupational 
differences observed in this analysis are driven by BMI-induced health selection, at 
least in the period of the work career covered here. 


References 


Allegretto, S.A., and M.M. Arthur. 2001. “An Empirical Analysis of Heterosexual/ 
Homosexual Male Earnings Differentials: Unmarried and Unequal?” Industrial & 
Labor Relations Review 54(3):631-46. 

Averett, Susan, and Sanders Korenman. 1996. “The Economic Reality of the Beauty 
Myth.” Journal of Human Resources 31(2):304-30. 

Averett, Susan, and Sanders Korenman. 1999. “Black-White Differences in Social and Economic Consequences of 
Obesity.” International Journal of Obesity 23(2):166-73. 

Baum, C.L., and W.E. Ford. 2004. “The Wage Effects of Obesity: A Longitudinal Study.” 
Health Economics 13(9):885-99. 

Biddle, Jeff E., and Daniel S. Hamermesh. 1998. “Beauty, Productivity and Discrimination: 
Lawyers, Looks and Lucre.” Journal of Labor Economics 16(1):172-201. 


1802 © Social Forces 88(4) 


Bjørge, Tone, Anders Engeland, Aage Tverdal and George Davey Smith. 2008. “Body 
Mass Index in Adolescence in Relation to Cause-specific Mortality: A Follow-up of 
230,000 Norwegian Adolescents.” American Journal of Epidemiology 168(1):30-37. 

Bosman, Ciska M., Gerard A. Pfann and Daniel Hamermesh. 2006. “Business Success 
and Business’ Beauty Capital.” Economics Letters 93(3):201-07. 

Brueckner, Hannah. 2004. Gender Inequality in the Life Course. Aldine de Gruyter. 

Browne, Irene, Cynthia Hewitt, Leann Tigges and Gary Green. 2001. “Why Does Job 
Segregation Lead to Wage Inequality Among African Americans? Person, Place, 
Sector or Skills?” Social Science Research 30(3):473-95. 

Browne, Irene, and Ivy Kennelly. 1999. “Black Women in the Labor Market: Stereotypes, 
Employer Perceptions, and Worker Profiles.” Pp. 302-26. Latinas and African 
American Women at Work. Irene Browne, editor. Russell Sage. 

Budig, Michelle, and Paula England. 2001. “The Wage Penalty for Motherhood.” 
American Sociological Review 66(2):204-25. 

Burton, W.N., C.Y. Chen, A.B. Schultz and D.W. Edington. 1998. “The Economic 
Costs Associated with Body Mass Index in a Workplace.” Journal of Occupational 
and Environmental Medicine 40(9):786-92. 

Cawley, J. 2000. “Body Weight and Women’s Labor Market Outcome.” Working Paper 
No. 7841. Cambridge, MA: National Bureau of Economic Research. 

Cawley, J. 2004. “The Impact of Obesity on Wages.” The Journal of Human Resources 
39(2):451-74. 

Cawley, John, Markus Grabka and Dean R. Lillard. 2005. “A Comparison of the 
Relationship Between Obesity and Earnings in the U.S. and Germany.” Journal of 
Applied Social Science Studies 125(1):119-29. 

Chandola, T., I.J. Deary, D. Blane and G.D. Batty. 2006. “Childhood IQ in relation 
to obesity and weight gain in adult life: the National Child Development (1958) 
Study.” International Journal of Obesity 30(9):1422-32. 

Conley, Dalton, and Rebecca Glauber. 2007. “Gender, Body Mass and Socioeconomic 
Status: New Evidence from the PSID.” Pp. 253-75. Advances in Economics and Health 
Services Research, Volume 17. Elsevier. 

Crosnoe, Robert, and Chandra Muller. 2004. “Body Mass Index, Academic Achievement, 
and School Context: Examining the Educational Experiences of Adolescents at Risk 
of Obesity.” Journal of Health and Social Behavior 45(4):393-407. 

Daniel, Kermit. 1995. “The Marriage Premium.” Pp. 113-25. Mariano Tommasi 
and Kathryn Ierulli, editors. The New Economics of Human Behavior. Cambridge 
University Press. 

Datar, Ashlesha, Roland Sturm and Jennifer L. Magnabosco. 2004. “Childhood 
Overweight and Academic Performance: National Study of Kindergartners and 
First-Graders.” Obesity Research 12(1):58-68. 

Denney, Justin, Patrick Krueger, Richard Rogers and Jason Boardman. 2004. “Race/ 
Ethnic and Sex Differentials in Body Mass among U.S. Adults.” Ethnicity and 
Disease 14(3):389-98. 

DiPrete, Thomas, and David Grusky. 1990. “Structure and Trend in the Process of Stratification 
for American Men and Women.” American Journal of Sociology 96(1):107-43. 

Duncan, Otis D. 1961. “A Socioeconomic Index for All Occupations.” Pp. 109-38. 
Occupations and Social Status. Albert J. Reiss, editor. Free Press of Glencoe. 


The Skinny on Success © 1803 


Featherman, David, and Robert Hauser. 1978. Opportunity and Change. Academic Press. 

Feingold, Alan. 1992. “Good-looking People Are Not What We Think.” Psychological 
Bulletin 111(2):304-41. 

Finkelstein, Eric A., Christopher J. Ruhm and Katherine M. Kosa. 2005. “Economic 
Causes and Consequences of Obesity.” Annual Review of Public Health 26:239-57. 

Flegal, K.M., M.D. Carroll, C.L. Ogden and C.L. Johnson. 2002. “Prevalence and Trends 
in Obesity Among US Adults.” JAMA 288(14):1723-27. 

French, Michael. 2002. “Physical Appearance and Earnings: Further Evidence.” Applied 
Economics 34(5):569-72. 

Frieze, Irene Hanson, Josephine E. Olson and June Russell. 1991. “Attractiveness and 
Income for Men and Women in Management.” Journal of Applied Social Psychology 
21(13):1039-57. 

Gortmaker, Steven, Aviva Must, James Perrin, Arthur Sobol and William Dietz. 1993. 
“Social and Economic Consequences of Overweight in Adolescence and Young 
Adulthood.” New England Journal of Medicine 329(14):1008-12. 

Gray, Jeffrey, and Michael Vanderhart. 2001. “The Determinants of Wages: Does Marriage 
Matter?” Pp. 356-67. Ties that Bind: Perspectives on Marriage and Cohabitation. 
Linda Waite, Christine Bachrach, Michelle Hindin, Elizabeth Thomson and Arland 
Thornton, editors. Aldine de Gruyter. 

Grossbard-Shechtman, Shoshana. Editor. 2003. Marriage and the Economy: Theory and 
Evidence from Advanced Industrial Societies. Cambridge University Press. 

Grossbard-Shechtman, Shoshana, and Shoshana Neuman. 2003. “Marriage and Work 
for Pay.” Pp. 222-47. Marriage and the Economy: Theory and Evidence from Advanced 
Industrial Societies. Cambridge University Press. 

Halpern, Carolyn, J. Richard Udry, Benjamin Campbell and Chirayath Suchindran. 1998. 
“Effects of Body Fat on Weight Concerns, Dating, and Sexual Activity: A Longitudinal 
Analysis of Black and White Adolescent Girls.” Developmental Psychology 35(3):721-36. 

Hamermesh, Daniel, and Jeff Biddle. 1994. “Beauty and the Labor Market.” American 
Economic Review 84(5):1174-94. 

Haskins, Katherine, and Edward Ransford. 1999. “The Relationship Between Weight and 
Career Payoffs Among Women.” Sociological Forum 14(2):295-318. 

Hatfield, Elaine, and Susan Sprecher. 1986. Mirror, Mirror...: The Importance of Looks in 
Everyday Life. State University of New York Press. 

Hauser, Robert M. 2005. “Survey Response in the Long Run: The Wisconsin Longitudinal 
Study.” Field Methods 17(1):3-29. 

Hauser, Robert M., and John Robert Warren. 1997. “Socioeconomic Indexes for 
Occupations: A Review, Update, and Critique.” Pp. 177-298. Sociological Methodology 
1997, Volume 27. A.E. Raftery, editor. Basil Blackwell. 

Hersch, Joni, and Leslie Stratton. 1997. “Housework, Fixed Effects and Wages of Married 
Workers.” Journal of Human Resources 32(2):285-307. 

Hersch, Joni, and Leslie Stratton. 2002. “Housework and Wages.” Journal of Human Resources 37(1):217-29. 

Jackson, Linda. 1992. Physical Appearance and Gender: Sociobiological and Sociocultural 
Perspectives. SUNY Press. 

Jackson, Linda, John Hunter and Carole Hodge. 1995. “Physical Attractiveness and 
Intellectual Competence: A Meta-Analytic Review.” Social Psychology Quarterly 
58(2):108-22. 




Jacobs, Jerry A. 1989. “Long-Term Trends in Occupational Segregation by Sex.” American 
Journal of Sociology 95(1):160-73. 

Jöreskog, K.G., and D. Sörbom. 1996. LISREL 8: User's Reference Guide. Chicago: 
Scientific Software International. 

Kaufman, Robert. 2002. “Assessing Alternative Perspectives on Race and Sex Employment 
Segregation.” American Sociological Review 67(4):547-72. 

Korenman, Sanders, and David Neumark. 1991. “Does Marriage Really Make Men More 
Productive?” Journal of Human Resources 26(2):282-307. 

Korenman, Sanders, and David Neumark. 1992. “Marriage, Motherhood and Wages.” Journal of Human Resources 27(2):233-55. 

Loh, Eng Seng. 1993. “The Economic Effect of Physical Appearance.” Social Science 
Quarterly 74(2):420-37. 

Loh, Eng Seng. 1996. “Productivity Differences and the Marriage Wage Premium for White 
Males.” Journal of Human Resources 31(3):566-89. 

Lundberg, Shelly, and Elaina Rose. 2000. “Parenthood and the Earnings of Married Men 
and Women.” Labour Economics 7(6):689-710. 

Mitra, Aparna. 2001. “Effects of Physical Attributes on the Wages of Males and Females.” 
Applied Economics Letters 8(11):731-35. 

Morris, Stephen. 2006. “Body Mass Index and Occupational Attainment.” Journal of 
Health Economics 25(2):347-64. 

Moss, Phillip, and Chris Tilly. 2001. Stories Employers Tell: Race, Skill and Hiring in 
America. Russell Sage. 

Must, Aviva, Paul F. Jacques, Gerard E. Dallal, Carl J. Bajema and William H. Dietz. 
1992. “Long-Term Morbidity and Mortality of Overweight Adolescents. A Follow- 
Up of the Harvard Growth Study of 1922 to 1935.” New England Journal of 
Medicine 327(19):1350-55. 

Muthén, Linda K., and Bengt O. Muthén. 2008. Mplus Version 5.1. Available at: 
http://www.statmodel.com. 

Narbro, K., E. Jonsson, B. Larsson, J. Waaler, H. Wedel and L. Sjostrom. 1996. 
“Economic Consequences of Sick-Leave and Early Retirement in Obese Swedish 
Women.” International Journal of Obesity and Related Metabolic Disorders 20(10):895-903. 

Neumark, David, and Sanders Korenman. 1994. “Sources of Bias in Women’s Wage 
Equations: Results Using Sibling Data.” Journal of Human Resources 29(2):379-405. 

Pagan, Jose, and Alberto Davila. 1997. “Obesity, Occupational Attainment and Earnings.” 
Social Science Quarterly 78(3):756-70. 

Pinhey, Thomas, Donald Rubinstein and Richard Colfox. 1997. “Overweight and 
Happiness: The Reflected Self-Appraisal Hypothesis Reconsidered.” Social Science 
Quarterly 78(3):747-55. 

Presser, H. 1994. “Employment Schedules Among Dual-Earner Spouses and the Division 
of Household Labor by Gender.” American Sociological Review 59(3):348-65. 

Puhl, Rebecca, and Kelly D. Brownell. 2001. “Bias, Discrimination and Obesity.” Obesity 
Research 9(12):788-805. 

Quesenberry, C.P. Jr., B. Caan and A. Jacobson. 1998. “Obesity, Health Services Use, and Health 
Care Costs Among Members of a HMO.” Archives of Internal Medicine 158(5):466-72. 

Quinn, Diane M., and Jennifer Crocker. 1999. “When Ideology Hurts: Effects of Belief 
in the Protestant Ethic and Feeling Overweight on the Psychological Well-Being of 
Women.” Journal of Personality and Social Psychology 77(2):402-14. 




Reither, Eric N., Robert M. Hauser and Karen C. Swallen. 2009. “Predicting Adult Health 
and Mortality from Adolescent Facial Characteristics in Yearbook Photographs.” 
Demography 46(1):27-41. 

Register, Charles, and David Williams. 1990. “Wage Effects of Obesity Among Young 
Workers.” Social Science Quarterly 71(1):130-41. 

Reskin, Barbara, Debra McBrier and Julie Kmec. 1999. “The Determinants and Consequences 
of Workplace Sex and Race Composition.” Annual Review of Sociology 25:335-61. 

Ross, Catherine E. 1994. “Overweight and Depression.” Journal of Health and Social 
Behavior 35(1):63-78. 

Sargent, James, and David Blanchflower. 1994. “Obesity and Stature in Adolescence and 
Earnings in Young Adulthood.” Archives of Pediatrics and Adolescent Medicine 148(7):681-87. 

SAS Institute Inc. 2003. SAS for Windows, Version 9.1. Cary, NC: SAS Institute. 

Sewell, William H., Robert M. Hauser, Kristen W. Springer and Taissa S. Hauser. 2004. 
“As We Age: A Review of the Wisconsin Longitudinal Study, 1957-2001.” Pp. 3-114. 
Research in Social Stratification and Mobility, Volume 20. Kevin T. Leicht, editor. Elsevier. 

Sorensen, A.B. 1974. “A Model for Occupational Careers.” American Journal of Sociology 
80(1):44-57. 

Sousa, Silvia. 2005. “Does Size Matter? A Propensity Score Approach to the Effect of 
BMI on Labor Market Outcomes.” Unpublished manuscript, European University 
Institute. Available at 
https://editorialexpress.com/cgi-bin/conference/download.cgi?db_name=pej2008&paper_id=64. 

South, Scott, and Glenna Spitze. 1993. “Housework in Marital and Nonmarital 
Households.” American Sociological Review 59(3):327-47. 

Swami, V., and M.J. Tovée. 2005a. “Female Physical Attractiveness in Britain and 
Malaysia: A Cross-Cultural Study.” Body Image 2:115-28. 

Swami, V., and M.J. Tovée. 2005b. “Male Physical Attractiveness in Britain and Malaysia: A Cross-Cultural 
Study.” Body Image 2:383-93. 

Taniguchi, Hiromi. 1999. “The Timing of Childbearing and Women’s Wages.” Journal of 
Marriage and the Family 61(4):1008-19. 

Thompson, D., J. Edelsberg, K. Kinsey and G. Oster. 1998. “Estimated Economic Costs 
of Obesity to US Business.” American Journal of Health Promotion 13(2):120-27. 

Tovée, M.J., and P.L. Cornelissen. 1999. “The Mystery of Female Beauty.” Nature 
399(6733):215-16. 

Tovée, M.J., D.S. Maisey, J.L. Emery and P.L. Cornelissen. 1999. “Visual Cues to Female 
Physical Attractiveness.” Proceedings of the Royal Society of London 266(1415):211-18. 

Tucker, L.A., and G.M. Friedman. 1998. “Obesity and Absenteeism: An Epidemiologic 
Study of 10,825 Adults.” American Journal of Health Promotion 12(3):202-07. 

Umberson, Debra, and Michael Hughes. 1987. “The Impact of Physical Attractiveness on 
Achievement and Psychological Well Being.” Social Psychology Quarterly 50(3):227-36. 

Valian, Virginia. 1999. Why So Slow? The Advancement of Women. MIT Press. 

Visscher, Tommy L.S., and Jacob S. Seidell. 2001. “The Public Health Impact of Obesity.” 
Annual Review of Public Health 22:355-75. 

Waite, Linda, and Maggie Gallagher. 2001. The Case for Marriage: Why Married People 
Are Happier, Healthier, and Better off Financially. Broadway Books. 

Waldfogel, J. 1997. “The Effect of Children on Women’s Wages.” American Sociological 
Review 62(2):209-17. 




Warren, John Robert, Jennifer Sheridan and Robert Hauser. 1998. “Choosing a Measure 
of Occupational Standing: How Useful Are Composite Measures in Analyzing Gender 
Inequality in Occupational Attainment?” Sociological Methods and Research 27(1):3-76. 

Wolf, Naomi. 1991. The Beauty Myth. Harper Collins. 

Wolf, A.M., and G.A. Colditz. 1998. “Current Estimates of the Economic Cost of Obesity 
in the US.” Obesity Research 6(2):97-106. 

Wright, Charlotte M., Louise Parker, Douglas Lamont and Alan W. Craft. 2001. 
“Implications of Childhood Obesity for Adult Health: Findings From Thousand 
Families Cohort Study.” BMJ 323(7324):1280-84. 


Stymied Mobility or Temporary Lull? 
The Puzzle of Lagging Hispanic College Degree Attainment 


Sigal Alon, Tel-Aviv University 
Thurston Domina, University of California, Irvine 
Marta Tienda, Princeton University 


We assess the intergenerational educational mobility of recent cohorts of high school 
graduates to consider whether Hispanics’ lagging post-secondary attainment reflects 
a temporary lull due to immigration of low education parents or a more enduring 
pattern of unequal transmission of social status relative to whites. Using data from 
three national longitudinal studies, a recent longitudinal study of Texas high school 
seniors and a sample of students attending elite institutions, we track post-secondary 
enrollment and degree attainment patterns at institutions of differing selectivity. We 
find that group differences in parental education and nativity only partly explain the 
Hispanic-white gap in college enrollment, and not evenly over time. Both foreign- 
and native-born college-educated Hispanic parents are handicapped in their abilities 
to transmit their educational advantages to their children compared with white par- 
ents. We conclude that both changing population composition and unequal ability to 
confer status advantages to offspring are responsible for the growing Hispanic-white 
degree attainment gap. 





Introduction 


The democratization of higher education after the G.I. Bill was enacted and again in 
the aftermath of the Civil Rights movement is evident in higher college participation 
rates for all demographic groups. At the turn of the 20th century only 2 percent of 
the adult population were college graduates and a scant 6 percent achieved college 
degrees as recently as 1950 (Cox and Alm 2001). Currently about one in four 
persons ages 25 and older, and 29 percent of 25-29 year olds, are college graduates. 
These secular trends conceal persisting differentials along race and ethnic lines, how- 
ever. As the most rapidly growing segment of the college-age population, Hispanics 
are notable because they are falling behind whites and blacks even as their college 
enrollment and graduation rates rise (Tienda and Mitchell 2006). 

From 1970 to 2000 the share of Hispanics with bachelor’s degrees doubled, 
rising from 5 to 10 percent among adults ages 25 and older. During the same 
period the share of same-age, college-educated black adults climbed from 4 to 14 
percent and the share of college-educated whites more than doubled, up from 
11 to 27 percent (U.S. Bureau of the Census 2003). Hispanic BA receipt rates 
trailed those of whites by 6 percentage points in 1970, but the gap widened to 17 
points over the next 30 years. Thus, in the midst of an impressive expansion in 
American higher education, Hispanics have fallen further behind in their attain- 
ment of college degrees. 
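
The arithmetic behind this widening gap can be checked directly from the shares cited in this paragraph. The following is a minimal sketch in Python. The dictionary layout and variable names are illustrative assumptions, and the only data used are the percentages reported in the text (U.S. Bureau of the Census 2003).

```python
# Shares of adults ages 25 and older with a bachelor's degree (percent),
# as cited in the text. The data structure itself is an illustrative assumption.
ba_share = {
    1970: {"hispanic": 5, "black": 4, "white": 11},
    2000: {"hispanic": 10, "black": 14, "white": 27},
}

# Hispanic-white gap in percentage points for each year.
gap_by_year = {year: s["white"] - s["hispanic"] for year, s in ba_share.items()}

# Growth factor of each group's share between 1970 and 2000.
growth = {g: ba_share[2000][g] / ba_share[1970][g] for g in ba_share[1970]}

print(gap_by_year)
print(growth)
```

Under these figures the Hispanic share doubles while the gap with whites still grows from 6 to 17 percentage points, because the white share starts from a higher base and rises by a larger factor.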

The growing Hispanic-white college gap challenges classic sociological predictions 
that between-group inequalities will tend to decline as society-wide educational op- 
portunities increase. According to status attainment theory, educational expansion 
permits upward intergenerational mobility, particularly for groups with low average 
levels of educational attainment (Blau and Duncan 1967; Treiman 1970). Thus, 
lagging Hispanic college degree attainment presents a puzzle that has broad implica- 
tions for understanding the contours of ethnic stratification as their share of the U.S. 
population increases while the demand for skilled workers remains high. 

The most widely cited explanations for the widening Hispanic-white college at- 
tainment gap focus on changes in population composition. Specifically, sustained 
immigration of low-skilled workers from Mexico and elsewhere in Latin America 
has put downward pressure on the educational attainment levels of second-gen- 
eration Hispanic youth, whose low college attendance is traced to low education 
parents (Fry 2004; Mare 1995; Schneider et al. 2006; Wojtkiewicz and Donato 
1995). Recent trends in educational attainment for native-born Hispanics are 
consistent with this explanation. The share of native-born Hispanic adults who 
have a bachelor’s degree or higher rose from 10 to 15 percent between 1980 
and 2000, while the share of foreign-born Hispanic college graduates declined 
modestly (Statistical Abstract of the United States 2006). This compositional 
explanation for the Hispanic college puzzle is consistent with status attainment 
theory, which predicts rising attainment levels across successive generations, but 
especially during a period of educational expansion (Blau and Duncan 1967; 
Hauser and Featherman 1978; Mare 1979, 1995). 

The voluminous status attainment literature has amply documented the pow- 
erful influence of parental education on children’s completed schooling. Because 
parental education provides a floor below which offspring are not likely to fall, 
sons and daughters of college-educated parents are more likely than offspring 
with less educated parents to pursue higher education. Compared with parents 
who lack post-secondary degrees, college-educated parents are better equipped to 
convey their postsecondary expectations at young ages. They better understand 
the post-secondary landscape and increasingly competitive admissions process, 
and invest more in resources to promote college-going (Glick and White 2004; 
Attewell et al. 2007; Schneider et al. 2006; Bellessa-Frost 2006). For example, in 
1994, 59 percent of high school graduates whose parents lacked college degrees 
enrolled in a post-secondary institution compared with 93 percent of students 
with college-educated parents. These disparities are even more striking according 
to type of institution attended. Only 27 percent of high school graduates whose 
parents were not college educated attended a four-year institution, but more 
than 70 percent of their schoolmates with college-educated parents did so (Choy 
2001; Nuñez and Cuccaro-Alamin 1998). 

By assuming that the relationship between parental and children’s education 
is similar across social groups, status attainment theory predicts a narrowing of 
group inequalities over time, as greater shares of disadvantaged young adults 
surpass their parents’ education levels. Yet, even early mobility theorists acknowl- 
edged that the assumption of uniform transmission might be unwarranted. In a 
classic article about occupational mobility, Otis Dudley Duncan (1968) showed 
that black men were more likely than white men to experience downward mobility 
(based on their fathers’ and their own first job). Duncan viewed this as a transitory 
aberration associated with the great migration of southern-born blacks to the 
industrial North. In his words (1968:21), “it is virtually certain... that this pattern 
cannot prevail indefinitely in the future.” 

Recent scholarship documenting that black parents have a more difficult time 
transmitting their educational advantages to their offspring challenges both the 
assumption that mobility rates are uniform among demographic groups and 
that unequal transmission rates between blacks and whites are temporary. Hertz 
(2004) finds that white children born in the top income quartile have a 45 
percent chance of remaining there compared with only 15 percent of their black 
counterparts. Moreover, he reports that affluent black children are nearly four 
times more likely than whites to experience extreme downward mobility, from 
the top to the bottom income quartile, and that black children from the lowest 
income quartile attain the top income quartile at only a quarter of the rates of 
whites. Likewise, Attewell and associates (2007) show that young black adults 
with college-educated parents experience higher rates of educational downward 
mobility than their white statistical counterparts. 

These stylized facts correspond to Persell, Catsambis and Cookson’s notion 
of “differential asset conversion” —a process by which the economic, social and 
cultural assets that influence educational attainment of one group do not operate 
in similar ways for other groups. In their words (1992:209), “Social stratification 
may influence the rate of asset conversion, as it does in financial markets, where 
small savers receive lower returns than do large savers. Thus, members of disad- 
vantaged groups may lack equal access to certain markets or may pay higher prices 
in the same market.” If college-educated Hispanic parents are less able to transmit 
educational opportunities to their children than college-educated whites, changes 
in population composition may explain only part of the widening Hispanic-white 
gap in college attainment. 

Whether and to what extent unequal intergenerational transmission rates ac- 
count for Hispanics’ lagging educational attainment is an open question. Recent 
evidence that educational attainment of fourth- and fifth-generation Mexican-
Americans trails that of whites (Telles and Ortiz 2008) suggests that Hispanics’ 
lagging educational attainment is not a temporary lull, but likely reflects stymied 
mobility across parental education levels. However, these findings are based on 
residents in two Southwestern metropolitan areas sampled in 1968 and re-interviewed 
35 years later and may not generalize to the current Mexican origin population, 
which is more geographically dispersed. 

By assessing the intergenerational educational mobility of three recent cohorts of 
high school graduates (1982, 1992 and 2004), we consider whether Hispanics’ lag- 
ging postsecondary attainment reflects a temporary lull or a more enduring pattern 
of unequal transmission of social status relative to whites. Specifically, we decom- 
pose the Hispanic-white college gap into its explained and unexplained components, 
distinguishing between differences in population composition, notably parental 
education and immigration status, and unequal transmission rates by origin. If 
immigration mainly changes the educational composition of Hispanic parents, our 
analyses of the relationship between parental education and young adults’ college 
trajectories adequately account for its role in the Hispanic college puzzle. But if, as 
several studies have suggested, foreign-born parents face unique challenges transmit- 
ting status advantages to their children (Glick and White 2003, 2004; Wojtkiewicz 
and Donato 1995), the growing Hispanic-white college gap also may derive from 
nativity differences in intergenerational transmission of educational status. 
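
The logic of splitting a gap into explained and unexplained components can be sketched with a simple standardization-style decomposition (in the spirit of Kitagawa and Oaxaca-Blinder decompositions). This is an illustration only: the text at this point does not specify the estimator used, the function name `decompose_gap` is hypothetical, and all numbers below are invented for the example rather than taken from the study.

```python
def decompose_gap(shares_a, rates_a, shares_b, rates_b):
    """Split the gap in overall rates (group A minus group B) into a
    composition ("explained") part and a rate ("unexplained") part.

    shares_*: share of each group in each parental-background category
    rates_*:  college enrollment rate within each category
    """
    categories = list(shares_a)
    overall_a = sum(shares_a[k] * rates_a[k] for k in categories)
    overall_b = sum(shares_b[k] * rates_b[k] for k in categories)
    # Explained: differences in composition, weighted by group A's rates.
    explained = sum((shares_a[k] - shares_b[k]) * rates_a[k] for k in categories)
    # Unexplained: differences in rates, weighted by group B's composition.
    unexplained = sum(shares_b[k] * (rates_a[k] - rates_b[k]) for k in categories)
    return {
        "gap": overall_a - overall_b,
        "explained": explained,
        "unexplained": unexplained,
    }


# Hypothetical inputs (NOT from the study): two parental-education categories.
white_shares = {"college_parent": 0.5, "no_college_parent": 0.5}
white_rates = {"college_parent": 0.9, "no_college_parent": 0.6}
hispanic_shares = {"college_parent": 0.2, "no_college_parent": 0.8}
hispanic_rates = {"college_parent": 0.8, "no_college_parent": 0.5}

result = decompose_gap(white_shares, white_rates, hispanic_shares, hispanic_rates)
print(result)
```

In this hypothetical case, roughly half of the gap reflects differences in parental education composition and the remainder reflects lower enrollment rates at the same parental education level, which is the pattern the unequal-transmission argument would predict. The choice of which group's rates and shares serve as weights is a convention; swapping them changes the split but not the total gap.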

Our approach to the well-investigated process of intergenerational mobility is 
designed to reveal differential transmission rates by origin among 
youth with comparable backgrounds in terms of parental education and nativity. 
Importantly, the empirical analyses also consider how expansion of the postsec- 
ondary system and the greater postsecondary stratification it generated affected the 
Hispanic-white college gap by distinguishing between quantitative and qualitative 
enrollment inequities. Finally, given the significance of our research question to 
the current debate about the assimilation of immigrants, we complement analyses 
of national trends in Hispanic postsecondary enrollment with data of Texas high 
school seniors in 2002.¹ 


Postsecondary Education Expansion and Ethnic Stratification 


The expansion of the postsecondary system during the period of the current 
investigation provides an intriguing setting for assessing questions related to 
temporal changes in intergenerational mobility. That not all segments of the 
postsecondary system expanded uniformly has direct implications for the col- 
lege choices confronted by successive cohorts of college entrants. Specifically, 
the relative growth of public and private, selective and nonselective, as well as 
two- and four-year institutions determines both the number and range of col- 
lege options confronted by white and Hispanic high school graduates in recent 
decades. Both quantitative and qualitative dimensions of the postsecondary 
system are important for understanding trends in educational intergenerational 
mobility, and the Hispanic college puzzle in particular. 

Recent trends reveal that the postsecondary education expansion from 1970 to 
2000 largely involved two-year institutions because enrollment at four-year public 
and private universities grew slowly over the same period (Digest of Education 
Statistics 2003). In Texas, for example, the share of total post-secondary enroll- 
ment at two-year institutions surpassed that at four-year institutions after 1995 
(Tienda 2006). That demand for access to selective institutions surged as children 
of baby boomers graduated from high school in unprecedented numbers created 
a college squeeze and sharpened divisions within the higher education system 
(Alon and Tienda 2007). 

Enrollment trends are also relevant for understanding persistent inequality in 
higher education because many recent studies show that the type and selectivity 
of college attended (two-year vs. four-year, non-selective vs. selective) influence 
the probability of graduation as well as long-term life chances (Bowen and Bok 
1998; Alon and Tienda 2005). Notably, Hispanics are over-represented at two- 
year institutions compared with whites, and among those who attend two-year 
colleges they are less likely than whites to transfer to four-year institutions. Not 
surprisingly, Hispanic college enrollees are less likely than whites to complete 
baccalaureate degrees (Schneider et al. 2006; Fry 2004). 

Thus, decoding the Hispanic college puzzle requires consideration of both 
quantitative and qualitative aspects of postsecondary outcomes. The quantitative 
aspect refers to the chances of college attendance, and the qualitative aspect (i.e., 
the stratification within the postsecondary system) refers to the selectivity and 
type of college destinations. In particular, Raftery and Hout’s (1993) notion of 
Maximally Maintained Inequality is relevant to understanding college enrollment 
differentials during a period of postsecondary expansion. They posit that even 
when post-secondary educational capacity expands, educational disparities between 
high- and low-status groups will persist until most members of high-status 
groups have reached the prevailing attainment threshold. Lucas (2001) refined the 
MMI concept by distinguishing between quantitative and qualitative inequities, 
arguing that privileged groups use their status advantages to secure quantitatively 
similar, but qualitatively superior education. His theoretical refinement, denoted 
Effectively Maintained Inequality, predicts that educational outcomes will become 
increasingly stratified along qualitative dimensions (such as college type and selec- 
tivity) even as disparities in overall college attainment levels shrink. 

These formulations suggest that both quantitative and qualitative dimensions of 
inequality should guide our quest to solve the puzzle of lagging Hispanic college 
degree attainment. Accordingly, our empirical analyses address several specific 
questions about quantitative and qualitative inequality in postsecondary educa- 
tion and how it might have structured and helped to maintain the ethnic gap in 
intergenerational mobility: (1. Are white and Hispanic students with college-edu- 
cated parents equally likely to attend college? (2. Are white and Hispanic students 
with college-educated parents equally likely to attend four-year non-selective and 
selective post-secondary institutions? Given claims that population composition, 
and immigration in particular, is largely responsible for the rising Hispanic-white 
college gap, we further ask: (3. Are native-born white and Hispanic students with 
college-educated parents equally likely to attend college, and, if so, are they equally 
likely to attend four-year selective post-secondary institutions? Finally, because 
the Hispanic-white college gap is also attributed to differential access to two- and 
four-year institutions, we also ask: (4. Conditional on enrollment in four- 
year institutions, are Hispanic students with college-educated parents as likely as 
whites to earn a bachelor’s degree? 

Answers to these questions will reveal whether Hispanics’ lagging educa- 
tional attainment is a temporary lull or potentially more enduring stymied 
intergenerational mobility. Finding that a substantial share of the attainment 
gap is unexplained by compositional differences—education and nativity status 
of parents—will require further research to theorize about why transmission 
rates are unequal in order to fully understand the growing ethnic disparities in 
college degree attainment. 


Data and Methods 
Estimation Strategy and Hypotheses 


The key question raised by the Hispanic college puzzle is whether the relationship 
between parental and offspring’s educational attainment differs between white and 
Hispanic high school graduates.² If the rates of intergenerational educational trans- 
mission for whites and Hispanics are equal, then the Hispanic college puzzle derives 
mainly from changes in the Hispanic population's educational composition. If status 
transmission rates are unequal, educational attainment for Hispanic youth would 
lag behind that of whites even in the absence of changes in population composition. 
To understand Hispanics’ lagging postsecondary attainment, we first estimate 
the odds of college enrollment (or graduation) using the following general form: 


O(x) = exp(xβ) 


where x includes a series of dummy variables indicating the individual’s ethnicity 
and parental education (Hispanic no college, Hispanic some college, Hispanic 
college, white no college, and white some college). The odds-ratios for these 
variables represent the college enrollment (or graduation) odds for each ethno- 
education group relative to the reference category (white high school graduates 
with college-educated parents). To differentiate the effect of parental education 
from that of immigration we also estimate a specification that fully interacts 
parental education with nativity. 
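
The odds-ratio logic can be illustrated with a minimal sketch. It assumes a model containing only the group dummies, in which case each exponentiated coefficient equals the group's enrollment odds divided by the odds of the reference group. The models estimated here also include regional controls, so this is a simplification, and the records below are hypothetical values chosen for illustration, not survey data.

```python
from collections import Counter

# Reference category named in the text: whites with college-educated parents.
REFERENCE = "white college"


def group_odds(records):
    """Return enrollment odds p / (1 - p) for each ethno-education group.

    Assumes every group has at least one enrollee and one non-enrollee.
    """
    enrolled = Counter()
    total = Counter()
    for group, did_enroll in records:
        total[group] += 1
        enrolled[group] += int(did_enroll)
    odds = {}
    for group in total:
        p = enrolled[group] / total[group]
        odds[group] = p / (1 - p)
    return odds


def odds_ratios(records, reference=REFERENCE):
    """Divide each group's odds by the odds of the reference group."""
    odds = group_odds(records)
    return {group: value / odds[reference] for group, value in odds.items()}


# HYPOTHETICAL records: (group, enrolled in college?).
records = (
    [("white college", True)] * 80 + [("white college", False)] * 20
    + [("Hispanic college", True)] * 60 + [("Hispanic college", False)] * 40
    + [("white no college", True)] * 50 + [("white no college", False)] * 50
)

ratios = odds_ratios(records)
for group, ratio in sorted(ratios.items()):
    print(f"{group}: {ratio:.3f}")
```

An odds ratio below 1.0 for a group means its enrollment odds fall short of those of the reference category, which is how the plotted ratios are read in the Results section.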

The hypothesis of equal transmission rates (H0) will be supported if at any 
given time point, the college-going and completion odds of Hispanic students 
whose parents lack college degrees roughly equal those of white students with simi- 
larly educated parents. The enrollment/graduation odds of children of both white 
and Hispanic parents lacking college experience should be lower than those of 
the children of parents with bachelor’s degrees. Furthermore, equal transmission 
rates imply trivial ethnic inequalities among children whose parents are college 
graduates. The “differential asset conversion” hypothesis (H1) assumes unequal 
intergenerational transmission rates for each cohort. Lower odds of college en- 
rollment/graduation for Hispanic students compared with white students with 
similarly educated parents refute the null hypothesis. Furthermore, lower enroll- 
ment/graduation rates for all Hispanic cohorts would suggest that the observed 
ethnic inequities are not transitory. 

We also model students’ odds of enrolling in two-year, four-year and four-year 
selective institutions in order to evaluate qualitative stratification among post- 
secondary outcomes. These specifications are based on logistic regressions where 
the dependent variable compares each college destination to no post-secondary 
education. H1 predicts that qualitative disparities in college destination will 
be larger than the quantitative gap in college enrollment. Furthermore, among 
students with college-educated parents, the Hispanic-white gap will be larger for 
four-year than two-year institutions, and for selective four-year than nonselective 
four-year institutions. For parsimony of exposition we plot odds ratios to portray 
the standing of ethno-education groups relative to whites while the coefficients are 
reported in the Appendix tables A-1 and A-2, available from the author. 


Simulations 


To decompose intergenerational educational mobility into its explained and unex- 
plained components, we compare the observed matriculation rates for white and 
Hispanic students with two simulated alternatives. The first simulates Hispanic 
college enrollment if their parents’ education matched that of white parents, hold- 
ing constant their intergenerational educational transmission rates. This isolates 
the share of the ethnic college gap attributable to a key difference in popula- 
tion composition, namely parental education. The second simulation estimates 
Hispanic college enrollment if they experienced the same rate of intergenerational 
educational asset conversion as whites, given their parents’ actual education. This 
exercise isolates the share of the ethnic gap due to unequal transmission rates. 
These simulations are based on multinomial logistic regressions where the de- 
pendent variable is an array of college destinations stratified by college type and 
admission selectivity.³ Computationally, the first simulation is generated by regress- 
ing college destinations on parental education levels for white students only, and 
then applying the resulting white coefficients to Hispanic high school graduates for 
the corresponding parent education strata. The second simulation is produced by 
regressing college destinations on parental education levels for Hispanic students 
only and estimating their enrollment patterns after substituting average white 
educational levels for the Hispanic parental education distribution. To further dis- 
aggregate the explained component, we repeat these simulations by disaggregating 
Hispanic students according to their parents’ education and birthplace. 
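
The two counterfactuals can be sketched as a simple standardization. This is a simplified stand-in for the multinomial logistic simulations, assuming a single binary outcome (any college enrollment) and no regional controls. The parental education shares are the 2004 national figures from Table 1; the stratum-specific enrollment rates and all variable names are hypothetical and serve only to show the mechanics.

```python
# Parental education shares (percent), 2004 national cohort, from Table 1.
WHITE_SHARES = {"no college": 21.8, "some college": 34.7, "BA or higher": 43.5}
HISPANIC_SHARES = {"no college": 46.3, "some college": 32.3, "BA or higher": 21.4}

# HYPOTHETICAL enrollment rates within each parental education stratum.
# These stand in for each group's intergenerational transmission rates.
WHITE_RATES = {"no college": 0.60, "some college": 0.75, "BA or higher": 0.90}
HISPANIC_RATES = {"no college": 0.55, "some college": 0.65, "BA or higher": 0.75}


def overall_rate(shares, rates):
    """Weight stratum-specific rates by the parental education distribution."""
    total = sum(shares.values())
    return sum(shares[level] / total * rates[level] for level in shares)


observed_white = overall_rate(WHITE_SHARES, WHITE_RATES)
observed_hispanic = overall_rate(HISPANIC_SHARES, HISPANIC_RATES)

# Composition counterfactual: white parental education, Hispanic rates.
sim_composition = overall_rate(WHITE_SHARES, HISPANIC_RATES)

# Transmission counterfactual: Hispanic parental education, white rates.
sim_transmission = overall_rate(HISPANIC_SHARES, WHITE_RATES)

print(f"observed white:       {observed_white:.3f}")
print(f"observed Hispanic:    {observed_hispanic:.3f}")
print(f"white composition:    {sim_composition:.3f}")
print(f"white transmission:   {sim_transmission:.3f}")
```

Comparing each counterfactual with the observed Hispanic rate shows how much of the gap moves when only composition, or only transmission, is equalized. The two components need not sum exactly to the observed gap because composition and transmission interact.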


Data Description 


We use five different data sets. First, to assess the intergenerational educational 
mobility during a period of educational expansion and population diversification, 
we analyze three nationally representative cohorts of high school graduates in 
1982, 1992 and 2004.⁴ High School & Beyond surveyed a nationally 
representative sample of high school sophomores in 1980 and re-interviewed 
respondents in 1982, when most were high school seniors. Additional follow-up 
interviews were administered until approximately 10 years after high school. The 
National Education Longitudinal Study of 1988 drew a representative sample of 
8th graders in 1988 and re-interviewed them in 1990, 1992 (their senior year), 
1994 and 2000. More recently, the Education Longitudinal Study of 2002 sam- 
pled 10th graders in U.S. high schools in 2002 and followed up with respondents 
in 2004 and 2006, when most were two years out of high school.⁵ 

Because sample sizes of college-educated immigrant parents are tiny in these 
national samples, we supplement the national studies with a large, representative 
survey of Texas public high school graduates, which permits sample stratification 
by both parental education and nativity. We use data from the Texas Higher 
Education Opportunity Project, based on a representative sample of 13,803 high 
school seniors who were interviewed in spring of 2002.⁶ A random sample of 
5,863 respondents was re-interviewed one year later to ascertain their college 
enrollment status. We restrict analyses to students who participated in both the 
base-year study and in the 2003 follow-up. 

Finally, to track ethnic differences in college graduation likelihood according to 
college selectivity level and parental education, we use the College & Beyond data- 
base, a restricted-access database built by the Andrew W. Mellon Foundation, based 
on a large sample of students who enrolled in 28 academically “more competitive” 
colleges and universities in the fall of 1989 (Bowen and Bok 1998). Again, national 
datasets contain too few observations of college enrollees to allow simultaneous 
stratification by ethnicity, parental education and college type. The C&B sample is 
sufficiently large to stratify both by ethnicity and parents’ educational level.⁷ 


Variable Definitions 


Empirical analyses focus on high school graduates’ ethnicity and parental edu- 
cational attainment as determinants of their postsecondary outcomes (college 
enrollment, destination and attainment). Respondents missing data on any of 
these attributes are dropped from the samples. 

We use students’ self-reported race and ethnicity to define four demographic 
groups: whites, blacks, Hispanics and Asians.⁸ All statistical analyses include blacks 
and Asians, but for parsimony of exposition these coefficients are reported only in 
the online appendix. One issue that may create a bias in our findings is Hispanics’ 
heterogeneity based on national origin. This is because immigrants from different 
countries differ in their educational selectivity vis-a-vis non-migrants in the home 
country, which in turn affects the college attainment of the second generation 
(Feliciano 2005; Bohon, Johnson and Gorman 2006). To a large extent, however, 
these differences reflect variation in family background, and parental education 
in particular (Duncan, Hotz and Trejo 2006). Moreover, generation status has been 
shown to be a more influential predictor of socioeconomic outcomes than 
national origin (Tienda and Mitchell 2006). Unfortunately, few datasets permit 
disaggregation of Hispanics by both nativity and national origin. 

Our focus on Hispanics with college-educated parents precludes a full disag- 
gregation by national origin, but as a sensitivity analysis we have disaggregated 
Hispanics into Mexicans and non-Mexicans, as was done by two NAS studies (Smith 
and Edmonston 1997; Tienda and Mitchell 2006). As expected, our results show 
that parents of Mexican youth average lower education levels compared with 
other Hispanic youth.⁹ Within parental education strata, however, the educational 
transmission rate among Hispanics varies little by national origin. This claim finds 
further support in lower Hispanic transmission rates compared with whites in all 
four datasets, despite changes that occurred in the composition of the national 
origin groups during the observation period. In the interest of clarity and parsi- 
mony, we do not disaggregate Hispanics by their national origin for multivariate 
analyses, but do reference relevant Mexican vs. non-Mexican differences when 
these are statistically significant. 

Parental educational attainment is measured by three categories representing the 
highest degree attained by either the respondents’ mother or father. High school 
graduates whose parents dropped out of high school or completed a high school 
diploma but never enrolled in post-secondary education are designated “No col- 
lege parents.” If at least one parent had entered college, but did not earn at least a 
bachelor’s degree, students are classified as having “Some college parents.” Finally, 
graduates who had at least one parent with a bachelor’s degree, including those who 
completed a graduate degree, are designated “BA or higher parents.” Using this clas- 
sification, respondents were sorted into six ethno-education groups: Hispanics with 
no college parents, Hispanics with some college parents, Hispanics with BA or 
higher parents, whites with no college parents, whites with some college parents, 
and whites with BA or higher parents (which serves as the reference category). 
Nativity was coded, using the Texas survey, by classifying respondents’ parents 
according to their birthplace, and both native- and foreign-born 
parents were further assigned to the six ethno-education groups.¹⁰ 
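
The coding rule can be expressed as a short sketch. The attainment labels, their ordering, and the function names are hypothetical and are not the variable codes used in the surveys. Placing associate degrees in the middle category follows the “Some College/AA” column heading of Table 1. The sketch assumes both parents' attainment is reported, since respondents with missing data are dropped.

```python
# HYPOTHETICAL attainment labels, ordered from lowest to highest.
LEVELS = [
    "less than high school",
    "high school diploma",
    "some college",
    "associate degree",
    "bachelor's degree",
    "graduate degree",
]


def parental_category(mother, father):
    """Classify a respondent by the higher of the two parents' attainment."""
    highest = max(LEVELS.index(mother), LEVELS.index(father))
    if highest <= LEVELS.index("high school diploma"):
        return "no college"
    if highest <= LEVELS.index("associate degree"):
        return "some college"
    return "BA or higher"


def ethno_education_group(ethnicity, mother, father):
    """Combine ethnicity and parental education into one group label."""
    return f"{ethnicity} {parental_category(mother, father)}"


print(ethno_education_group("Hispanic", "high school diploma", "some college"))
print(ethno_education_group("white", "graduate degree", "high school diploma"))
```

Because the classification uses the higher-attaining parent, one college-graduate parent is enough to place a student in the “BA or higher” group.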

Degree attainment data was collected 10 years and 8.5 years after high school 
graduation for the 1982 and 1992 high school cohorts, respectively. We sort 
students into four categories based on their final educational attainment: (1. no 
college; (2. attended college but earned no degree; (3. earned a certificate or an 
Associate Degree (AA); and (4. Bachelor's degree or higher. Analyses of degree 
attainment are restricted to the 1982 and 1992 national datasets because the col- 
lege graduation status of the Texas and the 2004 national high school graduation 
cohort is unavailable. In addition, we model 6-year graduation status using the 
C&B database. This dependent variable is obtained from institutional records that 
were collected for all students who enrolled in the fall of 1989 at all of the C&B 
institutions. In the college graduation analysis we also control for standardized test 
scores and high school class rank. 

Most studies of post-secondary attendance only consider whether or not stu- 
dents matriculated, thus ignoring the “qualitative” stratification in college destina- 
tions. For Hispanics this is important because their rising college participation 
involves disproportionate representation in two-year institutions, with their at- 
tendant implications for eventual degree attainment (Schneider et al. 2006; Swail 
et al. 2003; Fry 2004; Velez 1985). We use a multiple response category with 
four outcomes based on selectivity categories from Barron’s Profiles of American 
Colleges (1992), but also including non enrollment and two-year institutions: 
(1. no post-secondary education; (2. two-year colleges (those offering programs 
requiring two years or less to complete, including community colleges); (3. four- 
year noncompetitive colleges; and (4. four-year competitive colleges (median SAT 
above 900).¹¹ Finally, in order to improve the comparability between the three 
national datasets, models include statistical controls for Census regions.¹² These re- 
gional dummies identify respondents who live in New England, the Mid-Atlantic, 
East North Central, West North Central, South Atlantic, East South Central, 
West South Central, Mountain and Pacific states. 
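
The four-outcome destination variable can be sketched as follows. The function and parameter names are hypothetical. The sketch reduces the Barron's-based selectivity classification to the median SAT threshold stated above, and treating an institution with no reported median SAT as noncompetitive is an assumption made only for illustration.

```python
def college_destination(enrolled, two_year=False, median_sat=None):
    """Return the destination code (1 to 4) for one high school graduate."""
    if not enrolled:
        return 1  # no post-secondary education
    if two_year:
        return 2  # two-year college, including community colleges
    if median_sat is not None and median_sat > 900:
        return 4  # four-year competitive college (median SAT above 900)
    return 3  # four-year noncompetitive college


print(college_destination(False))
print(college_destination(True, two_year=True))
print(college_destination(True, median_sat=850))
print(college_destination(True, median_sat=1100))
```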


Results 


Table 1 classifies white and Hispanic high school graduates by parental education 
status using data from the three national longitudinal studies as well as the Texas 
study. Hispanic parents average lower levels of post-secondary education in all four 
surveys. Nationally, the share of white students whose parents lacked college train- 
ing fell by half between 1982 and 2004, but the comparable drop for Hispanics 
was less than 25 percent. Consequently the ethnic gap in parental education of 
high school graduates widened considerably from 1982 to 2004. For both groups, 
most of the improvement in parental education status occurred during the 1980s, 
but appears to have slowed during the 1990s, a period coincident with the surge 
in low-skill immigrants from Latin America. 

The white-Hispanic ratio provides a convenient way to summarize the 
trends and differentials reported in Table 1. Nationwide the white-Hispanic 
ratio for students with parents lacking college fell over the study period, and 
by 2004, the proportion of white high school graduates with no college par- 
ents was less than half the proportion of Hispanic high school graduates with 
no college parents. The mirror image is the group share with college-educated 

Table 1: Parental Education of 1982-2004 Hispanic and White High School Graduates 

                              No College   Some College/   BA or Higher 
                              Parents      AA Parents      Parents 
1982 
  White                         43.5          29.8            26.8 
  Hispanic                      59.4          25.4            14.9 
  Ratio, white to Hispanic       .73          1.16            1.80 
1992 
  White                         23.3          39.7            37.1 
  Hispanic                      44.7          37.0            18.3 
  Ratio, white to Hispanic       .52          1.07            2.03 
2004 (National) 
  White                         21.8          34.7            43.5 
  Hispanic                      46.3          32.3            21.4 
  Ratio, white to Hispanic       .47          1.07            2.03 
2002 (Texas) 
  White                         19.2          26.5            54.3 
  Hispanic                      55.7          25.4            18.9 
  Ratio, white to Hispanic       .34          1.04            2.87 

Notes: 
1982: High School and Beyond (HS&B), weight = fu2wt; 
1992: National Education Longitudinal Survey (NELS), weight = f3qwt; 
2004: The Education Longitudinal Study of 2002 (ELS), weight = BYSTUWT; 
2002: Texas Higher Education Opportunity Project (THEOP), weight = w2_weight. 



parents. By 2004 more than 40 percent of white high school graduates had at 
least one college-credentialed parent (up 17 percentage points since 1982), yet 
the share of Hispanic high school graduates with college-educated parents rose 
only 6 percentage points, from 15 percent in 1982 to 21 percent in 2004.¹³ 
The white-Hispanic ratio for students with college-educated parents rose from 
1.8 to 2.0 during the 1980s and remained unchanged through 2004. In Texas, 
the educational divide along ethnic lines is wider still. In 2002, 19 percent of 
white high school graduates had parents lacking college experience compared 
with over half of Hispanic students. Conversely, almost 54 percent of white 
students had college-educated parents, compared with only 19 percent of 
Hispanics (W-H ratio of 2.87). 
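
The white-Hispanic ratios can be recomputed directly from the percentages in Table 1, as in the short sketch below. The dictionary layout and function name are illustrative choices. Ratios computed from rounded percentages may differ in the second decimal from published ratios based on unrounded shares.

```python
# Percent of high school graduates by parental education, from Table 1.
# Tuple order: (no college, some college/AA, BA or higher).
TABLE_1 = {
    "1982": {"white": (43.5, 29.8, 26.8), "hispanic": (59.4, 25.4, 14.9)},
    "1992": {"white": (23.3, 39.7, 37.1), "hispanic": (44.7, 37.0, 18.3)},
    "2004 (National)": {"white": (21.8, 34.7, 43.5), "hispanic": (46.3, 32.3, 21.4)},
    "2002 (Texas)": {"white": (19.2, 26.5, 54.3), "hispanic": (55.7, 25.4, 18.9)},
}


def white_hispanic_ratios(cohort):
    """Return the white-to-Hispanic share ratio for each education category."""
    white = TABLE_1[cohort]["white"]
    hispanic = TABLE_1[cohort]["hispanic"]
    return tuple(round(w / h, 2) for w, h in zip(white, hispanic))


for cohort in TABLE_1:
    print(cohort, white_hispanic_ratios(cohort))
```

A ratio below 1.0 in the first column means whites are underrepresented among students whose parents lack college, while a ratio above 1.0 in the last column means whites are overrepresented among students with college-educated parents.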

Table 2 portrays the ethnic gap in degree attainment by parental education status 
for the 1982 and 1992 national cohorts. Consistent with secular trends, college 
attainment rates were higher for both whites and Hispanics from the class of 1992 
compared with the 1982 class. Moreover, in accord with predictions of status attain- 
ment theory, students with college-educated parents exhibit higher degree attain- 
ment than their ethnic peers with less educated parents. Yet, for all levels of parental 
education, Hispanic high school graduates from both national cohorts were less 

Table 2: Degree Attainment of 1982 and 1992 High School Graduates by Parental Education 

                                       Children's Educational Attainment 
                              No           Attended College, 
                              College      No Degree           Certificate   AA     BA      N 
1982 
White 
  No college parents          48.2         27.0                6.4           5.54   12.84   [illegible] 
  Some college/AA parents     27.3         32.2                6.0           7.9    26.5    [2,206] 
  BA or higher parents        11.8         24.9                3.7           6.1    53.5    [2,298] 
Hispanic 
  No college parents          [illegible]  33.0                [illegible]   2.9    5.9     [1,074] 
  Some college/AA parents     44.5         32.8                4.4           7.3    11.0    [474] 
  BA or higher parents        32.1         31.0                1.7           8.1    27.2    [illegible] 
White/Hispanic ratio, BA attainment for children of BA parents: 1.97 

1992 
White 
  No college parents          31.84        41.68               11.6          9.3    17.1    [illegible] 
  Some college/AA parents     16.08        43.87               8.7           9.1    31.0    [2,987] 
  BA or higher parents        6.1          29.18               4.5           6.0    58.8    [2,592] 
Hispanic 
  No college parents          20.7         56.26               13.7          8.7    14.2    [556] 
  Some college/AA parents     15.34        [illegible]         7.5           9.9    17.5    [476] 
  BA or higher parents        9.53         [illegible]         7.0           6.2    33.0    [illegible] 
White/Hispanic ratio, BA attainment for children of BA parents: 1.78 

Notes: 
1982: High School and Beyond (HS&B), weight = fu2wt; 
1992: National Education Longitudinal Survey (NELS), weight = f3qwt. 



likely to earn bachelor’s degrees than similarly-situated whites. Of particular note, 
among students with college-educated parents, whites were nearly twice as likely to 
earn a baccalaureate degree as Hispanics. To further examine how inequalities in 
transitions to and from the post-secondary system contribute to the Hispanic-white 
gap in degree attainment, we estimate the likelihood of college enrollment/destina- 
tion and college graduation for both groups across different time periods. 


Inequalities in College Enrollment and College Destinations 


For each dataset we estimate the association between parental education and whether 
offspring enrolled (1. at any post-secondary institution; (2. at a two-year college; (3. 
at a four-year college or university; and (4. at a competitive institution. The vector x 
includes a series of dummy variables for the ethno-education groups (Hispanic no college, 
Hispanic some college, Hispanic college, white no college, and white some college) 
and regional dummies. Figures 1 and 2 plot odds ratios to portray the standing of 
ethno-education groups relative to whites whose parents graduated from college, 
represented by the crossing line set at 1.0 (Long 1997).¹⁴ All of the odds ratios plotted 
in Figure 1 fall below the crossing line, indicating persistent ethnic disparities in the 

Figure 1. Odds Ratio Plots for Post-Secondary Enrollment 

likelihood of college attendance. In 1982 the odds ratios for Hispanics with college 
educated parents lag behind their white counterparts, but these youth also were 
unable to differentiate themselves from whites whose parents had only some college 
experience. Likewise Hispanic youth whose parents acquired some college were un- 
able to differentiate themselves from whites whose parents lack any college experience. 

The 1992 national stratification regime reveals a well-defined tripartite seg- 
mentation where white students with college-graduate parents are at the top of 
the hierarchy—well above Hispanics with college-educated parents—and all other 
groups are clustered below them. The ensuing dozen years witnessed a pulling 
apart of whites and Hispanics with college-educated parents. In 2004, the college 
enrollment odds for Hispanics with college-educated parents are only marginally 
higher than the enrollment odds for whites whose parents have some college. 
Results for the 2002 Texas high school cohort also reveal that Hispanics with 
college-educated parents lag well behind their white peers with comparably edu- 
cated parents. The results also indicate that black high school graduates are 
less likely than similarly situated whites to enroll in college. 

Figure 2. Odds Ratio Plots for Post-Secondary College Destinations 

a. Odds of Enrolling in a Two-Year College (cohorts shown: 1982 HS&B, 1992 NELS, 2004 ELS, 2002 THEOP) 

b. Odds of Enrolling in a Four-Year College 

Figure 2 portrays the qualitative aspects of the stratification regime based on 
group differences in enrollment at two-year, four-year and competitive postsec- 
ondary institutions. Panel A shows a white-Hispanic two-year college enrollment 
gap for all cohorts with college-educated parents (these differences are statistically 
significant for the 1982 and 2002 cohorts). Moreover, as panels B and C reveal, 
the association of parental and offspring’s educational credentials is coupled with 
the stratification of the post-secondary system. The odds of four-year college en- 
rollment are more polarized along ethnic and class lines compared with two-year 
enrollment. For all four cohorts, students whose parents lack any college or have 
some college exposure short of a degree are considerably less likely to enroll in 

Figure 2 continued 

c. Odds of Enrolling in a Competitive or More Competitive Four-Year College (cohorts shown: 1982 HS&B, 1992 NELS, 2004 ELS, 2002 THEOP) 

four-year colleges compared with offspring of parents with baccalaureate degrees. 
Furthermore, among students with college-educated parents, Hispanics (as well 
as blacks) are at a large disadvantage vis-a-vis whites in their access to four-year 
institutions.¹⁵ Data on competitive university enrollment for the 2004 national 
cohort is not yet available, but evidence from the other three surveys indicates 
that white students with college-educated parents are much more likely to attend 
competitive post-secondary institutions than all education-specific demographic 
groups, including Hispanics with college-credentialed parents. Among students 
whose parents lack bachelor's degrees the contours of ethnic stratification are less 
sharply defined for their offspring. 

Taken together, the results suggest that the key difference between Hispanics 
and whites stems from differential access to bachelor’s degree-granting institutions, 
which prior scholarship largely attributes to family background (notably parental 
education). The odds ratios plotted in Figures 1 and 2 are consistent with the basic 
tenets of status attainment theory because they show that within racial groups, 
higher levels of parental education are associated with higher levels of college en- 
rollment. However, they also challenge the assumption of a uniform association by 
ethnicity (and race). For all four cohorts, Hispanic students with college-educated 
parents are significantly less likely than white students whose parents are college 
graduates to enroll at any college, particularly a four-year institution. That ethnic 
inequality in educational attainment persists for more than three decades indicates 
that the Hispanic college puzzle is not a transitory phenomenon. Moreover, differ- 
ences in the distribution of parental education and Hispanic parents’ relatively low 
success in transmitting their educational resources to their offspring, particularly 
for attendance at four-year universities, contribute to the Hispanic-white college 
disparity. To better appreciate the relative magnitude of population composition 
and unequal intergenerational transmission rates in maintaining Hispanic-white 
college enrollment disparities, we decompose the gap into its main components. 


Decomposition of the Ethnic Gaps in College Destinations 


We first assess what part of the gap can be attributed to differences in parental 
education levels and what part to differences in transmission rate. In essence we 
compare the observed matriculation rates for white and Hispanic students with 
two simulated alternatives. The first estimates Hispanic college enrollment if their 
parents’ education matched that of white parents, 
assuming no change in the intergenerational trans- 
mission of educational status. The second simulates 
Hispanic college enrollment patterns if the distribu- 
tion of parental education remained unchanged, but 
Hispanic parents converted their educational assets 
at the same rate as whites. 

The results reported in Table 3 show that observed 
ethnic enrollment gaps range from 8 to 17 percentage 
points across the cohorts compared. The first simula- 
tion reveals the relative influence of parental education 
on the post-secondary enrollment of their offspring. 
Assigning parental education of white students to 
Hispanic high school graduates boosts overall college 
enrollment rates for all cohorts, and notably so for 
the 1992 national cohort and the 2002 Texas cohort. 
Specifically, more than 70 percent of 1992 Hispanic 
high school graduates and nearly 80 percent of 2002 
Hispanic high school graduates from Texas would 
have enrolled in college if their parents’ education 
was comparable to that of their white counterparts. 
This compares with actual enrollment rates of 66 and 
67 percent, respectively. Overall, parental education 
explains a quarter to a third of the Hispanic-white 
enrollment gap in 1982 and 2004, respectively, while 
it accounts for most of the gap in 1992 and in the 
Texas data (73 and 84 percent, respectively). 
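
The share of the gap attributed to parental education can be written compactly. The expression below is a sketch under the assumption, not stated explicitly in the text, that the explained share is the simulated gain in Hispanic enrollment divided by the observed white-Hispanic gap. The symbols are introduced here for illustration only: E_W and E_H are the observed enrollment rates of whites and Hispanics, and the hatted term is the simulated Hispanic rate under white parental education.

```latex
\[
  \text{share explained by parental education}
  = \frac{\hat{E}_{H}^{\mathrm{sim}} - E_{H}}{E_{W} - E_{H}}
\]
% Hypothetical illustration (not survey values):
% E_W = 80, E_H = 65, \hat{E}_H^{sim} = 70 gives (70 - 65) / (80 - 65) = 1/3.
```

Under this definition, a share near one means that equalizing parental education alone would nearly close the gap, while a small share leaves most of the gap to be accounted for by other factors, including unequal transmission rates.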

Yet, parental education contributes less to explaining qualitative gaps. For all 
national cohorts, ethnic differences in parental education explained no more 
than a quarter of the ethnic enrollment gap at four-year institutions and the more 
competitive institutions. In Texas, Hispanic parents’ educational assets account 
for about half of the ethnic college enrollment gap at four-year institutions and 
almost 40 percent of the average disparity in selective college enrollment. Mass 
migration of unskilled immigrants from Mexico is implicated in the results for 
Texas. Yet, even in Texas, most of the qualitative gaps in college attendance are 
not accounted for by parental educational resources, suggesting that unequal 
transmission rates also foment the Hispanic-white college gap. 

Notes to Table 3: 
1982: High School and Beyond (HS&B), weight = fu2wt; 
1992: National Education Longitudinal Survey (NELS), weight = f3qwt; 
2004: The Education Longitudinal Study of 2002 (ELS), weight = BYSTUWT; 
2002: Texas Higher Education Opportunity Project (THEOP), weight = w2_weight. 
Simulated enrollment probabilities derived from multinomial logistic odds-ratios. 
a Enrollment probabilities calculated by applying white-only mlogit model to Hispanic respondents. 
b Enrollment probabilities calculated by applying white parental education distribution to Hispanic-only mlogit model. 

The simulation that assigns whites’ transmission rates to Hispanics reveals that 
at all levels of institutional selectivity, part of the Hispanic-white enrollment gap 
results from Hispanic parents’ lower success in transmitting their education status 
to their offspring compared with similarly credentialed white parents. Equalizing 
intergenerational transmission rates would reduce qualitative enrollment gaps by 
boosting Hispanics’ access to baccalaureate institutions. Specifically, in 1982 and 
2004 Hispanics’ simulated enrollment rate at four-year institutions is 10 percent- 
age points higher than the observed share, and 4 percentage points higher in 1992 
and in the Texas data. The simulations reveal that the ethnic gap in enrollment 
at four-year and competitive colleges and universities would be substantially re- 
duced if educational transmission rates of Hispanics and whites were similar. Thus, 
parents’ ability to transmit status advantages can reduce qualitative educational 
inequality by increasing the share of Hispanic youth that attend baccalaureate 
institutions. In sum, our temporal perspective reveals that Hispanics’ lagging post- 
secondary attendance is more than a temporary lull due to changing population 
composition; it also stems from unequal transmission rates. 

To further unpack the Hispanic college puzzle we assess how much immigra- 
tion contributed to the rising disparities by changing the educational composition 
of Hispanic parents, and whether native- and foreign-born parents are equally suc- 
cessful in transmitting status advantages to their children. Using the Texas data, Table 
4 repeats the simulations reported in Table 3 by disaggregating Hispanic students 
according to their parents’ birthplace.¹⁶ The top panel reports the simulation for 
Hispanic students with U.S.-born parents and the lower panel repeats the exercise 
for Hispanics with foreign-born parents. 

In 2002 Hispanic Texas public school graduates with native-born parents were 
11.5 percentage points less likely than white students to enroll in any college, 16 
percentage points less likely to enroll in four-year colleges, and 19 percentage 
points less likely to enroll in a competitive four-year college. Consistent with 
national data, they were more likely than whites to enroll in two-year institutions. 
The college enrollment gap for Hispanic graduates with foreign-born parents was 
larger still—18.4 points overall. Moreover, their enrollment at four-year institu- 
tions was 27 percentage points below that of white Texas students. 

The simulations reveal that if U.S.-born parents of Hispanic students had the
educational resources of whites, the Hispanic-white college enrollment gap would be
virtually eliminated. This finding regarding quantitative inequality provides strong
support for the status attainment explanation of Hispanic-white educational
inequality. Yet, the status attainment explanation does not speak to the experience
of students with foreign-born parents, nor does it explain qualitative inequality 
between the groups. If immigrant parents had the same educational resources as 
whites, they could only halve the overall enrollment gap. Moreover, parental edu- 
cation does not entirely account for qualitative differences in college destinations 
between Hispanics and whites, irrespective of birthplace. For Hispanic students 
with native-born parents, parental education accounts for 60 percent of the gap 
in attendance at four-year colleges, but only for 35 percent among their counter- 
parts with immigrant parents. The explanatory power of parental education for 
enrollment at competitive institutions is lower still, accounting for 41 and 33 
percent of the ethnic disparities among Texas high school graduates with native- or 
foreign-born parents, respectively. 
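
The logic of the two simulations can be illustrated with a stylized sketch. It is not the authors' procedure, which is not reproduced here. Every number below is hypothetical, and the two-category split of parental education and the function names are assumptions made only for illustration.

```python
# Stylized counterfactual simulation. All shares and rates are hypothetical.

def enrollment_rate(parent_shares, enroll_rates):
    """Overall enrollment rate: parental-education shares weighted by
    the enrollment rate within each parental-education category."""
    return sum(parent_shares[k] * enroll_rates[k] for k in parent_shares)

def share_of_gap_closed(observed, simulated, reference):
    """Fraction of the observed gap (reference - observed) that the
    simulated rate would close."""
    return (simulated - observed) / (reference - observed)

# Hypothetical parental-education distributions and enrollment rates.
white_shares = {"no_college": 0.60, "college": 0.40}
white_rates = {"no_college": 0.50, "college": 0.90}
hisp_shares = {"no_college": 0.80, "college": 0.20}
hisp_rates = {"no_college": 0.45, "college": 0.75}

white = enrollment_rate(white_shares, white_rates)
hisp = enrollment_rate(hisp_shares, hisp_rates)

# Simulation 1: Hispanic parents are given whites' educational resources.
sim_resources = enrollment_rate(white_shares, hisp_rates)
# Simulation 2: Hispanic parents are given whites' transmission rates.
sim_transmission = enrollment_rate(hisp_shares, white_rates)

closed_by_resources = share_of_gap_closed(hisp, sim_resources, white)
closed_by_transmission = share_of_gap_closed(hisp, sim_transmission, white)
print(round(closed_by_resources, 2), round(closed_by_transmission, 2))
```

In this toy setup neither counterfactual closes the gap on its own, which mirrors the qualitative point that both composition and transmission matter.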

To illustrate the implications of these findings the second simulation demon- 
strates that equalizing intergenerational transmission would not close the overall 
ethnic college enrollment gap either for Hispanics with U.S.- or foreign-born 
parents, but it could considerably alter their college destinations, from two-year 
colleges to four-year institutions including the most selective ones. The magnitude 
of the gap reductions is larger for students with foreign-born parents, affirming 
that immigration depresses the ability of educated Hispanic parents to transmit
their assets to their children. 

Findings for Texas cannot be generalized to the United States as a whole, partly 
because the state’s post-secondary system includes a larger share of two-year in- 
stitutions compared to the national average and partly because more than half of 
Texas's college-age population is minority (Tienda 2006). Yet, they are instructive 
in affirming that both foreign- and native-born college-educated Hispanic parents 
are handicapped in their ability to transmit their educational advantages to their 
children compared with white parents. Hispanics’ intergenerational transmission 
handicap is most pronounced for access to competitive post-secondary institu- 
tions, particularly among students with immigrant parents. The simulations for 
Texas cement our claims that a changing population composition due to immigration
of less-educated parents and the unequal ability to confer status advantages to
offspring are responsible for the growing Hispanic-white college enrollment gap. 


Inequalities in College Completion 


Hispanic-white disparities in post-secondary enrollment and college destination 
are the precursors to observed gaps in degree attainment. The process of differen- 
tial asset conversion may continue through college, possibly widening disparities 
in graduation rates, even at four-year schools or at the most selective institutions in 
the nation. To assess this claim we analyze the bachelor’s degree completion odds of 
students who attended the most selective institutions in 1989 based on the C&B 
study. After estimating the college graduation odds of all C&B students (Panel A 
in Table 5), we disaggregate the sample by institutional selectivity. Specifically, we
compare bachelor’s attainment odds at the most selective institutions in American 
higher education, where the average SAT score is 1250 or higher (Panel C), with 
the degree attainment at less selective institutions (Panel B). 

The first model of Table 5 reveals that Hispanic C&B students, even those with
college-educated parents, are less likely than whites to have earned a bachelor’s degree
within six years of enrollment. This generalization applies to the most selective
institutions as well as those designated highly selective (Models 3 and 5). Yet, controlling
for test scores and high school ranking, Hispanic students with college-educated parents
who attended less selective C&B institutions are no less likely to earn a bachelor’s degree
than white enrollees (Model 4). Substantively, these results indicate that the Hispanic-white
gap in bachelor’s attainment is partly a function of ethnic inequalities in high
school achievement that determines college readiness, even among students whose
parents hold bachelor’s degrees. Yet, at the nation’s most elite colleges and universities,
Hispanic students with college-educated parents are significantly less likely than
their white counterparts to complete a bachelor’s degree, even after equalizing their
academic preparation. This suggests that highly qualified Hispanic students who
attend the most selective universities face academic and other challenges greater than
those faced by their white peers with highly educated parents. Data do not permit further probing
of this result, such as comparing the graduation rates of students whose parents have 
post-graduate degrees or who themselves graduated from elite institutions, but this 
is an area deserving further investigation. 


Conclusions 


This study investigates the Hispanic college puzzle, namely, that Hispanics have 
fallen further behind whites in their post-secondary enrollment and bachelor’s 
attainment levels even as their college enrollment and graduation rates reached 
an all-time high. This stylized fact is all the more important because Hispanics 
surpassed blacks as the largest national minority in 2003 and are projected to grow 
at a rapid clip through 2030 (Tienda and Mitchell 2006). Although the surge in 
low-skill immigration from Latin America is a popular explanation for the wid- 
ened Hispanic-white college gap, we demonstrate that compositional shifts are not 
the only factor responsible for the growing degree attainment disparities. Group 
differences in parental education only partially explain the Hispanic-white gap in 
college enrollment, and not evenly over time. The explanatory power of parents’
nativity status also is limited because all Hispanic parents are handicapped in their
ability to convert their educational assets for the benefit of their offspring.

The large unexplained ethnic gap in access to four-year and the more competi- 
tive institutions indicates rising qualitative inequality in higher education even as 
quantitative gaps narrowed. This, too, curbs Hispanic degree attainment because the
type and selectivity level of institution attended shape students’ graduation chances 
(Bowen and Bok 1998; Alon and Tienda 2005). Moreover, the ethnic transmission 
disadvantage persists well into college, as evidenced by considerable gaps in graduation
rates among students attending the nation’s most selective institutions.

Immigration is not the sole factor driving the Hispanic-white college gap, but
this is not to say that changes in population composition are inconsequential for
its widening. Children of immigrant parents face formidable barriers to college
access, particularly if they are first-generation college-goers. Our results confirm
that Hispanic students with foreign-born parents face
greater disadvantages in post-secondary destinations than their counterparts with 
native-born parents mainly because of their parents’ lower education levels. That 
children of immigrants are the fastest growing segment of the Hispanic popula- 
tion magnifies the social significance of these findings for the contours of future 
socioeconomic inequality. 

The reasons for the unequal transmission rates are a topic for another study. Ethnic
differences in high school academic preparation and college orientation may partially
explain the puzzle. For example, Schneider et al. (2006) find
that Hispanic students whose parents hold college degrees are significantly less likely
than their white peers to take advanced math courses in high school, which is an
important predictor of college enrollment. Rising income
inequality coupled with soaring college costs and changes in financial aid policies 
may further exacerbate the ethnic gap in college destinations and graduation rates 
(Alon 2007; The College Board 2005a; Morris and Western 1999; The College 
Board 2005b). Other circumstances, such as limited information about college and 
graduation from high schools that do not prepare them for college-level work, may 
further clarify the Hispanic college puzzle. Finally, ethnic differences in families’
social capital that are related to family structure—such as the total fertility rate, the 
percentage of births to unmarried mothers, and the percent female householder 
(Landale and Oropesa 2007)—may also partly account for the unequal transmis- 
sion rates. Our findings should propel additional research to identify other factors 
responsible for the growing Hispanic-white post-secondary gaps. 


Notes 

1. High-immigration states, like Texas, are notable for their high level of ethnic 
inequality in educational attainment (Murdock 2003). For example, in 2005, 35 
percent of adult white Texans held college degrees as opposed to less than 10 percent 
of comparably aged Hispanics (U.S. Census Bureau 2006). 

2. Students who failed to earn a high school diploma or a GED (and thus were ineligible 
to enter college) were excluded from the analysis. 

3. The model follows this general form: $\Omega_{m|n}(\mathbf{x}) = \exp\{\mathbf{x}(\boldsymbol{\beta}_m - \boldsymbol{\beta}_n)\}$, where m and n are types
of college destinations.

4. These longitudinal surveys were conducted by the National Center for Education Statistics. 

5. Because all three national studies are based on two-stage, stratified sampling designs, 
our multivariate analyses correct the downwardly biased standard errors due to the 
clustering of students in institutions. 

6. Like the national surveys, the THEOP data are based on a school-level sampling scheme.
One hundred eight Texas public high schools were randomly selected based on a scheme 
that classified Texas public high schools as of 2000 based on region, urbanicity and 
school racial/ethnic composition. Eighty-two of these high schools granted permission 
to administer surveys during class time to all seniors present, and the remaining schools 
required the project to sample a small number of classes or administer the survey via 
the mail. Sample weights produce the population distribution. 


7. The C&B analyses are adjusted to account for the clustering of observations in a small
number of institutions. 


8. Native Americans and students who reported another race or ethnicity are excluded 
from these analyses. 


9. These results are not reported but are available from the authors. 


10. Respondents who did not provide data on their parents’ nativity are set to missing on
this variable and are excluded from relevant analyses.


11. For the analyses based on the ELS data we only consider whether or not students 
enrolled in two-year colleges or four-year colleges because detailed data on 
postsecondary institutions are not currently available. 


12. This accounts for differences in the sampling frames of the three national datasets. 


13. Among Mexicans the rise in the share of educated parents was trivial: from 15 percent 
in 1982 to only 18 percent in 2004. Among non-Mexicans it was more substantial 
(from 15 to 29 percent). 


14. Estimates for figures 1 and 2 are reported in Appendix tables A-1 and A-2
(http://www.tau.ac.il/~salon1/).


15. A sensitivity analysis reveals that there are no differences in transmission rates between 
Mexicans and other Hispanics. 


16. We are unable to directly assess parents’ immigration status in the national data 
because these samples lack sufficient numbers of Hispanic high school graduates with 
college-educated parents who were also born abroad. 

The Declining Relative Status of Black Women
Workers, 1980-2002 


Raine Dozier, Western Washington University 


During the 1980s and 1990s, industrial restructuring led to a marked increase in 
wage inequality. Women, however, were not as negatively affected by declining manu- 
facturing employment because their pay was relatively low within the industry, and 
their already high representation in the service sector provided access to newly cre- 
ated opportunities. However, black and white women did not fare equally and the 
black-white wage gap more than doubled. As both black and white women increased 
their representation as professionals and managers, black women became more likely 
to earn low wages within these occupations. Black degree holders also lost ground 
as they were unable to keep pace with the remarkable gains made by white women 
degree holders. The growth in black-white wage inequality, then, was not due to black 
women’s relegation to “bad jobs.” Instead, as women increased their share of “good 
jobs,” white women disproportionately benefitted. 





The transition from a goods-producing to a service-sector economy in the United 
States has captured the interest of both sociologists and the American public 
(Atkinson 2005; Ehrenreich 2001; Moore 1989; Morris and Western 1999). The 
popular press has often blamed the erosion of men’s earnings on the decline in 
unionized, manufacturing jobs and the increase in low-wage service sector jobs, 
yet less attention has been given to the effect of deindustrialization on women’s 
wages (Farber 1997; Moore 1989). Although the transition to a service-sector 
economy has raised the specter of a nation of McDonald’s counter clerks and 
Walmart cashiers, the service sector encompasses a wide variety of jobs including 
lawyers, educators and other professionals, and employs the majority of workers 
in the United States (Bernhardt, Morris and Handcock 2001; Carnevale and 
Rose 1998; Morris and Western 1999). Thus the shift to a service economy did 
not strictly mean a “race to the bottom” for American workers; instead it heralded 
greater inequality due to the vast range of jobs in the service sector. 

Concurrent with industrial restructuring, women’s labor market position improved
due to both increasing educational attainment and stronger labor force attachment,
making them uniquely positioned to benefit from the shift to an “office
economy” (Carnevale and Rose 1998). Within this advantage, however, black and
white women fared differently. Between 1980 and 2002, the median black-white
wage gap among women workers grew from 8 to 18 percent (see Figure 1), while
the black-white wage gap among men remained similar (Bernhardt et al. 2001).
In dollar terms, the gap in median hourly pay grew from 79 cents to $2.05¹
between 1980 and 2002; the growth in the mean wage gap was even greater.

The rise in racial wage inequality among women is curious. While the black- 
white wage gap among men remained relatively stable, the gap among women 
grew significantly. After the remarkable strides made by black women in the 1960s 
and 1970s, one would expect the black-white wage gap to continue to shrink as 
black women made steady progress in educational attainment and occupational 
diversity. Yet even with these positive changes, the wage gap continued to grow, 
becoming two and a half times larger by 2002. Was the erosion of black women's 
relative wages primarily due to white women “moving up” to better jobs, or black 
women taking the “down escalator” to increasingly bad jobs during the post- 
industrial transition (McBrier and G. Wilson 2004)? 


The Effects of Economic Restructuring 


The growth in earnings inequality in the United States has been broadly investigated,
generally focusing on the effects of deindustrialization on wages (Bernhardt et
al. 2001; Card and DiNardo 2002; Couch and Daly 2002; Mishel, Bernstein and 
Schmitt 1997; Morris and Western 1999). Deindustrialization includes myriad 
features that affect wages, particularly for the less skilled, including an overall 
decline in manufacturing employment, the movement of manufacturing jobs 
from inner cities to metropolitan areas, and the replacement of manufacturing 
jobs with service sector employment (Bernhardt et al. 2001; Massey and Denton 
1992; Wilson 1990). Studies show that the shrinking manufacturing sector drove 
down the wages of less-skilled men in the 1970s and 1980s, disproportionately 
affecting black male earners (Darity and Myers 1998; Massey and Denton 1992). 
Yet deindustrialization did not affect women similarly because of their relatively 
low representation in the manufacturing industry and their low pay within the 
industry due to their predominance as operatives. Manufacturing jobs were never
“good jobs” for women. Thus, rather than disadvantaging women workers, the
changing industry mix resulted in median wage gains for both black women and 
white women (Bound and Dresser 1999; Newsome and Dodoo 2002). 

The effect of industrial restructuring on the wages of Americans has often been 
examined in terms of job creation — that is, are jobs becoming better or worse 
for American workers? The good jobs/bad jobs debate examines changes in job
quality resulting from the transition to a service economy, encompassing several
aspects of earnings and occupations, including the extent to which job growth is in
good or bad jobs; the effects of changes in the industry and occupation mix on
the distribution of jobs into “good” and “bad”; and the effect of weakening
wage-setting institutions (Bernhardt et al. 2001; Mishel, Bernstein and Boushey 2003).

Figure 1. Wage Trends Among Women Workers, 1980-2002

The weakening of wage-setting institutions has been associated with the growth
in wage inequality in the United States (Mishel, Bernstein and Schmitt 2001; 
Morris and Western 1999). However, the effect of unionization on women’s wages 
is complex because their union membership is highly associated with public sector 
and professional occupations (Newsome and Dodoo 2002). Generally, changes 
in unionization have had little effect on women’s wage inequality because of their 
lower membership, smaller decline and greater likelihood of membership in pro- 
fessional unions (Card 2001; DiNardo, Fortin and Lemieux 1996). 

The devaluation of the minimum wage has also been implicated in the growth 
in overall wage inequality in the United States (Mishel et al. 2001). Some research 
finds that the decline in the value of the minimum wage during the 1980s explained
a significant proportion of the growth in wage inequality (DiNardo et al.
1996; Lee 1999), while others find a negligible effect on women’s wages except 
among particular subsets of women workers such as young women living in the 
South (Bound and Dresser 1999). A recent investigation questions the association
between the declining minimum wage and wage inequality, finding a strong and
robust correlation between the value of the minimum wage and wage inequality in
the top half of the wage distribution (Autor, Katz and Kearney 2008). The effect
of the minimum wage across the wage distribution, rather than just at the lower
end, implies a spurious relationship, perhaps representing broader “political
pressures associated with changing labor market conditions” (Autor et al. 2008:311).

During the 1980s and 1990s, the wage distribution became more polarized with 
jobs increasingly bifurcated into “good jobs” and “bad jobs,” resulting in greater 
wage inequality (Autor, Katz and Kearney 2006; Carnevale and Rose 1998; Farber 
1997; Kalleberg, Reskin and Hudson 2000; Meisenheimer II 1998). Within this 
divergence, there is a general consensus that while there has been an increase in both
“lousy and lovely” jobs, the shift to a service economy has created more good jobs
than bad (Goos and Manning 2007). Thus the bifurcation of jobs coupled with 
women’s increased educational attainment and labor force attachment contributed 
to the growth in wage inequality among black and white women as those who were 
well-positioned were able to take advantage of the growth in good jobs, leading to 
strong wage gains (Carnevale and Rose 1998; Farber 1997; Kalleberg et al. 2000). 


The Demand for Skill 


One of the most common explanations for an increasingly polarized job distri- 
bution is skill-biased technological change. The theory posits that the growing 
technical demands of jobs have resulted in greater demand and higher pay for 
skilled workers and lesser demand and lower pay for unskilled workers leading to 
greater wage inequality. However, the evidence supporting this argument is weak 
(Card and DiNardo 2002; Morris and Western 1999). Contrary to skill-biased 
technological change explanations, the employment and wages of managers, other 
sales, and financial sales occupations, not technical occupations, are primarily 
responsible for growing inequality (Mishel et al. 2001).

In one of the better known exchanges regarding the issue, Krueger (1993) finds 
support for the skill-biased technological change argument, claiming that com- 
puter use at work leads to higher wages. DiNardo and Pischke (1996) countered, 
showing that the use of a pencil also leads to higher wages. Thus jobs in the office, 
particularly managing services and people, not technically demanding jobs, have 
become far more lucrative. Over the 1980s and 1990s, women increased their 
representation in these occupations with the proportion of women in profes- 
sional (excluding teaching) or managerial positions growing from 12 percent to 
28 percent between 1970 and 2000 (Katz, Stern and Fader 2005). 


Education


The premium for skill, particularly a college degree, grew markedly during the 1980s 
and 1990s (Autor, Katz and Kearney 2008; Goldin and Katz 2007; Gottschalk 
1997; Levy and Murnane 1992; Morris and Western 1999). While the growth 
in the degree premium among men was largely due to declining wages for the 
less-educated (Bernhardt, Morris, and Handcock 1995), the growth in the degree 
premium among women was more clearly due to wage gains for degree holders. 
Although women had median wage gains across the board, degree holders had the 
greatest gains (Katz and Autor 2008). Studies find that differential college degree at- 
tainment contributed significantly to the black-white wage gap among women with 
estimated contributions ranging from 25 to 40 percent of the wage gap (Antecol and 
Bedard 2002; Blau and Beller 1992; Bound and Dresser 1999; Kim 2002). 


Cognitive Skills 


In addition to growth in the college premium, some researchers claim that a grow- 
ing return to cognitive ability increased wage inequality (Herrnstein and Murray 
1994). Few studies, however, find much of a contribution after controlling for 
other human capital characteristics (Bowles and Gintis 2002; Cawley, Heckman 
and Vytlacil 1999; Farkas 2003; Murnane 1995); instead, cognitive skills primarily 
express the likelihood of educational attainment. Relatively little work examines 
whether the return to cognitive skill has grown over time, but the evidence avail- 
able indicates a small to non-existent increase since the 1970s (Bowles and Gintis 
2002; Murnane 1995). Overall, cognitive skill explains a relatively small portion 
of the variation in earnings (2 to 3 percent) after controlling for human capital 
(Cawley et al. 1999; Kerckhoff, Raudenbush and Glennie 2001). With such low 
explanatory power, growth in the premium could not contribute significantly to 
increased wage inequality among black and white women. 


Soft Skills 


Soft skills are a collection of personality traits, work habits and communication skills 
that employers seek in potential employees. Soft skills have been variously described 
as attitude, friendliness, communication ability, teamwork and motivation (Bowles
and Gintis 2002; Moss and Tilly 2001). Researchers find that employers typically 
prioritize soft skills over any formalized technical skill or credential when seeking 
an employee (Bowles and Gintis 2002; Moss and Tilly 1996; Moss and Tilly 2001). 
There is some evidence that a growing proportion of jobs require soft skills, ei- 
ther because individuals must have contact with customers or they work in teams. 
For example, in the past, a segment of clerical workers were employed in typing 
pools where they specialized in one task with relatively little interaction. With 
the advent of word processing technology, an increasing proportion of clerical 
workers performed varied tasks including word processing, answering phones and 
scheduling appointments, tasks requiring interpersonal skills. In addition, work
reorganization in the 1980s and 1990s eliminated levels of hierarchy, requiring 
greater communication and supervisory skills among lower level workers as they 
worked in teams and made more decisions (Cappelli 1996; Cappelli et al. 1997). 
As jobs demanded greater interaction, employers sought workers who possessed 
“soft skills.” The rising demand for soft skills may have particularly disadvantaged 
black workers as employers attempting to evaluate subjective, culturally bound
skills such as attitude, friendliness and motivation relied on race as a proxy for soft
skills (Bowles and Gintis 2002; Moss and Tilly 2001).


Data 


For this analysis, I use the Merged Outgoing Rotation Group, derived from the 
Current Population Survey. The CPS is a monthly household survey of 50,000- 
60,000 households conducted by the U.S. Department of Labor’s Bureau of Labor 
Statistics in order to measure labor force participation and employment. Each 
household that enters the CPS is interviewed for four months, not interviewed for 
eight months, then interviewed again for four months. Because the CPS adds new 
households every month, in any one month, one quarter of the sample is rotating 
out — either for an eight-month break or because it is the end of the 16-month sur- 
vey period. The MORG is comprised of these outgoing households. The MORG 
is optimal for investigating the black-white wage gap among women because of 
its large sample size, representative sample, reliable earnings data and consistency 
in questioning throughout the observation period. Its major shortcoming is the 
lack of a measure of work experience. In addition, MORG data do not include a 
measure of union membership until 1983 or children in the household until 1984. 
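
The rotation pattern described above can be checked with a small sketch. It assumes, purely for illustration, that one equal-sized cohort of households enters every month. The function names and month numbering are hypothetical and are not part of any CPS tooling.

```python
# Sketch of the CPS 4-8-4 rotation. Month numbering is arbitrary.

def interview_months(entry_month):
    """Months in which a household entering at entry_month is interviewed:
    four months in, eight months out, four months in."""
    first_spell = [entry_month + i for i in range(4)]
    second_spell = [entry_month + 12 + i for i in range(4)]
    return first_spell + second_spell

def is_outgoing(entry_month, month):
    """True if the household gives its 4th or 8th interview this month,
    i.e., it is about to rotate out (for the break or for good)."""
    months = interview_months(entry_month)
    return month in (months[3], months[7])

def outgoing_share(month, first_entry=0):
    """Share of interviewed households that are outgoing in a given month,
    assuming one equal-sized cohort enters every month (an assumption)."""
    cohorts = [e for e in range(first_entry, month + 1)
               if month in interview_months(e)]
    outgoing = [e for e in cohorts if is_outgoing(e, month)]
    return len(outgoing) / len(cohorts)

# Any month at least 15 months after the first entry is in steady state.
print(outgoing_share(month=24))
```

Under these assumptions eight cohorts are interviewed in any steady-state month and two of them are outgoing, which is the one-quarter share that makes up the MORG.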

I use a sample of black and white women, ages 25-54, who worked for pay 
and were not self-employed, and compare the observation years 1980 and 2002. 
Although there are data available after 2002, the occupation and industry codes 
were changed substantially in 2003. 

For the dependent variable in linear regressions, I use the natural log of hourly 
wages deflated to 2000 dollars using the Personal Consumption Expenditures 
index. Human capital is measured using age and educational attainment. Age is 
used as a proxy for potential experience and is a continuous variable. Education is
derived from highest grade completed, and although this results in some overesti- 
mation of diplomas and degrees (Frazis, Ports and Stewart 1995), the effect should 
remain constant over the observation period. Because the influence of educational 
attainment is non-linear, dummy variables best capture the changing effect of 
educational attainment. Educational attainment is expressed as: less than high 
school, high school, some college and a college degree, with high school as the 
omitted category. Marital status is coded as married, never married and previously 
married (divorced, separated or widowed) with married as the omitted category. I 
also include dummy variables indicating region, rural residence, part-time work 
(less than 35 hours), public sector work and hourly work. 
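
As a minimal sketch of the variable construction described above, the following shows a log real wage and education dummies with high school omitted. The index values are hypothetical placeholders, not actual Personal Consumption Expenditures figures, and the function and category names are assumptions made for illustration.

```python
import math

def log_real_wage(nominal_wage, pce_year, pce_base):
    """Natural log of the hourly wage expressed in base-year dollars."""
    real_wage = nominal_wage * pce_base / pce_year
    return math.log(real_wage)

EDUCATION_LEVELS = ["less_than_hs", "high_school", "some_college", "college"]

def education_dummies(level, omitted="high_school"):
    """0/1 indicators for every education category except the omitted one."""
    if level not in EDUCATION_LEVELS:
        raise ValueError(f"unknown education level: {level}")
    return {f"edu_{k}": int(k == level)
            for k in EDUCATION_LEVELS if k != omitted}

# Hypothetical index values, NOT actual PCE figures.
PCE_1980, PCE_2000 = 50.0, 100.0

y = log_real_wage(5.00, PCE_1980, PCE_2000)   # $5.00 in 1980 -> 2000 dollars
d = education_dummies("college")
print(y, d)
```

A respondent in the omitted category gets zeros on every dummy, so each coefficient is read relative to high school graduates.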

Occupation is divided into nine categories: professional/technical, managerial,
sales, clerical, service, craftsmen, operative, labor, and farm. Industry is divided into 
11 categories: agriculture/mining/construction, manufacturing, transportation and 
communications, finance/insurance/real estate, wholesale and retail sales, business
and repair services, health care, education/social services, public administration, per- 
sonal service/entertainment, and private household. I separate health care from edu- 
cation and social services because the health care industry is a significant employer 
of women. I also retain private household industry as a separate category because, in 
1980, 6 percent of black women were still employed in private households. Although 
using three-digit occupation and industry codes would yield more detailed informa- 
tion about the specific outcomes of black women and white women as a result of the 
transition to white-collar work, the purpose of this analysis is to examine the effects 
of the transition to white-collar occupations more broadly. 


Methods 
Regression 


Ordinary least squares regression is a method of linear regression that estimates
the effect of independent variables on a dependent variable by minimizing the
sum of squared residuals. I examine earnings trends by estimating regressions
separately for black women and white women in 1980 and 2002 using the
following model:

$\ln Y = Xb + e$

where ln Y is the natural log of observed hourly wages, X is a vector of variables
measuring human capital and job characteristics, b is a vector of coefficients, and
e is a random error term.
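
To make the estimator concrete, here is a minimal pure-Python sketch of ordinary least squares via the normal equations. It is not the software used for the analysis. The toy data, the coefficient values and the helper names are all hypothetical.

```python
def transpose(m):
    return [list(col) for col in zip(*m)]

def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def solve(a, rhs):
    """Solve a @ x = rhs by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    aug = [row[:] + [r] for row, r in zip(a, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]

def ols(X, y):
    """Coefficients b minimizing the sum of squared residuals of y on X,
    obtained from the normal equations (X'X) b = X'y."""
    Xt = transpose(X)
    XtX = matmul(Xt, X)
    Xty = [sum(x * v for x, v in zip(row, y)) for row in Xt]
    return solve(XtX, Xty)

# Hypothetical toy data: columns are intercept, age, college dummy.
X = [[1, 25, 0], [1, 35, 1], [1, 45, 0], [1, 54, 1], [1, 30, 1]]
true_b = [1.5, 0.01, 0.4]
# Log wages generated without an error term, so OLS should recover true_b.
ln_wage = [sum(b * x for b, x in zip(true_b, row)) for row in X]

b_hat = ols(X, ln_wage)
print([round(b, 4) for b in b_hat])
```

Estimating separately by group and year, as described above, would amount to calling `ols` once on each subsample rather than pooling the data.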


Results 


From 1980 to 2002, women increased both their labor force participation and their
median wage. Table 1 indicates that the labor force participation of black and white
women grew similarly, especially as full-time workers. Both black women and white
women had median wage gains with the bulk of the growth occurring from the
mid-1990s to 2002, similar to wage growth among all U.S. workers (Mishel, Bernstein
and Allegretto 2007; see Figure 1). The median wage of women workers grew by
a third between 1980 and 2002 (from $9.60 per hour to $12.75 per hour) while
the standard deviation doubled, growing from approximately $6 to $12 per hour.
Within this broader wage distribution, black women’s relative position declined.

Table 1: The Labor Force Participation of Women

                             White                         Black
                      1980          2002            1980          2002
Full-time             .45           [illegible]     [illegible]   .63
Part-time             .15           .18             .10           .09
Not in labor force    .36           [illegible]     .32           .22
Unemployed            .03           .03             .07           .06

Note: CPS Merged Outgoing Rotation Group data, weighted

Change in the distribution of women across human capital and job characteris- 
tics can be expressed as percent change (i.e., change relative to a previous time pe- 
riod) or as percentage point change (i.e., the absolute change for a population). I use 
percentage point change in this analysis because percent change does not adequately 
capture the effect of compositional change on the growth in inequality. For example, 
black women had a slightly greater percent growth in degree attainment from 1980 
to 2002 (48% vs. 46%), yet in absolute terms, black women fell behind, gaining 
10 percentage points in degree attainment to white women’s 15 points resulting in 
a wider gap in degree attainment. By 2002, a third of white women had a college 
degree while only a fifth of black women did. Although the growth in degree at- 
tainment among black women is encouraging, in order to maintain their relative 
position, their growth would have had to exceed that of white women. 
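The distinction between percent change and percentage point change can be made concrete with a small sketch. The shares below are hypothetical values chosen for illustration, not figures from the article; they show how a group can have the larger percent growth while the absolute gap still widens.

```python
def percent_change(start, end):
    """Change relative to the starting value, in percent."""
    return (end - start) / start * 100.0


def percentage_point_change(start, end):
    """Absolute change between two shares, in percentage points."""
    return (end - start) * 100.0


# Hypothetical (start, end) shares of degree holders; not taken from the article.
group_a = (0.20, 0.30)
group_b = (0.10, 0.16)

for name, (start, end) in (("A", group_a), ("B", group_b)):
    print(name,
          round(percent_change(start, end), 1), "percent;",
          round(percentage_point_change(start, end), 1), "points")

# Gap between the groups at the start and at the end, in percentage points.
gap_start = percentage_point_change(group_b[0], group_a[0])
gap_end = percentage_point_change(group_b[1], group_a[1])
print("gap:", round(gap_start, 1), "->", round(gap_end, 1), "points")
```

In this sketch group B grows faster in percent terms, yet it gains fewer percentage points than group A, so the gap between the groups grows.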

Table 2 shows the broad shifts in the distribution of black and white women 
among human capital and job characteristics over the observation period. Both 
black and white women workers experienced general occupational upgrading. The 
proportion of both black women and white women in clerical, service, and opera- 
tive occupations declined while the proportion in professional/technical, manage- 
rial, and sales occupations grew over the observation period. Industry shifts were 
less remarkable, the most notable being losses in manufacturing, growth in the 
business and repair service and health care industries, and, for black women, a 
decline in private household employment. Typically, the examination of labor 
market restructuring in the United States has focused on the effects of industrial 
shifts. Although changes in the industry mix can reflect a fundamental shift in the
types of jobs people perform (e.g., moving from manufacturing to health care), 
changes in occupation more clearly reflect a shift in the character of work. Women 
Table 2: Descriptive Statistics for Selected Variables

                                              White                            Black
                                  1980      2002      Change(a)    1980      2002      Change(a)
Characteristics of all Women
  Mean age                        38        40                     37        39
  Less than high school           [illeg.]  [illeg.]  [illeg.]     .35       [illeg.]  [illeg.]
  High school                     [illeg.]  [illeg.]  [illeg.]     .40       .34       -.06
  Some college                    .18       .31       .13          .15       [illeg.]  .18
  College                         .18       .30       [illeg.]     .10       .20       .10
Characteristics of Working Women
  Part time                       [illeg.]  [illeg.]  .05          [illeg.]  .19       .04
  Paid hourly                     [illeg.]  .56       .01          .62       .66       .04
  Public sector                   .20       [illeg.]  [illeg.]     .31       [illeg.]  -.05
Selected Occupations
  Professional/Technical          [illeg.]  .29       .07          .16       .21       .05
  Manager/Official                .08       .18       .10          .04       .12       .08
  Clerical                        [illeg.]  [illeg.]  -.13         .29       .24       -.05
  Sales                           .06       .10       .04          .02       .07       .05
  Operative                       .10       .04       -.06         .16       .08       -.08
  Service                         .14       .11       -.03         .29       .24       -.05
Selected Industries
  Manufacturing                   .18       .10       -.08         .19       .09       -.10
  Transportation/Communication    .05       .05       .00          .05       .08       .03
  Wholesale/Retail trade          .18       [illeg.]  [illeg.]     .10       [illeg.]  .03
  Finance/Insurance/Real estate   .09       .09       .00          .06       .07       .01
  Business/Repair services        .06       .11       .05          .04       .08       .04
  Health care                     .15       .18       .03          .19       .22       .03
  Education/Social service        .18       .19       .01          .19       .18       -.01
  Public administration           .05       .05       .00          .08       .09       .01
  Private household               .01       .01       .00          .06       [illeg.]  [illeg.]

Notes: CPS Merged Outgoing Rotation Group data, weighted
(a) Percentage point change within race from 1980 to 2002


experienced far greater shifts in occupational distribution relative to industry 
distribution over the observation period, illustrating a dramatic change in the 
tasks women performed at work. 

Overall, black and white women experienced similar trends in compositional 
change, but white women gained more in areas associated with wage gains such 
as college degree attainment. At the same time, black women experienced greater 
change in areas that would have adverse effects on wages (e.g., growth in propor- 
tion paid hourly and decline in public sector work). Interestingly, black women 
and white women had similar losses in employment in lower-paying occupations 
with approximately 20 percentage points fewer workers in clerical, operative, and 
low-end service occupations by 2002. 

In addition to their relative losses due to compositional shifts, black women’s 
relative wage growth also suffered. Table 3 shows that although both black women 
and white women experienced wage gains among many human capital and job 
Table 3: Median Hourly Wage for Selected Characteristics of Working Women

                                       White                 Black           Black-White Wage
                                  1980$     2002$       1980$     2002$     Gain Differential $
Median Hourly Wage                9.60      13.28       8.81      10.86     1.63
  Less than high school           7.60      7.87        6.72      7.24      -.26
  High school                     9.02      10.62       8.64      9.66      .58
  Some college                    10.56     12.55       10.08     [illeg.]  .94
  College                         12.96     18.51       [illeg.]  17.64     1.43
Job Characteristics
  Fulltime                        10.23     13.91       9.60      [illeg.]  1.68
  Parttime                        7.68      10.34       6.51      [illeg.]  1.45
  Salaried                        11.76     16.92       10.67     [illeg.]  [illeg.]
  Paid hourly                     8.35      10.96       7.87      9.66      .82
  Private sector                  9.60      12.55       8.06      10.14     .88
  Public sector                   11.30     15.03       10.56     [illeg.]  [illeg.]
Occupation
  Professional/Technical          13.44     18.02       13.44     15.60     2.42
  Managers/Officials              [illeg.]  [illeg.]    [illeg.]  [illeg.]  [illeg.]
  Clerical                        9.60      11.59       9.60      11.59     .00
  Sales                           7.68      10.86       7.30      7.85      2.63
  Operatives                      [illeg.]  [illeg.]    7.87      9.66      -.14
  Service work                    6.72      8.25        6.72      8.21      .04
Industry
  Manufacturing                   9.60      13.27       [illeg.]  10.33     1.84
  Transportation/Communication    12.48     14.85       12.87     12.53     2.70
  Wholesale/Retail trade          7.68      9.66        7.20      8.21      .97
  Finance/Insurance/Real estate   10.03     14.49       9.60      12.53     1.53
  Business/Repair services        10.56     14.49       9.08      11.47     1.54
  Health care                     10.56     14.49       8.83      10.55     2.21
  Education/Social service        10.67     [illeg.]    [illeg.]  [illeg.]  1.25
  Public administration           11.92     15.09       [illeg.]  [illeg.]  .68

Notes: CPS Merged Outgoing Rotation Group data, weighted; PCE deflated to 2000 dollars


characteristics, the dollar per hour gain of white women was generally greater. 
Column 3 expresses the disparity in wage gains in dollar terms, illustrating that 
wage gains were comparable among measures with a lower median wage and 
less similar among measures with a high median wage. For example, median 
wage gains were most similar among women with a diploma or less and among 
low-end service and clerical occupations and less similar among managers and 
professionals. The notable exception to this pattern was among sales occupations,
a typically low-paying field. White women’s wage gains were $2.63 per hour more 
than black women’s over the observation period, mainly due to the stagnation of 
black women’s median wage. 
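The wage gain differential reported in Table 3 can be reproduced with simple arithmetic: white women's 1980-2002 gain minus black women's gain. The sketch below uses three occupation rows whose median wages are legible in Table 3; the dictionary layout and the function name `gain_differential` are illustrative choices, not part of the original analysis.

```python
# Median hourly wages from Table 3 as (white 1980, white 2002, black 1980, black 2002).
table3_rows = {
    "Professional/Technical": (13.44, 18.02, 13.44, 15.60),
    "Clerical": (9.60, 11.59, 9.60, 11.59),
    "Sales": (7.68, 10.86, 7.30, 7.85),
}


def gain_differential(white_1980, white_2002, black_1980, black_2002):
    """White women's wage gain minus black women's wage gain, in dollars per hour."""
    white_gain = white_2002 - white_1980
    black_gain = black_2002 - black_1980
    return round(white_gain - black_gain, 2)


differentials = {name: gain_differential(*wages) for name, wages in table3_rows.items()}
for name, value in differentials.items():
    print(f"{name}: {value:.2f}")
```

For sales, the calculation gives the $2.63 differential cited in the text, driven mostly by the small gain in black women's median wage.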

Racial inequality in wage gains led to a shift in black women’s relative ranking. 
In 1980, black women had a wage advantage within certain human capital and 
job characteristic measures, outearning white women as degree holders, manag- 
ers, and within transportation and communication industries (see Table 3). In 
addition, black and white women earned similar wages as professionals, clerical
workers and in public administration. By 2002, however, white women’s median
wage exceeded black women’s within all human capital and job characteristics
except as clerical workers where black and white women had similar median
wages. The downward shift in the relative wages of black women across occupation
and industry, coupled with their growth in jobs paid hourly, implies that as the 
distribution of jobs changed, black women were relegated to less desirable jobs 
relative to white women workers. 


Regression 


In this analysis, regression models express the effect of human capital and job 
characteristics on wages. In order to examine differential trends, I estimate models 
separately for black and white women in both 1980 and 2002. Variables that esti- 
mate potential experience are commonly used in regression models although they 
are less accurate for women because of women’s weaker labor force attachment 
(Antecol and Bedard 2002). In this case, I regard the age premium as a signal of 
changing work experience among women rather than a change in the return to 
experience or age. Because women increased their labor force participation over 
the observation period, it should follow that age would be increasingly correlated 
with work experience leading to a greater return to age. The observed change in 
the effect of age on wages lends support to this supposition. Although the age 
premium for both black and white women was relatively small, it grew over the 
observation period. In the full model, 10 years of age resulted in a wage penalty of 
2 percent for white women in 1980, growing to a premium of 6 percent in 2002. 
Among black women, the premium grew from 3 to 5 percent. Thus the wage effects
of age grew 8 percentage points for white women and only 2 percentage points for
black women, implying that the wages of white women grew due to increasing work experience.
Research examining observed work experience among young women during the 
1980s supports this supposition (McCrate and Leete 1994). 

As expected, wage inequality due to educational attainment for both black 
and white women grew over the observation period, while the premium for 16 
or more years of education and the penalty for less than 12 years increased (see 
Table 4). From 1980 to 2002, the wage penalty for less than 12 years of school- 
ing grew by approximately 50 percent, ending at 15 percentage points for white 
dropouts and 20 percentage points for black dropouts after controlling for other 
human capital and job characteristics. The college wage premium grew from 15 
to 32 percent for white women and 21 to 31 percent for black women from 
1980 to 2002. Thus, although both black and white degree holders experienced 
considerable growth in their college wage premium, white women gained far 
more, reversing their 1980 ranking in median wage. 

Although the declining status of black degree holders is concerning, it is impor- 
tant to note that their relative loss is primarily due to white women’s remarkable 
clined relative to clerical work. In 1980, black women received a far greater premium
than white women for managerial occupations (22% and 14% respectively); by 2002, 
their positions reversed with white women’s premium becoming far greater (24% vs. 
16%). Trends in sales occupations also disadvantaged black women. As a growing 
proportion of women worked in sales occupations, black and white women fared 
differently with white women’s penalty becoming a 6 percent premium relative to 
clerical workers, while black women’s penalty grew from 2 to 12 percent. In addition, 
black women’s penalty for service and operative occupations, relative to clerical work, 
grew disproportionately. By 2002, then, changing rewards to occupations resulted in 
smaller rewards for black women in good jobs, 
and larger penalties for black women in bad 
jobs relative to clerical occupations. 

The relative wage losses of black women 
indicate an increasingly racialized distribution 
of jobs. At the same time that less educated 
workers lost value in the labor market, women 
with degrees became more valued. Yet even 
among degree holders, black women lost 
ground and, by 2002, white degree holders 
had a higher median wage. Coupled with 
their stronger wage growth, white women 
also made greater gains in degree attainment 
leading them to dominate white-collar jobs. 


Differences in Industry Distribution 


As black and white women moved into profes- 
sional, managerial and sales occupations, their 
rewards were increasingly disparate. The oc- 
cupation-industry categories in Table 5 reveal 
that, to a certain degree, black women and 
white women in sales occupations worked 
in different industries. Although the great 
majority of black and white women in sales 
occupations worked in the retail/wholesale 
trade industry, 17 percent of white sales- 
women worked in finance, insurance and real 
estate industries, while only 10 percent of 
black women did. Additionally, the propor- 
tion of black women sales workers in financial, 
insurance and real estate services declined 6 
percentage points from 1980 to 2002, twice 
the decline of white women workers. 


Notes: *p < .05 **p < .01 ***p < .001 (two-tailed tests); Dependent variable is log hourly wage; Unstandardized OLS
regression coefficients; Model 2 controls for part-time, paid hourly, and public sector; Occupation variables not shown
include craftsman, farm labor, and non-farm labor. Industries not shown include agriculture, personal and recreation
services, and private household. Analysis is restricted to black and white women, ages 25-54, working in the public and
private sector.
Reference category: Wholesale/retail trade
Reference category: Married
Reference category: Northeast
Reference category: Non-rural
Reference category: High school
Reference category: Clerical
Among managers and professionals, black and white women became more
similar in industry distribution over the observation period. The proportion of 
women working as managers in wholesale/retail trade declined precipitously, while 
both black and white women workers moved into health care and business and 
repair industries. A greater proportion of white women moved out of wholesale/ 
retail trade creating a more similar distribution among black and white women 
managers across industries by 2002. Like managers, women professionals in- 
creased their representation in health care and business and repair services. Both 
black and white women left the field of education and social services, but black 
women’s decline was far steeper, also leading to more similar industry distributions 
among professionals by 2002. 

Often, inequality is blamed on the growth in low-paid service work; however, 
the proportion of black and white women in service occupations declined slightly 
from 1980 to 2002 while their median hourly wage remained similar. Their industry
distribution also remained similar except for changes in the proportion of
women working in health care. By 2002, fewer white women (26%) and more 
black women (44%) service workers were employed in the health care industry 
relative to 1980. Although almost half of black women service workers were 
employed in the health care industry, this helped rather than hurt their wages. 
The median wage for both black and white women was higher in the health care 
industry relative to service occupations as a whole (see note 2). Even among low-paid workers,
then, industry shifts cannot explain the growth in wage inequality. 

Differential rewards to occupation, then, cannot be explained by differential 
industry distribution as industry distribution among “growth occupations” be- 
came more similar over time. Only within sales occupations was there a clearly 
differential industry distribution. Black women’s loss of representation in finance, 
insurance, and real estate and manufacturing and increased representation in 
wholesale/retail sales helps explain the growth in their wage penalty (relative to 
clerical work) in sales occupations. However, relatively few black women worked 
in sales (7%) while a third of black women worked as professionals (21%) and 
managers (12%). As a growing number of women became managers and profes- 
sionals, then, black and white women became more similar in industry distribu- 
tion, making it a weak explanation for the growth in wage inequality. 


Industrial Restructuring and Occupational Mobility 


Changing human capital, both in education and experience, and shifting rewards 
to these characteristics resulted in a reordering of women workers, with white 
women securing an overall higher position in the labor queue relative to black 
women. At the same time, the transition to a post-industrial economy resulted 
in a changing occupation and industry mix that offered different types of jobs 
to workers (Atkinson 2005; Bernhardt et al. 2001; Morris and Western 1999). 
Women were well-poised to succeed in the new “office economy,” as they were 
already disproportionately located in service sector jobs. 

The service sector contains a wide array of jobs, both “good” and “bad.” A 
good job is characterized as permanent, salaried, fulltime and with a median wage 
higher than the median for the group as a whole, while a bad job is commonly 
defined as temporary, paid hourly, parttime and with a median wage lower than 
the group median wage (Atkinson 2005; Farber 1997; Kalleberg et al. 2000; 
Meisenheimer II 1998; Piore 1970; Reskin 1991). Although part-time work is 
associated with a lower median wage (see Table 3), many women choose part-time 
work due to family obligations, thus it is excluded from my conceptualization 
of “bad jobs” in this article. Examining job characteristics in terms of lower- 
than-median and higher-than-median wages gives credence to using the above 
indicators to characterize good and bad jobs. Job characteristics with a lower-than- 
median wage include work paid hourly, low-end service occupations and work in 
Figure 2. Change in Occupational Distribution of Workers
the wholesale/retail trade industry, all characteristics commonly associated with
“bad jobs.” Full-time positions, salaried positions and professional and managerial 
occupations all have a higher-than-median wage (see Table 3) and are common 
criteria in defining “good jobs.” Even among good jobs, however, black and white 
women fared differently as evidenced by their differential pay as professionals, 
managers and salaried workers. 

The rise of the service sector resulted in a changing occupational mix where 
white-collar occupations became more highly rewarded while the wages of low- 
skill occupations such as service workers and operatives stagnated or declined 
(Carnevale and Rose 1998; Farber 1997; Queneau 2006). Figure 2 illustrates the 
change in the occupational mix for all workers, and separately for white women 
and black women. Relative to the total population, the proportion of black and 
white women working as professionals and managers grew disproportionately while 
they also disproportionately left clerical and operative occupations. White women 
had both the greatest growth in occupations with high median wages during the 
observation period and the greatest movement out of clerical occupations. Both 
black and white women also moved out of low-wage service work while, among the
general population, the proportion of individuals in service work increased slightly. 
The growth in the proportion of both black women and white women working in 
sales occupations was similar to the growth in the total population. 

The great majority of job mobility for both black and white women, then, can
be described as aggregate occupational upgrading; that is, as a group, they moved 
to occupations with higher median wages. Generally, women moved from bad jobs 
as operatives and service workers, and neutral jobs as clerical workers, into good 
jobs as managers and professionals. The growth in sales occupations was the only 
growth in a “bad job,” that is, in an occupation with a median wage lower than the 
overall median. The median wage for white women in sales was 18 percent lower 
than their overall median wage while, for black women, the median wage was 28 
percent lower. However, far more women moved out of low-paying operative and 
service occupations than moved into low-paying sales occupations. Overall, then,
as the proportion of good jobs grew in the occupational mix (Farber 1997), women 
workers were especially advantaged, experiencing aggregate occupational upgrading. 
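The comparison of an occupation's median wage with a group's overall median, used above to label sales a "bad job," can be sketched as follows. The wage figures are the 2002 medians from Table 3; the function names are hypothetical.

```python
def percent_below_median(occupation_median, overall_median):
    """How far an occupation's median wage falls below the group's overall median, in percent."""
    return (overall_median - occupation_median) / overall_median * 100.0


def is_low_wage(occupation_median, overall_median):
    """True when the occupation's median wage is lower than the overall median."""
    return occupation_median < overall_median


# 2002 median hourly wages from Table 3 (PCE deflated to 2000 dollars).
white_overall, white_sales = 13.28, 10.86
black_overall, black_sales = 10.86, 7.85

white_gap = percent_below_median(white_sales, white_overall)
black_gap = percent_below_median(black_sales, black_overall)
print(round(white_gap), round(black_gap))
```

Rounded to whole percentages, these gaps match the 18 percent and 28 percent figures reported in the text.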

Although both black and white women increased their representation as profes- 
sionals and managers, the character of jobs in these broad occupational categories 
varies widely. Professional occupations include physicians and pre-school teach- 
ers while managerial occupations include college administrators and retail store 
managers. Although both black and white women experienced greater aggregate 
occupational upgrading than the total population of workers, within occupations, 
black and white women did not fare equally. Between 1980 and 2002, the median 
wages of white women professionals, managers, and sales workers gained 34 percent, 
44 percent, and 42 percent, respectively, while black women’s median wages gained 
16 percent, 22 percent and 8 percent, respectively. As women moved into potentially 
lucrative occupations, then, white women reaped far greater benefits. 


Were Black Women Increasingly in Bad Jobs? 


Although both black and white women experienced aggregate occupational up- 
grading, some share of workers must still fill less desirable jobs. As more white 
women improved their occupational status, were black women increasingly relegated
to bad jobs? If a bad job is defined as a job in a low-paying occupation such
as service work, then black women did not increasingly take bad jobs. Additionally, 
other characteristics of bad jobs such as working in wholesale and retail trade were 
disproportionately held by white women across the observation years (see Table 
2). The only indication that bad jobs increased for black women was the growth 
in the proportion of workers paid hourly relative to salaried. Black women were 
far more likely to be paid hourly in both 1980 and 2002; however, the proportion
grew at a slightly higher rate than white women’s over the observation period. By 
2002, two-thirds of black women were paid by the hour, 10 percentage points 
greater than white women (see Table 2). 
Generally, evidence indicates that black women were not increasingly relegated
to bad jobs; instead, white women moved into better jobs. Growth in inequality
was not due to the displacement of black women from their previous good jobs; 
instead, as the occupational mix changed, white women garnered a bigger share 
of newly created good jobs resulting in a relative, not absolute, decline in black 
women's wage and job status. 


Were Wages Within Occupations Increasingly Racialized? 


The proportion of black and white women in professional, managerial and sales 
occupations grew over the observation period, but within these occupations, black 
women’s relative wages fell. Figure 3 shows that, in 1980, the proportion of black
women workers in the bottom quartile of the combined wages of women was close 
to 25 percent across the selected occupations. By 2002, however, far more than 
25 percent of black women were in the bottom quartile of women's wages with 
almost one half of sales workers residing in the bottom quartile of wages. Among 
earners in the top quartile, black women were equitably represented across the 
selected occupations in 1980, but by 2002, their position had declined (see note 3). Black
women were much less likely to make good wages as managers and professionals, 
and especially unlikely to make good wages as sales workers. Among “growth” oc- 
cupations, then, black women lost ground across the wage distribution, increasing 
their presence among low earners and decreasing their likelihood of earning wages 
in the top quartile of the distribution. 
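The quartile comparison described above can be sketched as a small calculation: find the 25th and 75th percentiles of the pooled wage distribution of all women, then compute the share of one group at or beyond each cutoff. The wages below are made up for illustration, and the function name `quartile_shares` is hypothetical; a share near .25 in each tail would indicate equitable representation.

```python
import numpy as np


def quartile_shares(group_wages, pooled_wages):
    """Share of a group in the bottom and top quartiles of the pooled wage distribution."""
    q1, q3 = np.percentile(pooled_wages, [25, 75])
    group = np.asarray(group_wages, dtype=float)
    return {
        "bottom": float(np.mean(group <= q1)),
        "top": float(np.mean(group >= q3)),
    }


# Made-up hourly wages: the pooled distribution of all women in an occupation,
# and one group's wages drawn from within that pooled distribution.
pooled = np.arange(1, 101)
group = [5, 10, 20, 30, 40, 50, 60, 80]

shares = quartile_shares(group, pooled)
print(shares)
```

In this made-up example the group is overrepresented in the bottom quartile and underrepresented in the top quartile, the pattern Figure 3 reports for black women in 2002.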


Conclusion 


The growth in the black-white wage gap among women can be explained both 
by women’s inroads into the labor force and by broad labor market restructuring. 
Judging from their wage growth, as women increased their educational attain- 
ment and labor force attachment, they became more valued workers (Mishel et al. 
2003; Padavic and Reskin 2002). At the same time, industrial restructuring and 
the rise of the “office economy” coupled with a greater supply of jobs requiring 
a college degree uniquely advantaged women (Carnevale and Rose 1998; Farber 
1997; Morris and Western 1999). With better opportunities for women workers, 
one would expect aggregate occupational upgrading to equally improve the wage 
outcomes of black and white women. Yet the shift in occupational distribution 
benefited white women far more than black women, partially because more white 
women moved into professional and managerial positions where their greater
educational attainment and growing labor force attachment helped them benefit
from new, potentially lucrative opportunities.


Did Discrimination Increase? 


The Declining Relative Status of Black Women Workers • 1851

Figure 3. Black Women’s Wage Distribution among Women’s Wage Quartiles

Although educational attainment contributed to white women’s greater success in
the new “office economy,” the majority of black-white wage inequality is
unexplained by educational difference (Antecol and Bedard 2002; Bound and Dresser
1999; Kim 2002). The marked growth in wage inequality within occupations 
shows that as professionals, managers and sales workers, black women were in- 
creasingly in the bottom quartile of women's wages. In addition, as managers and 
sales workers, jobs dependent upon social interaction, black women became much 
less likely to earn wages in the top quartile of women’s wages. Although it may not 
be surprising that black women face inequities in the labor market, it is surprising 
that their disadvantage grew substantially after the 1970s. 

Sociologists have often described differential rewards to occupations, industries 
and human capital as discrimination; however, allocating wage inequality to dis- 
crimination is not straightforward (Blau 1984; Cancio, Evans and Maume 1996; 
Padavic and Reskin 2002). It is difficult to believe that overt discrimination (i.e., the 
unwillingness to hire a black worker) has increased since 1980. Instead, the changing 
nature of jobs may have increased discrimination against black workers as more jobs 
demanded “soft skills” (Browne 2000; Moss and Tilly 1996; Moss and Tilly 2001). 

As jobs become available, employers must engage in a hiring process employ- 
ing formal and/or informal methods to recruit and screen potential employees. 

Generally, employers do not formally test employees’ skill levels. Instead, they
use a variety of proxies that signal desired skills such as degrees and certifications
(Farkas et al. 1997). Employers also use proxies that do not directly reflect skills, 
but reflect the probability of a skill. The judgment of skill using characteristics 
not directly related to employment such as age, sex and race is termed statistical 
discrimination (Altonji and Blank 1999; Padavic and Reskin 2002; Piore 1970). 
Employers can choose not to hire black workers believing they are less likely to 
possess desired skills (Bertrand and Mullainathan 2004; Kennelly 1999; Moss 
and Tilly 2002), or choose to hire women believing they are more likely to be 
nurturing and possess “people skills” (Skuratowicz and Hunter 2004). The greater
demand for soft skills such as attitude, personality, appearance and communica- 
tion ability increases racial discrimination because they are highly subjective and 
culturally defined (Browne 2000; Moss and Tilly 1996, 2001). With imperfect 
information, employers are more likely to discriminate based on ascriptive
characteristics; in this case, using race as a proxy for “people skills” (Altonji and Blank
1999; Becker 1971; Kennelly 1999; Padavic and Reskin 2002).

Hiring is one of a variety of factors that influences work and wages. However, 
hiring is crucial to wage outcomes both because it is the point where employers have 
the least information about workers, and because starting wages influence wage and 
career trajectories for workers (Bernhardt et al. 2001). For example, young black 
women workers accumulate less work experience in their early work years, leading 
to flatter wage trajectories. In addition, black degree holders’ lower starting wage 
coupled with a flatter wage trajectory increases wage inequality over time (Alon and 
Haberfield 2007). As job stability decreases, workers experience a greater number 
of “hires” over their work life (Bernhardt et al. 2001). On average, women workers 
report five to six jobs from ages 23 through 36 (Bureau of Labor Statistics 2002),
leaving them vulnerable to statistical discrimination at each point of hire. 

Black Americans have suffered discrimination in the United States, both his- 
torically and across social institutions (Massey and Denton 1992; Wilson 1990). 
Although in the late 1970s, wage discrimination against black women was less 
evident, a relatively narrow wage distribution and offsetting upward influences on 
black women’s wages may have obscured discrimination. When women’s wages 
were more similar, there was little room for differentiation, but as the wage dis- 
tribution widened, racial sorting became more evident. From 1980 to 2002, the 
standard deviation of women’s hourly wages doubled, growing from
$6 to $12 per hour, leaving far more room for stratification based on both race 
and educational attainment. Because occupations with higher mean wages have 
greater black-white wage inequality (Huffman 2004), it follows that as women 
moved into occupations with higher mean wages, racial stratification would follow. 

In addition to a wider wage distribution, factors that upwardly influenced black 
women’s wages have weakened. Historically, black women’s stronger labor force at- 
tachment increased their standing in the labor queue, helping to equalize the wages 
of black and white women within educational levels (Blau and Beller 1992; Bound 
and Dresser 1999; Corcoran 1999). In addition, the high proportion of black 
women, especially professionals, in the public sector protected their wages from 
greater discrimination found in the private sector (Bernhardt et al. 2001; Katz et al. 
2005). During the observation period, these protective factors weakened, making 
labor market discrimination more evident. Even black women with the greatest 
skills experienced downward mobility that was less explained by human capital 
and job characteristics typically found to protect workers from job loss (McBrier 
and G. Wilson 2004). Historically, black women have faced discrimination in the 
labor market, yet during the 1960s and 1970s, white women’s weaker labor force 
attachment and limited opportunities for women workers resulted in a compressed 
wage distribution that masked inequality. As women diversified their occupational 
attainment and white women became more similar to black women in labor force 
attachment, racial discrimination became more apparent. 

Did discrimination increase, then? If discrimination is defined as employers’ 
reluctance to hire black employees based on race, then possibly. One would expect 
that as the demand for “soft skills” grew, employers were more likely to rely on race 
to signal these difficult-to-measure skills. If discrimination is defined as lower aver- 
age pay for a group that is unexplained by job characteristics or human capital, then, 
discrimination grew between 1980 and 2002 (Bound and Dresser 1999; Kim 2002). 


Future Research 


The purpose of this article was to examine the differential effects of broad restructur- 
ing on women’s wages. The finding that white women disproportionately benefited 
from the transition to a service economy calls for further investigation of the effects 
of aggregate occupational mobility using finer occupational categories, quantified
by median wage or occupational prestige score. With this scaled value, we could bet- 
ter understand to what extent black women's aggregate occupational mobility was 
horizontal or downward as they moved into traditionally white-collar occupations. 

In addition, a portion of the aggregate occupational upgrading experienced by 
women could indicate a change in job title rather than an actual improvement in 
occupation. For example, the vague category, “administrative and managerial occu- 
pations, not elsewhere classified” employed the greatest number of white and many 
black women in 2002, yet it is unclear whether this indicates a clerical occupation 
that has been retitled, or whether this position fundamentally differs from clerical 
occupations in authority, tasks and room for advancement (Jacobs 1992). 

And finally, the emergence of black-white wage inequality among degree holders 
is troubling because educational attainment is often the focus of policies designed 
to “level the playing field” for historically disadvantaged groups. Although the
growth in inequality is primarily explained by white women’s remarkable gains, 
any decline in black degree holders’ relative position warrants further investigation 
in order to ensure equal opportunity in the labor market for black women. 


Notes 
1. PCE deflated to 2000 dollars 


2. In 2002, the median wage for black women and white women service workers in the 
health care industry was $8.69 and $9.17, respectively. Median wages are derived 
from CPS MORG data, weighted; PCE deflated to 2000 dollars. 


3. White women’s wage distribution quartiles remained similar between 1980 and 2002; 
results available from the author upon request. 


Societal Responses to Endemic Terror:
Evidence from Driving Behavior in Israel 


Guy Stecklov, Hebrew University 
Joshua R. Goldstein, Max Planck Institute for Demographic Research 


In this article, using data on traffic volume and fatal accident rates in Israel from
2001 to 2004 (a period spanning much of the Second Intifada), we examine the
population-level responses to endemic terror to uncover whether societies become 
habituated so that the response weakens following repeated attacks or whether they 
become increasingly sensitized so subsequent attacks have a greater impact. Our 
analysis, using distributed-lag time series models, supports earlier findings while 
highlighting the persistence of the response to terror attacks even several years into 
the violence. There are, however, signs that the reaction to terror has accelerated. This 
shift, which is not naturally seen as evidence for either habituation or sensitization, is 
suggestive of social learning of norms over time. 








Introduction 


There has been a dramatic increase in studies focusing on the implications of terror 
following the September 11th attacks. Far-ranging consequences have been shown
to result from such one-time attacks—from rises in population-level measures of 
stress and anxiety (Schlenger et al. 2002; Schuster et al. 2001; Verger et al. 2004), 
to increased drinking and smoking (Vlahov et al. 2002, 2004), and deterioration 
in observable health outcomes such as cardiovascular disorders (Feng et al. 2006; 
Steinberg et al. 2004). A broader set of studies, often focusing on Israel, have also 
detailed the broader macroeconomic (Eckstein and Tsiddon 2004; Eldor and 
Melnick 2004), political (Berrebi and Klor 2006; Gould and Klor forthcoming) 
and behavioral (Stecklov and Goldstein 2004) implications of terror. Do repeated 
attacks cause societies to adapt and become habituated or do they sensitize popu- 
lations and cause them to become increasingly traumatized? Understanding the 
evolution of the social response to terror may provide insights into the ability of 
societies to deal with repeated stress, not just terrorist attacks and war, but also 
natural disasters and other large-scale crises. 

The question of how societies respond to repeated terror attacks requires both 
an understanding of how stress might be produced as well as the mechanisms 
through which this response might weaken or strengthen over time. Our ap- 
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proach builds on both micro- and macro-level explanations relating adaptations 
to stress. At the individual level, an extensive literature in psychology examines 
individual reactions and adaptations to terrorist attacks, war and disasters. This 
includes the exposure of civilians to wartime bombings (Bleich et al. 1992; U.S. 
Strategic Bombing Survey 1947), how populations are affected by terror and 
natural disasters (North and Pfefferbaum 2002; Tierney et al. 2006), as well 
as studies that analyze the reactions of soldiers to extended periods of combat 
(Friedman 2006; Hoge et al. 2006). At the macro level, communities, societies 
and state institutions may adapt, and their reaction may be critical in determin- 
ing response in the face of political violence (Collins 2004; Kimmerling 1985) 
or natural or man-made disasters (Erikson 1976; Quarantelli and Dynes 1977; 
Rodriguez et al. 2006). 

The focus of our investigation is Israel, where we examine the social response to 
repeated terrorist attacks over a four-year period during the Second Intifada—from 
Jan. 1, 2001 to Dec. 31, 2004. We employ unique data on driving behavior and 
traffic accidents within Israel over this period as well as on the timing of terrorist 
attacks. Our research seeks to determine whether past findings that demonstrated 
a link between terrorism and driving for an 18-month period continue to hold 
when the length of the study period is extended to four years. A second aim is to 
use the extended time series on terror, traffic volume and traffic fatalities to test 
whether the response to terror shifts over time. 


Measuring the Response to Terror 


Early perspectives on the social consequences of terrorism were often based on
studies of population-level responses to disasters and wars (National Research
Council 2003; Tierney 2007). Unfortunately, much of that evidence was hearsay
and after-the-fact reports, often from participants, and “subject to selectivity and
distortion” (National Research Council 2003; Tierney et al. 2006). More recently,
a large number of systematic studies on the societal reaction to terror have been
carried out using better-designed surveys and based on random-sampling methods.
Many of these studies are based on surveys carried out immediately after
the event, and generally include instruments to test for PTSD or other forms of
stress and anxiety. These studies have been valuable in providing insight into the
emotional stress following events such as the 9/11 attacks or terror attacks in Israel
or Spain, and they have enabled researchers to distinguish those individual-level
characteristics more associated with severe responses to terrorism. However, even
in the best of cases, there is a delay of 3 to 5 days from the attack to the start of
data collection, and in most cases the delay is longer. The delay exacerbates the
possibility that individual responses to traumatic events will be socialized through
friends or media sources (Tierney et al. 2006). Furthermore, the potential implications
of non-response bias in surveys are very difficult to gauge following disasters
or terrorist attacks (North and Pfefferbaum 2002; Silver et al. 2002).

An alternative approach is to focus on behavioral changes that follow popula-
tion-level traumatic events. In this respect, both data on the amount people drive 
(traffic volume) and on the quality of their driving (traffic accidents) provide 
important instruments for measuring the behavioral response to terrorism. First, 
the lack of selectivity of driving in a highly mobilized society like Israel
widens the generalizability of the response. Another advantage is that traffic ac- 
cident and volume data are collected in routine data systems with fine precision, 
offering a method to gauge the behavior before and after the event studied. Also, 
the routine data systems in place for collection of traffic accident and traffic volume
are not systematically affected by terror attacks themselves, helping to ensure
that causality is correctly identified. 

The two outcomes in our analysis—traffic volume and fatal traffic accidents—each 
offer distinct insight into the effect of terror on a different socio-behavioral process. 
Traffic volume, which measures the number of cars on the road, is in part a measure 
of discretionary travel. Changes in this indicator can capture the impact of a ter- 
rorist attack on people's inclination or desire to undertake leisure activities, or their 
response to social signals that staying home is “correct” and “safe” following an at- 
tack. The decline in leisure is consistent with evidence indicating short-term falls in 
coffee shop sales in Jerusalem immediately following attacks in Israel (Spilerman and 
Stecklov 2009). Increasing use of roadblocks following a terrorist attack, a common
practice, is also relatively short-term and generally limited to the broader vicinity 
of the attack (Weisburd et al. 2009). In addition, changes in traffic volume could 
be further affected by changes in work-related traffic if substitutions occur between 
private automobile driving and alternative forms of transportation such as busses, 
which may face higher perceived risks from terror—particularly following attacks 
that specifically targeted busses (Becker and Rubinstein 2010). 

The traffic accident data measure the quality of driving for those who have 
made the decision to be on the road. In this case, the mechanism of influence on 
accidents is more likely associated with micro-behavioral change associated with 
stress and anxiety following terrorist attacks. There is a well-documented positive
association between psycho-social stress and traffic accidents (Norris et al. 2000; 
Selzer et al. 1968). This link is believed to be due to the association between 
driving behavior and aggression, stress and frustration (Norris et al. 2000; Shinar 
1998; Underwood et al. 1999). Longitudinal studies on war veterans provide further
evidence of this link between stress and driving behavior. They show that war
veterans, experiencing greater levels of stress following war-time experiences, also 
face higher risks of traffic mortality (Macfarlane et al. 2000; Writer et al. 1996). 

The link between stress and fatal traffic accidents has also been discussed in the 
literature on imitative suicides. A series of seminal studies has shown that suicides, 
traffic fatalities and airplane accidents often increase following well-publicized 
suicides and murders (Bollen and Phillips 1982; Phillips 1977, 1978, 1979), suggesting
that some portion of traffic fatalities are in fact disguised suicides (Phillips
1977; Ohberg 1997). Thus, terrorist attacks, particularly given the widespread and
graphic television attention they receive, may lead to more covert suicides in the 
form of traffic fatalities (Stecklov and Goldstein 2004). 

The current study builds directly on prior research showing the response of driv- 
ing behavior in the wake of terrorist attacks (Stecklov and Goldstein 2004). In the 
United States, following 9/11, people shifted modes of long distance travel away from 
airplanes, and this choice has been connected to an increase in total fatalities associ- 
ated with travel (Blalock et al. 2008). In Israel, relatively small distances mean that 
domestic travel is almost entirely by surface transportation. Prior research in Israel has 
shown that the amount people drive is reduced in the short term following terrorist 
attacks, but this effect is delayed by a day or two following the attack (Stecklov and 
Goldstein 2004). Also, fatal traffic accident rates appear to rise following terrorist 
attacks, although this substantial and significant increase is only observed after a 
three-day lag and then quickly dissipates (Stecklov and Goldstein 2004). 


Habituation or Sensitization to Endemic Terror? 


Several studies based on single-event terrorist attacks have shown that effects weaken 
over time (Silver et al. 2002; Vlahov et al. 2004). Yet, when a society such as Israel or
Iraq faces a form of trauma that is repeated many times over an extended period of 
time, another fundamental question arises. How does the population-level response 
to terror evolve through the repeated experience of this same form of traumatic event? 

We consider two alternative perspectives for understanding how the effect of 
terrorism may shift over time—assuming an effect is identified. One is that the 
population’s reaction weakens after each attack as individuals become habituated. 
The other is that sensitivity to terrorism increases as the result of repeated assaults. 
Understanding how this process evolves offers a unique lens into individual and 
social processes of adaptation. 

At the micro level, the process of habituation is well understood in both human 
and non-human organisms, although there is debate about the underlying mechanisms
that drive the process. Both psychological and physiological research on habituation
demonstrate how the response to a repeated stimulus may be modified by
adaptive learning (del Rosal et al. 2006; Thompson and Spencer 1966; Thompson 
1986). Furthermore, empirical evidence suggests that individuals tend to over- 
react to threats that are rare and unfamiliar such as the risk of flying (Blalock et 
al. 2008; Sivak and Flannagan 2003; Sunstein 2003). Increased familiarity with 
the “risk” of terror may be instrumental in reducing the reaction to terrorism. 
Thus, terror at the early stages will invoke stronger fears and responses within the 
population, but less fear will be generated by later attacks as the threat and risk 
become increasingly familiar. 

Habituation may also occur simultaneously at the broader group or societal lev- 
el. Within emergent norm theory, collective behavior is altered as crises force the 
reevaluation of what is legitimate conduct (Turner and Killian 1987). Furthermore, 
an increasing consensus of what the social normative response to terrorism may be
can cause habituation as individuals receive “psychological gain” from a narrowing 
set of behavioral choices (Berger and Luckmann 1966). Individuals can then adapt 
their leisure responses to fall within the range of legitimate behavior —whether this 
behavior involves dating, coffee breaks or other leisure activities. 

Habituation is also facilitated by the complex institutional structures that con- 
nect individuals. Because terrorism targets entire societies, societal institutions such 
as family, kin, community and state may play an increasingly active and effective 
role and contribute to a shift in the response to terror over time. This may be a direct 
result of families learning to better identify and support more sensitive family mem- 
bers (Bleich et al. 2003; Kaitz et al. 2008; Shalev et al. 2006). New organizational 
and institutional forms may also emerge to provide necessary assistance and enable 
individuals to cope better with the ongoing threat (Kimmerling 1985; Stadler et 
al. 2005). Becker and Rubinstein (2010) use natural variation in media exposure 
in Israel—a product of the non-publication of Saturday newspapers—to show that 
media coverage plays an important role in the social response to terrorist attacks.
Coping may then be further facilitated over time if
media attention declines during endemic terror—a pattern which appears to have 
occurred in Israel (Frosh and Wolfsfeld 2007; Liebes and Kampf 2007). 

In contrast to habituation, the reaction to terrorism may strengthen rather than 
dissipate as attacks are repeated, leading to increased sensitivity over time. Heightened 
sensitivity, like habituation, is a form of response to repeated stimuli (Kandel 1976; 
Thompson 1986). After a particular fear is triggered in a person, a subsequent reoccurrence
may stimulate a stronger response because of the development of “hyperexcitable
fear circuits” (Rosen and Schulkin 1998). Thus, early terrorist attacks may
cause a relatively weak reaction, but hard-wire an individual so that later attacks 
generate a stronger response. Furthermore, as casualty counts rise, increasing num- 
bers of people become linked through family or friendship to dead or injured victims, 
potentially contributing to an increase in response to terrorist attacks. 

At the broader macro level there is little reason to expect institutional mecha- 
nisms to strengthen responses over time. In this sense, the impact of terrorism is 
quite different than that of war, given that most institutions continue to function 
in some capacity. In war, governments may collapse, providing little or no support 
as occurred in late 20th century Liberia (Levitt 2005). In the case of Israel, however,
Kimmerling (1985) argues that past wars prepared Israel’s central institutions, for 
the most part, to deal effectively with conflict. 

Ultimately, whether habituation or sensitization dominates, or whether they 
cancel out, the result is informative for understanding the connection between 
social stress and behavior. Micro-level theory provides no clear
prediction, while sociological reasoning is less ambiguous and would tend to favor
habituation. Of course, both the micro- and macro-level processes must work
jointly, further complicating any prediction.


Data Sources and Methods of Analysis
Data Sources 


Our analysis is based on a combination of continuous, routinely collected data 
on automobile traffic volume levels and fatal traffic accidents, which enable us to 
gauge the population-level reaction to terrorist attacks and shifts in this reaction 
over time. Equally important, because such a large proportion of the population 
drives, our indicator has excellent population level coverage. Traffic accident data
provide our second indicator for the population reaction to terrorism. We ex- 
cluded data on non-fatal accidents for reliability concerns (Elvik and Mysen 1999; 
Hauer and Hakkert 1988), and considered only data from fatal traffic accidents. 

Traffic volume data provide exceptionally precise feedback into the response 
of the population to terrorism. Do attacks cause people to be less mobile, less 
outgoing and to reduce their activities in order to lessen their exposure? Fatal 
traffic accident data, in combination with traffic volume data, produce 
traffic accident rate measures, which may uniquely identify the extent 
to which drivers are stressed and aggressive. Both sources of data are
routine, continuous systems, which alleviates the possibility of psycho-social
response biases produced by surveys following violent events
(Dijkema et al. 2005; Norris et al. 2002). The pseudo nature of the 
fatal accident rate we use is due to the fact that fatal accidents occur 
throughout the country, while the exposure indicator is based on a 
single freeway in central Israel. However, it should be noted that the
Ayalon freeway is Israel’s most travelled freeway and bisects its
largest city, Tel Aviv (Bar-Gera 2007).

Our earlier work has identified some potentially important pop- 
ulation-level reactions to terrorism using these data (Stecklov and 
Goldstein 2004). Large and significant declines in traffic following ter- 
rorist attacks have been identified using data from the first 18 months 
of the Second Intifada (Jan. 1, 2001 to June 22, 2002). These declines 
are centered on the third day after the attacks. Data on traffic accident 
rates, calculated from the number of fatal traffic accidents nationwide 
on day t, corrected for exposure levels based on traffic volume data
for the Ayalon on day t, showed a short-lived decline in light accident 
rates and no obvious pattern in serious accident rates following terrorist 
attacks. However, both types of accident data are suspect because of 
potential reporting biases (O’Day 1993). In fact, fatal accident data, 
which suffer a much smaller degree of inaccuracy (Elvik and Mysen 
1999), showed that the fatal accident rate rose dramatically on the 
third day following attacks. This large and significant rise of almost 35 
percent (p = .01) shows an even greater effect of 69 percent (p = .02) 
when only large terrorist attacks are included. 

Data on traffic volume are routinely collected using rubber gauges
placed at various points along the freeway. The gauges provide a con- 
tinuous relay of traffic flow data, collected in a central database and 
aggregated into counts for each five-minute interval. For our purposes, 
traffic volume is measured as the number of vehicles passing each of 
four points along this freeway. The values from the four locations are 
averaged to create a summary measure of average traffic flow for each
five-minute interval, then the data are further aggregated into peak-hour and
non-peak-hour averages. Further analysis showed little substantive difference between
the peak and non-peak patterns in our data, leading us to combine the traffic
volume into total daily averages (see Figure 1). The traffic volume data are also
used to proxy for nation-wide driving exposure, enabling us to construct pseudo
fatal traffic accident rates for the country as a whole (see Figure 1).

Note: Daily Traffic Volume Levels from Ayalon Freeway, Fatal Traffic Accidents from Israel Central Bureau of Statistics and Terrorist Attack Fatalities for Israel, Jan. 1, 2001 to Dec. 31, 2004 (with gaps).
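
The aggregation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure actually used for the analysis: the function names, the input layout (one list of four gauge counts per five-minute interval), the treatment of incomplete days and the choice to sum interval averages into a daily figure are all hypothetical.

```python
from statistics import mean

INTERVALS_PER_DAY = 288  # 24 hours of five-minute intervals


def interval_average(gauge_counts):
    """Average the vehicle counts reported by the gauges for one interval."""
    return mean(gauge_counts)


def daily_volume(intervals):
    """Combine per-interval averages into a single daily figure.

    Assumption: days without a complete set of intervals are dropped
    (returned as None) rather than imputed.
    """
    if len(intervals) != INTERVALS_PER_DAY:
        return None
    return sum(interval_average(counts) for counts in intervals)


def pseudo_fatal_rate(fatal_accidents, volume):
    """Nationwide fatal accidents divided by freeway volume (a proxy exposure)."""
    return fatal_accidents / volume
```

The rate is "pseudo" in the sense used in the text: the numerator is national while the denominator comes from a single freeway, so only relative changes over time are meaningful.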

Data on terrorist attacks are obtained from the International Policy Institute 
for Counter-Terrorism at the Interdisciplinary Center of Herzliya, Israel and
B’Tselem, a human rights group. We only include attacks and traffic information
from within the 1967 borders of Israel, given the substantial differences in popula- 
tion composition and infrastructure separating Israel proper and the settlements, 
so West Bank, Gaza and East Jerusalem are excluded. During the period covered 
by our analysis, 91 terrorist attacks are reported with at least one civilian fatality. 
These attacks produced a total of 505 victims for an average of 5.5 fatalities per 
fatal attack. In this analysis we distinguish large attacks as those with 10 or more 
fatalities, of which there were a total of 19. 

We include a number of controls to capture the impact of various factors that
might otherwise cause us to mistakenly determine a causal effect of terrorism. For
example, if traffic volume tends to be higher on Sundays and attacks are more
common on Sundays, then we might conclude an association between terrorism 
and traffic volume even though this connection is spurious. The controls we 
include are year, month of the year and day of the week for our daily observations 
of traffic accident volume as well as important holidays. 

The current analysis extends the earlier study by supplementing the original 
data from the period Jan. 1, 2001 to Jun. 22, 2002 (Period 1) with new data to 
extend the analysis from June 23, 2002 to Dec. 31, 2004 (Period 2). However, 
several gaps exist during which traffic volume data were not collected (see top 
panel of Figure 1). These gaps are due to computer problems and not associated 
with terror. In our final analysis, only days with a full 24 hours of data were used
to avoid introducing additional error into the analysis. Our final data include 
1,306 days over the four-year period. The daily average count of volume over this 
period was 105,729 vehicles per day (sd = 15,342). The most visible outliers in the 
top panel of Figure 1 are for Yom Kippur, the holiest day in the Jewish calendar
and a day on which traffic volume is nearly six standard deviations below the mean.
With respect to fatal accidents, there is an average of 1.2 per day (sd = 1.1). 


Method of Analysis 


Our statistical analysis is based on the use of regression methods to assess how 
traffic volume and fatal traffic accident rates are affected by terrorist attacks on
the same day and any of the five subsequent days. We use a simple distributed-lag 
formulation whereby the effect of an event on each of the lagged days is separately 
estimated in the same model. While multiple lags may introduce high degrees of 
collinearity, preliminary analyses indicate low correlations between lagged days.
Prior analysis has already shown that five lagged days following the day of the 
attack is sufficient to capture the short-term dynamics following terrorist attacks 
(Stecklov and Goldstein 2004). Also, by keeping the length of the lags limited to 
five days following attacks, we reduce complications that result from overlapping 
intervals with multiple attacks. 

Our analysis is based on both ordinary least squares regression and Poisson re- 
gression. We use linear regression for the logged values of the volume data primarily 
because we expect traffic volume effects to be in proportional rather than absolute 
terms. The logarithmic transformation also helps to reduce the influence of outli- 
ers in the traffic data. The count nature of the fatal accident data along with their 
distribution suggests we use a count-data model. A variety of statistical models are 
available in such cases enabling us to overcome the distributional properties that 
may make linear regression unattractive (Long and Freese 2000). A central assump- 
tion of the Poisson model is that the mean and variance are equal. Failure of this 
condition is common and may lead researchers to alternatives such as the negative 
binomial model (Donner 2007). In our sample, the mean and variance both equal 
1.2, providing no indication of overdispersion and supporting the Poisson model. 
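
The equal mean and variance condition can be checked directly from the daily counts. The sketch below is illustrative only: the counts are made-up values, not the study data, and the helper name is hypothetical. A ratio well above 1 would point toward overdispersion and a negative binomial model; a ratio near 1 is consistent with the Poisson assumption.

```python
from statistics import mean, variance


def dispersion_ratio(counts):
    """Sample variance divided by sample mean for a list of daily counts."""
    return variance(counts) / mean(counts)


# Hypothetical daily fatal-accident counts, for illustration only.
example_counts = [0, 1, 2, 1, 0, 3, 1, 2, 0, 2]
ratio = dispersion_ratio(example_counts)
```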

Our baseline model for estimating the effect of terror on traffic volume on day
t is shown in Equation 1:


\log(V_t) = \alpha + \beta \cdot X_t + \sum_{k=0}^{5} \gamma_k \cdot \mathrm{terror}_{t-k} \qquad (1)


Here, the vector $X_t$ contains dummy variables for year, month, day of week, and
whether a day is a holiday. The corresponding vector $\beta$ gives the coefficients, which we
show in exponentiated form, that capture the impact of a specific category relative
to the reference category. The $\alpha$ term is the intercept. Within the summation, the
terror dummy variable takes the value “1” if there has been a fatal terror attack
on a particular day t and “0” otherwise. Thus, the $\gamma_k$ coefficients each capture the
impact of a terror attack on a particular day, t-k, on the value of $V_t$.
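
To make the distributed-lag structure concrete, the following sketch builds the six lagged terror dummies (k = 0 to 5) for each day. It is a minimal illustration, assuming days are indexed by consecutive integers; the function name and input format are hypothetical and do not describe the software used for the analysis.

```python
def lagged_terror_dummies(attack_days, n_days, max_lag=5):
    """Build one row per day holding the terror dummy at lags 0..max_lag.

    attack_days: set of 0-based day indices with at least one fatal attack.
    Returns a list of rows; rows[t][k] is 1 if day t-k had a fatal attack,
    and 0 otherwise.
    """
    rows = []
    for t in range(n_days):
        rows.append([1 if (t - k) in attack_days else 0
                     for k in range(max_lag + 1)])
    return rows


# A single attack on day 2 switches on lag 0 for day 2, lag 1 for day 3,
# and so on through lag 5 for day 7.
rows = lagged_terror_dummies({2}, n_days=10)
```

Each column then enters the regression as its own regressor, which is what allows a separate coefficient to be estimated for every lag.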

A related model is estimated for traffic fatality data, but using Poisson regres- 
sion, so the model takes the form, 


\log(F_t) = \log(V_t) + \alpha + \beta \cdot X_t + \sum_{k=0}^{5} \gamma_k \cdot \mathrm{terror}_{t-k} \qquad (2)


Here, the number of fatal accidents on day t, $F_t$, is estimated while controlling
for exposure. In this case, the additional $\log(V_t)$ term acts as an “offset,” where
the parameter estimate of the offset is constrained to equal 1 (McCullagh and
Nelder 1989), so that the volume of traffic is taken into account when predicting
traffic fatalities. We include in Equation 2 the same set of control variables
and the same set of lagged dummies as in Equation 1. In both the volume and
accident regressions, our results are presented in exponential form so that a null
effect of a factor is given by a value near “1.” Our results are thus interpreted in
terms of the proportional change in traffic volume or the fatal traffic accident
rate following an attack occurring k days earlier.
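
The role of the offset and the exponentiated presentation can be illustrated numerically. The sketch below is not a fitted model: the function names are hypothetical and the coefficient values in the usage lines are arbitrary. It only shows that a log-volume offset with coefficient 1 makes the expected count proportional to volume, and that exponentiating a coefficient yields a rate ratio.

```python
import math


def expected_fatalities(volume, linear_predictor):
    """Expected count under a log link with log(volume) as an offset.

    log(F) = log(V) + eta is equivalent to F = V * exp(eta).
    """
    return volume * math.exp(linear_predictor)


def proportional_effect(coefficient):
    """Exponentiate a log-scale coefficient; a value of 1.0 means no effect."""
    return math.exp(coefficient)


# Arbitrary illustrative values: eta is the baseline linear predictor and
# gamma is a hypothetical lag coefficient.
eta, gamma = -11.0, 0.3
baseline = expected_fatalities(100_000, eta)
after_attack = expected_fatalities(100_000, eta + gamma)
rate_ratio = after_attack / baseline  # equals proportional_effect(gamma)
```

Because volume cancels in the ratio, the exponentiated coefficient describes a change in the accident rate rather than in the raw number of accidents.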

In addition to a set of analyses that focus on a main effect of terrorism, our 
attention is concentrated on two approaches for testing for habituation or sen- 
sitization in the behavioral response. The first, the period interaction approach, 
tests to see whether the relationship between fatal terrorist attacks and traffic 
changes over time. The second, the memory effect approach, evaluates whether 
the cumulative number of fatal attacks in past months influences the impact of 
attacks in recent days. 

To test for a change in the effect of the lags over time, we include
a simple interaction term between the lagged terms and a dummy indicator for
Period 2. In equations 3 and 4, the variable Z takes the value 0 for Period 1
and the value 1 for Period 2. While the two periods are not split in the
middle, they are reasonably balanced. The formal model for traffic volume,
$V_t$, including the same controls as before as well as a main effect for the Period
indicator, is


\log(V_t) = \alpha + \beta \cdot X_t + \lambda \cdot Z + \sum_{k=0}^{5} \gamma_k \cdot \mathrm{terror}_{t-k} + \sum_{k=0}^{5} \delta_k \cdot Z \cdot \mathrm{terror}_{t-k} \qquad (3)

The model in the case of traffic fatalities is


\log(F_t) = \log(V_t) + \alpha + \beta \cdot X_t + \lambda \cdot Z + \sum_{k=0}^{5} \gamma_k \cdot \mathrm{terror}_{t-k} + \sum_{k=0}^{5} \delta_k \cdot Z \cdot \mathrm{terror}_{t-k} \qquad (4)

Here, the coefficients, $\delta_k$, in the period interaction approach in equations 3 and
4 provide a test of whether the effect of terror shifts over time. 
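
As a reading aid for the period interaction, the sketch below shows how the lag-k effect in each period follows from the main and interaction coefficients. The function name and the numerical values are hypothetical and chosen purely for illustration; they are not estimates from the analysis.

```python
import math


def period_effects(gamma_k, delta_k):
    """Proportional lag-k effect in Period 1 (Z = 0) and Period 2 (Z = 1).

    With Z = 0 the interaction term vanishes, leaving exp(gamma_k);
    with Z = 1 the two coefficients add on the log scale.
    """
    return math.exp(gamma_k), math.exp(gamma_k + delta_k)


# Hypothetical coefficients: a negative interaction strengthens a negative
# main effect in the second period.
effect_p1, effect_p2 = period_effects(-0.05, -0.03)
```

An interaction coefficient of zero (an exponentiated value of 1) therefore corresponds to no change in the response between the two periods.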

We also test for habituation or sensitization using an alternative approach. This 
alternative focuses more on a direct form of adaptation associated with the level 
of past violence. It involves testing whether the effect of terrorism in the past few 
days is affected by the level of terror in the previous month(s). We explore previous 
month, two months earlier, three months earlier, and four to six months earlier.
The appropriate equations are


\log(V_t) = \alpha + \beta \cdot X_t + \sum_{k=0}^{5} \gamma_k \cdot \mathrm{terror}_{t-k} + \theta \sum_{j=t-n}^{t-m} \mathrm{terror}_j \qquad (5)



Similarly, we test the Poisson model for fatalities using, 

\log(F_t) = \log(V_t) + \alpha + \beta \cdot X_t + \sum_{k=0}^{5} \gamma_k \cdot \mathrm{terror}_{t-k} + \theta \sum_{j=t-n}^{t-m} \mathrm{terror}_j \qquad (6)



Our approach is based on testing for a moderating variable(s) based on the 
aggregate number of terrorist attacks for a given period of time from t - n to
t - m days prior. Thus, the new term in the summation reflects the aggregation of
the number of fatal attacks from t - n days ago, where the smallest value of n is 6
and the largest is 90, to m days ago, where the values range from 30 to 180 days
depending on the specification. Thus, we test a series of intervals including 6-30 
days, 30-60 days, 60-90 days and 90-180 days. Each non-overlapping interval 
produces a separate model. For each model, however, only a single value of $\theta$ is
produced. We also show the results of a model including all four intervals jointly. 
While the models in equations 5 and 6 are interesting, the statistical test for the 
moderator effect actually requires one additional step. When the main effect of 
terrorism in recent days is significant, we test a series of interactions between 
the lagged effects and the number of attacks over one or more months prior to 
determine whether the effect of a recent attack might be moderated by a previous 
attack. These alternative specifications are not shown but involve inclusion of an 
additional interaction between terror and the aggregate measures of past terror 
into the models shown in equations 5 and 6. 
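
The aggregate measure of past terror can be sketched as a simple window count. This is an illustration under assumptions, not the authors' code: the function name is hypothetical, days are indexed by consecutive integers, and both window boundaries are treated as inclusive, a detail the text does not specify.

```python
def attacks_in_window(attack_days, t, near, far):
    """Count fatal attacks that occurred between `near` and `far` days before day t.

    attack_days: iterable of integer day indices with a fatal attack.
    Both boundaries are inclusive (an assumption made for this sketch).
    """
    return sum(1 for d in attack_days if near <= t - d <= far)


# Hypothetical attack dates, measured against day 100.
attacks = {10, 60, 90, 100}
recent_month = attacks_in_window(attacks, t=100, near=6, far=30)
```

The resulting count is the quantity multiplied by the single additional coefficient in equations 5 and 6, and the same count is what would be interacted with the recent-day lags in the moderator test.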

Our analyses generate a large number of coefficients given that the distributed-lag
model produces a unique coefficient estimate, $\gamma$ (or $\delta$ where appropriate), for
each lagged day t. We take into account the comparisons created by the multiple-lag
coefficients in each model by using the Bonferroni method to multiply the traditional
P-values on the lagged effects (Abdi 2007; Bland and Altman 1995). We
adopt the Bonferroni corrected significance level of $\alpha^* = \alpha/6 = .0167$ to create a 90
percent confidence region with a significance level of $\alpha = .10$. The null hypothesis
is that the true effect at all lags is zero, with the alternative being two-sided. Both
the OLS and Poisson regression models have coefficients that are asymptotically
normal, and so the confidence intervals use +/- 2.39 standard errors, which covers
the desired $1 - \alpha^* = .9833$ interval. The adjusted confidence intervals produced
by this correction increase our confidence in the significant effects we observe. 
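
The multiplier of 2.39 standard errors can be reproduced from the standard normal distribution. The sketch below uses Python's standard library; the function name is hypothetical, while the values of 0.10 and six comparisons come from the text.

```python
from statistics import NormalDist


def bonferroni_critical_value(alpha=0.10, n_comparisons=6):
    """Two-sided normal critical value after a Bonferroni correction."""
    alpha_star = alpha / n_comparisons          # corrected level, about .0167
    return NormalDist().inv_cdf(1 - alpha_star / 2)


z = bonferroni_critical_value()  # approximately 2.39
```

Splitting the corrected level across both tails is what turns the 1.645 multiplier of an uncorrected 90 percent interval into roughly 2.39.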


Results 
Traffic Volume 


Our baseline model for daily traffic volume for the entire four-year period is shown 
in the first column of Table 1. This baseline model is similar to equations 1 and 2 ex- 
cept the terror lag dummies and coefficients in the summation term are not included 
in the model. The reference category for all our models is the year 2001, month of 
January, and day-of-week Saturday. The results of the baseline model illustrate the 
powerful role of the covariates on the logged value of daily traffic volume. Traffic 
volume increases from year to year over the four-year period, tends to be higher 
during the summer, and is unsurprisingly lowest on Saturdays and holidays when 
observant Jews avoid driving and many stores and restaurants are closed. 

Changes in the effect of attacks on traffic volume gauge the disruptive effect of 
terror attacks on routine activities and also shed light on the timing of responses —a 
reflection on how people reorganize their activities to reduce their vulnerability. 
Over the four-year period we find, as seen in Figure 2 and Table 2, that terrorist 
attacks reduce traffic volume on the day of the attack and the subsequent three 
days. However, only the coefficient on lag 2 is significant. Specifically, we find that 
terrorist attacks are associated with a roughly 4-5 percent decline in traffic volume 
on the second day following an attack. This effect, which is highly significant (p < 
.01), also remains significant after we apply the Bonferroni multiple comparison 
adjustment to the confidence intervals (see figures 2 and 3). 

Table 1: Baseline Proportional Effects for OLS Volume Models and Accident 
Fatality Rate Poisson Models 

                Traffic Volume        Fatal Accident Rate 
                Regression            Regression 
2002            .996   (.718)         .966   (.602) 
2003            1.022  (.039)         .9     (.128) 
2004            1.039  (.001)         .841   (.019) 
February        .998   (.908)         1.094  (.532) 
March           1.02   (.297)         1.013  (.923) 
April           1.005  (.781)         1.069  (.626) 
May             1.042  (.034)         .937   (.642) 
June            1.034  (.080)         1.386  (.011) 
July            1.046  (.018)         [illegible] (.363) 
August          1.041  (.036)         1.331  (.025) 
September       .897   (.000)         1.308  (.042) 
October         .99    (.614)         1.238  (.105) 
November        1.007  (.725)         .996   (.979) 
December        1.024  (.249)         1.304  (.055) 
Holiday         .702   (.000)         1.643  (.004) 
Sunday          1.423  (.000)         [illegible] (.001) 
Monday          1.401  (.000)         .68    (.000) 
Tuesday         1.417  (.000)         [illegible] (.000) 
Wednesday       1.406  (.000)         [illegible] (.001) 
Thursday        1.438  (.000)         [illegible] (.008) 
Friday          1.207  (.000)         .893   (.238) 
Observations    1,306                 1,306 
R-squared       [illegible] 
Chi-squared                           62.234 

Notes: Significant coefficients and p-value estimates (in parentheses) are in bold. 
Reference categories are 2001 for year, January for month, and Saturday for Day-of-Week. 

Additional evidence that the traffic volume declines we see are indeed causal can 
be obtained by checking for a “dose-response” effect, with larger attacks causing 
greater declines in traffic volume. This effect is observed in Period 1, when attacks 
with 10 or more fatalities produce larger declines in traffic volume (Stecklov and 
Goldstein 2004). Here, looking at the entire four-year period, we see that the 
dose-response effect is also evident for lag 2 (top right panel of Figure 2) where the 
impact of large attacks produces a larger, and significant, decline in traffic volume. 

Looking at each period separately and focusing on the test of habituation by 
examining the shift over time, we note a decline of roughly 3 percent on day 3 
following attacks in Period 1 (the lag-4 coefficient is significant at traditional levels 
but not at the Bonferroni adjusted levels). In contrast, during Period 2, the impact 
appears more immediately after the attacks and is seen from lags 0-2, with a peak 
of 9 percent on day 2 following an attack; the coefficients that are significant using 
standard confidence intervals are also significant after the Bonferroni adjustments 
(see Figure 2). Statistically, while the lag 1 and lag 2 coefficients differ across periods 
and are significant at traditional levels, they are not significantly different from Period 
1 after the Bonferroni adjustments. Thus, one cannot reject the hypothesis that the 
terror effects on daily traffic volume on the same day or one day after an attack that 
are observed in Period 2 do not differ from those seen in Period 1. However, this 
same test is significant on day 2 after an attack. Furthermore, this shift in the effect 
of terror on traffic volume on lags 0-2 is jointly highly significant (p = .001). 

The alternative approach (Equation 5) to look at adaptation focuses on whether 
the effects of terror in the days immediately following the attack are affected by the 
level of terror in prior months. The results of a test for a direct additive effect of 
past terror in addition to the effect of more recent terror are shown in Table 3. 
Results are presented for a model which includes only fatal terrorist attacks 6 to 
30 days prior (i.e., short memory effects) while a subsequent model also includes 
attacks two, three or six months earlier (i.e., long memory effects). 

Results provide evidence that terrorist attacks in the six days before an attack 
also have an impact on traffic. Interestingly, the direct effect of terrorist attacks 
in previous months, where significant, is in the opposite direction. The days 
6-30 measure comes out positive and significant (p < .05) for both all and large 
terrorist attack models indicating that past attacks, controlling for the negative 
effect of recent terrorism, are associated with higher traffic volume levels. We 
also introduce measures for the total number of fatal attacks two, three and four 
through six months earlier. The effect of the variables “Terror Days 6-30” remains 
positive and significant as is the effect of the variable “Terror Days 60-90.” The 
other measures for terror in the past are not significant. The additive effect of past 
terror appears to depend on terror up to three months back and terror from earlier 
months generates no additional impact on traffic volume. 

Figure 2. Proportional OLS Estimates of Effects of Terror Attacks on Daily Traffic 
Volume for the Days Following Attacks 
Notes: Data for Period 1, Period 2 and Periods 1 and 2, and for all attacks and large 
fatal attacks. We define “large” attacks as those in which 10 or more persons were killed. 
Effects that are not shaded are insignificant at the 5% level using traditional confidence 
intervals. Effects that are shaded are significant at the 5% level, without taking into 
account multiple comparisons. Effects outside of the dashed lines are significant at the 
10% level, after accounting for six multiple comparisons, one for each lag. The * marks 
effects that differ significantly from Period 1 to Period 2 at the 10% level, after accounting 
for multiple comparisons. 

Figure 3. Proportional Poisson Estimates of Effects of Terror Attacks on Daily Fatal 
Accident Rates for the Days Following Attacks 
Notes: Data for Period 1, Period 2 and Periods 1 and 2, and for all attacks and large 
fatal attacks. We define “large” attacks as those in which 10 or more persons were killed. 
Effects that are not shaded are insignificant at the 5% level using traditional confidence 
intervals. Effects that are shaded are significant at the 5% level, without taking into 
account multiple comparisons. Effects outside of the dashed lines are significant at the 
10% level, after accounting for six multiple comparisons, one for each lag. The * marks 
effects that differ significantly from Period 1 to Period 2 at the 10% level, after accounting 
for multiple comparisons. 

Are the effects we identified for terrorism over the past six days robust to the 
inclusion of controls for terrorism in previous months? The alternative is that 
the impact of terrorist attacks over the past few days depends on the level of terrorism 
in prior months. The results in Table 3 show that our main coefficient is 
unchanged (compare to Table 2), offering no evidence that the effect of attacks in 
the present is mediated by attacks in previous months. Furthermore, an interaction 
between the number of fatal attacks over the past month and the second-day 
lag effect is insignificant (p = .88), furthering the claim of no shift in the impact 
of recent terror depending on the scale of past terror. 

Results of both tests suggest that the impact of terrorism on the amount people 
drive remains relatively stable as terror continues and there is little evidence of either 
habituation or sensitization. At the same time, results from the period interaction ap- 
proach provide some evidence that the response accelerates in Period 2. This accelera- 
tion is indicated by a more rapid reaction as well as a speedier end of the effect. One 
possible explanation for such a shift in timing is that simple habituation or sensitiza- 
tion responses are insufficiently detailed to account for the complex dynamics when 
the response to terrorism is learned over time. In a manner reminiscent of Sherif’s 
(1936) classic experiment using the autokinetic phenomenon, norms are learned 
through repeated exposure to the same event. The establishment of norms may in 
itself accelerate the response as well as facilitate the recovery. Thus, our results might 
indicate that a particular form of habituation is occurring, but it is not expressed in 
terms of attenuation but rather by an acceleration of the response and its dissipation. 


Accidents 


Despite a monotonic decline in the fatal accident rate from year to year, few effects 
are actually significant except for the coefficient on 2004 (relative to 2001), which 
is individually significant, though not after the Bonferroni adjustment. The year 
coefficients are also not jointly significant (p = .09). The monthly pattern shows 
that fatal accident rates tend to be higher in the summer and are about 64 percent 
greater during holidays. Relative to Saturdays, all other days have significantly 
lower rates except for Fridays. 
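
For readers less familiar with proportional effects, the conversion from a table entry to the percentage figures quoted above is simple arithmetic. The following sketch is a minimal illustration; it assumes that the Table 1 entries are exponentiated coefficients, and the function name is hypothetical.

```python
import math

def percent_change(proportional_effect: float) -> float:
    """Percent change implied by a proportional (exponentiated) effect."""
    return (proportional_effect - 1.0) * 100.0

# Fatal accident rate entries as reported in Table 1
holiday_effect = 1.643    # holidays relative to non-holidays
year_2002_effect = 0.966  # 2002 relative to the 2001 reference year

# Log-scale coefficient, under the assumption that the table shows exp(beta)
holiday_log_coef = math.log(holiday_effect)

print(f"Holiday: {percent_change(holiday_effect):+.1f}%")
print(f"2002 vs. 2001: {percent_change(year_2002_effect):+.1f}%")
```

A value above 1 is read as an increase relative to the reference category and a value below 1 as a decrease, which is how the holiday entry of 1.643 corresponds to a rate about 64 percent higher.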

Looking at both periods in Table 4, the day 3 increase in fatal accidents (Stecklov 
and Goldstein 2004) identified solely during Period 1 continues to be significant 
(p = .01). However, change over time is more difficult to detect. The average effect 
over the two time periods studied is a 25 percent increase in the traffic fatality rate 
on the third day after an attack (Table 4). This average is a combination of a 35 
percent (p = .01) increase in period one and an insignificant 11 percent increase 
in period two. While this might be indicative of habituation, the estimate is too imprecise 
to reject the hypothesis of no decline in the day-three effect. 

An argument could be made that this pattern offers evidence of habituation 
given the large decline in the coefficient value between periods 1 and 2, and that 
the Period 2 estimate is not significantly different from zero. On the other hand, 
tests for a change of this day-3 effect over time are inconclusive. The interaction 
test for a change is not significant (p = .25), making it difficult to claim attenuation. 
Also, the Period 2 coefficient on the day-3 lag, which is still positive and 
substantively (though not statistically) significant, equals 1.11, indicating an 11 
percent increase in the fatal accident rate. 


Focusing exclusively on large attacks lends greater credence to claims of persis- 
the fatal traffic accident rate is replaced by a Period 2 day-4 effect of an 84 percent 
increase. The former loses significance after the Bonferroni adjustment. While the 
temporal shift might be interpreted as real, a more skeptical interpretation might 
focus on random variation as the apparent cause. It is probably more reasonable 
to interpret this as a continuation of a powerful lagged effect rather than any real 
shift in the underlying behavioral response. Ten large attacks in each of the periods 
probably does not provide sufficient power to capture such shifts with accuracy. 
The day-3 or day-4 effects in each of the two periods are most likely identifying 
the same process, which appears to have lost little power over time. Finally, another 
sign of persistence is the manner in which the coefficient for small attacks on the 
third-day lag remains nearly unchanged from 1.255 to 1.247 (not shown). 

An additional set of models provides further support for claiming consistency 
over time. First, in Table 3 we repeat the earlier memory test where measures for 
the total number of fatal terrorist attacks in previous months are introduced into 
the regression. As with the volume results, the effects of terror in the immediate 
past are unchanged when the potential mediating variables are introduced. This is 
apparent when looking at all attacks as well as large attacks, although the latter 
result is less relevant given that the main effect of large attacks is not significant 
for the entire period. Furthermore, an interaction between the total number of 
terrorist attacks from 60-90 days prior with terrorist attacks three days earlier—the 
test of a moderator effect of past terror—is not significant. 


Discussion 


There is intense interest in understanding how populations react to traumatic events, 
including natural disasters as well as man-made disasters and violence. Terrorism 
has emerged in the 21st century as a central form of inter-societal violence with 
consequences across a wide range of personal and social activities. In this article, 
we examine a fundamental question that cannot be examined in the context of 
most other forms of societal trauma: does repeated exposure to endemic terrorism 
in Israel lead to a process of habituation whereby the effects of terror attenuate, or 
do populations become increasingly sensitized to terror through repeated exposure? 

Our results provide overall support for earlier published findings on the ef- 
fects of terrorism on traffic volume and accident rates. Over the entire four-year 
period, traffic is reduced on the day of the attack and three subsequent days, 
but only the 5 percent decline on the second day following fatal terrorist attacks 
is significant. Similarly, we find a 24 percent increase in fatal traffic accident 
rates on the third day following terrorist attacks. The results tell us that there 
is a reaction that can be measured using traffic data, that this reaction is not 
immediate but delayed, and that the impact of terrorism on both the amount 
people drive and the “quality” of their driving also dissipates by the fourth or 
fifth day following an attack. Our analysis also provides new evidence on how 
the impact of terror on current behavior may reverberate over the course of time. 
In particular, we find that the effects of terrorist attacks in previous months 
continue to influence behavior but only for some three months. 

Our main findings indicate that there is little or no habituation or sensitiza- 
tion to terror attacks in the traffic data. These results are most consistent with 
persistency in the response to terror attacks over the course of the 2nd Intifada. 
This is evidenced by the lack of change in the response over time. It is further 
clarified when we test whether current terror effects are moderated by the level of 
terror in prior months. These effects indicate continuity of impact, regardless of 
the level of violence experienced. Stability in the reaction to terror is interesting 
in light of limited evidence suggesting that recurrent disasters produce increased 
social resiliency, particularly where social institutions and social capital provide 
a sufficiently stable context (Adger 2005; Samanta 1997). In Israel, while social 
institutions are strong (Kimmerling 1985), the political nature of attacks and their 
focused impact may not create the same degree of shared experience and social 
resiliency, reducing the potential for social habituation. 

Both approaches used to test for habituation rely on a specific null hypothesis 
test. A lack of significance in the test of the interaction coefficient is either because 
the null hypothesis is true or because we lack the statistical power to reject the null. 
Earlier analysis found no interaction between time and terror attacks but these 
findings were treated cautiously because of the relatively short duration studied and 
low number of terror attacks (Stecklov and Goldstein 2004). In contrast, the cur- 
rent analysis includes more observations and the statistical power is increased. With 
a standard error of .03 on the interaction coefficient, effects larger than 6 percent 
should be detectable. Thus, while we cannot conclude that there is no effect, we can 
be reasonably confident that the change in the effect over time is less than 6 percent. 
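
The 6 percent figure follows from roughly two standard errors on the interaction coefficient. A rough check is sketched below. The 5 percent two-sided level is an assumption made here for illustration, since the text does not state which level underlies the figure, and the variable names are hypothetical.

```python
from statistics import NormalDist

se_interaction = 0.03  # standard error reported in the text
level = 0.05           # assumed two-sided significance level (not stated in the text)

# Smallest coefficient, in absolute value, that would be significant at this level
z = NormalDist().inv_cdf(1 - level / 2)
threshold = z * se_interaction

print(f"{threshold:.3f}")  # about 0.059 on the log scale, i.e. close to 6 percent
```

On the log scale used in the models, a coefficient of this size is approximately equal to the proportional change it implies, so the threshold can be read as a change of about 6 percent.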

And despite the overall impression of stability in our findings, there are also signs 
in the data—albeit statistically weaker ones—that the population-level reaction to 
terror has been accelerated, appearing more quickly but also dissipating more rapidly. 
This unexpected shift is difficult to interpret specifically in terms of either sensitiza- 
tion or habituation. Instead, a shift in the timing of the effect may itself represent 
a form of social habituation that develops over time as the response is accelerated. 
To the extent that habituation is a form of adaptive learning, both individuals and 
society might habituate to terror by developing new norms of response. The introduction 
of these customs over time produces a learned reaction. The evolution 
of a norm, in this sense, may be precisely the sort of effect that we observe in the 
traffic volume results. While this interpretation is reasonable, far more research is needed to explore how 
behavioral patterns emerge during periods of intense violence, such as terror attacks, 
and to substantiate our findings with alternative behavioral measures. 

Notes 

1. Fatal traffic accident rates are also used in Stecklov and Goldstein (2004). 

2. The largest gap is from Dec. 7, 2003 to March 9, 2004 during which all data was 
irretrievably lost due to computer problems at the Ayalon Management Freeway 
Company. There is no statistical difference in the mean level of terror attacks between 
periods with and without traffic volume data. 

3. Simple tests of bivariate correlations between the lags indicate that the average 
correlation is under 4 percent and none of the values exceed 9 percent while the VIF 
is near 1. 

4. The last date included in the analysis of the first paper is almost identical to the date 
at which half of the total, cumulative fatalities from terrorism over the four-year 
period occurred. 

Social characteristics that differ by place of residence are consequential for health. To 
study implications of this among older adults in rural vs. urban China, this study 
employs data from the Beijing municipality, a region that has witnessed growth and 
gaps in development. Life and active life expectancy are assessed using a multistate life 
table technique that estimates hazard rates and subsequent expected years in various 
health states. Hazards are estimated for a model that adjusts regional differences for 
age and sex and for a series of other models including additional covariates. Results 
indicate urban residents have an advantage. Specific factors show socio-economic 
status and access to health service account for a large part, social support and health 
behaviors for little, while disease is a suppressor. 





Introduction 


Sociological studies have long implicated spatial and ecological setting as con- 
sequential for a wide variety of health and other well-being outcomes at various 
stages of life suggesting that place of residence itself is a critical social determinant 
of health (LeClere, Rogers and Peters 1997; Robert 1999). In some ways, the issue 
can be traced as far back as Engels and Marx’ concern about the living environ- 
ments of working classes in London. One of the reasons that location seems to 
matter is a number of characteristics that tend to distinguish individuals living 
across ecological settings that influence health outcomes, such as those related to 
domains of socio-economic status, behaviors, social cohesion and access to health 
services. A primary residential delineation distinguishing individuals across these 
domains is urban vs. rural residence. Yet, despite the history of research examining 
the effects of ecological setting on health, surprisingly little is known about 
health differences between people living in urban and rural areas, particularly in 
the developing world (Langmore 2001; Montgomery et al. 2003). Even in stud- 
ies that include an urban/rural covariate, there is typically little consideration of 
underlying factors that generate discrepancies. 

The current study quantifies the extent to which urban/rural differences in 
health exist with reference to mortality and functional health outcomes and assesses 
which of a series of socially related characteristics play a part in accounting 
for differences. The setting is the Beijing municipality of China, the data come 
from the mid 1990s, and the study sample comprises adults ages 55 and older. 
The evaluation of health discrepancies relies on active life expectancy, which divides 
total life expectancy into different states of functional health. Although a small 
number of studies have examined active life expectancy in China, these, for the 
most part, do not divide estimates by place of residence or across regions of the 
country. One study showed a slight rural advantage (M. Wang, cited in Saito et al. 
2003). This was, however, based on cross-sectional prevalence rate estimates using 
the Sullivan method, a technique that results in biased estimates if large changes in 
mortality and morbidity incidence occur over time (Laditka and Hayward 2003). 
The current study uses a multi-state life table approach. It adjusts estimates for 
characteristics related to four individual-level social domains that are thought to 
differ across urban and rural areas — social support, socio-economic status, be- 
haviors, and health access — and one health domain that considers differences in 
disease patterns. These adjustments allow for an assessment of the extent to which 
domains explain and mediate the residential impact on health. 
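
To make the contrast with the prevalence-based approach concrete, the Sullivan calculation mentioned above can be sketched as follows. This is a minimal illustration and not the multistate method used in the current study: the function name is hypothetical, and the life table and prevalence values are invented toy numbers, not estimates from the Beijing data.

```python
def sullivan_active_life_expectancy(survivors_at_start, person_years, prevalence_inactive):
    """Active life expectancy at the first age of the table (Sullivan method).

    survivors_at_start  -- life table survivors at the starting age
    person_years        -- person-years lived in each age interval
    prevalence_inactive -- cross-sectional share in the inactive state, per interval
    """
    if len(person_years) != len(prevalence_inactive):
        raise ValueError("person_years and prevalence_inactive must align")
    # Weight each interval's person-years by the share observed to be active
    active_years = sum(L * (1.0 - p) for L, p in zip(person_years, prevalence_inactive))
    return active_years / survivors_at_start

# Hypothetical toy inputs for four successive age intervals
survivors = 1000
person_years = [4800, 4300, 3500, 2400]
prevalence_inactive = [0.05, 0.10, 0.20, 0.40]

total_le = sum(person_years) / survivors
active_le = sullivan_active_life_expectancy(survivors, person_years, prevalence_inactive)

print(round(total_le, 2), round(active_le, 2))  # 15.0 12.67
```

Because the weights are prevalence rates observed at a single point in time, they reflect past rather than current transition rates, which is the source of the bias noted above when mortality and morbidity incidence are changing.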


Urban/Rural Health Differences Historically 


Differences in health by urban vs. rural residence have been observed since the 
industrial revolution when urban living brought hazards owing to communicable 
diseases that were transferred easily across populations living in close proximity 
(Kearns 1988). An epidemiological transition, from communicable to degenerative 
disease, benefited urbanites, mostly due to public health, political factors and social 
dynamics (Preston and van de Walle 1978; Szreter 1997). Then in a landmark study, 
Kitagawa and Hauser (1973) reported mortality rates in non-metropolitan areas of 
the United States to be 5 percent below those in metropolitan areas. The advantage 
has more recently been confirmed by House et al. (2000) even after adjusting for 
individual-level social, demographic, economic and behavioral characteristics. 

The situation for older adults in the United States is somewhat more complex. 
Indeed, a number of studies have reported a rural advantage in mortality and other 
health outcomes (Clifford and Brannon 1985; Hayward, Pienta and McLaughlin 
1997; Laditka et al. 2007; Smith et al. 1995). Because urbanites have better access 
to health service, these findings present somewhat of a paradox, which is often 
explained with reference to levels of social support. Specifically, rural areas are 
characterized by greater cohesion, more frequent exchange and contact between 
generations, and more affective family relationships (Amato 1993; Beggs, Haines 
and Hurlbert 1996). In turn, social support characteristics have been implicated 
as health determinants (House, Landis and Umberson 1988). 

In the developing world, urban/rural health differences have received less at- 
tention (Kinsella 2001; Langmore 2001). Much of what is known was summa- 
rized by the National Research Council’s Panel on Urban Population Dynamics 
(Montgomery et al. 2003). The panel’s work indicated that, on average, urban 
populations in modern-day periods live longer than do rural populations, and 
with the exception of HIV/AIDS, exhibit healthier levels across a range of indica- 
tors. Part of the advantage is thought to be a function of environmental factors, 
such as better equipped and a greater concentration of health facilities, and part a 
function of individual factors, such as characteristics among urban dwellers that 
relate to better health, like higher levels of income and education. Yet, the issue of 
health advantage in the developing world is far from settled. Importantly, there 
is wide intra-region variation. Urban areas tend to include subpopulations of 
slum living poor, who constitute a subset of extremely disadvantaged city dwell- 
ers (Montgomery and Hewett 2005). Evidence also suggests that within-region 
inequalities may be widening, perhaps a result of economic globalization, con- 
comitant market transformations, growing returns to education and weakening 
inequality-reducing infrastructure (Goesling 2001; Neckerman and Torche 2007). 
While virtually no studies have compared urban vs. rural poor, data do indicate 
that the former are substantially worse off than the average in urban areas. 

Within-region inequalities in health are compounded by a number of related 
factors. Public services and infrastructure, such as health care 
facilities, safe drinking water and sanitary waste disposal, are often unequally 
distributed within regions, despite their increased availability on average in cities. 
This type of infrastructure is, in turn, exceptionally important in determining 
individual health risks (United Nations Habitat 2003). In addition, across the 
developing world, urban health care systems tend to be characterized by increas- 
ing privatization and monetization, in part due to rising average incomes. While 
the monetization of health care services in cities has resulted in excellent quality 
of care for some, privatized health care delivery limits access for the 
urban poor even if facilities are physically present (Das and Hammar 2007; 
Dussault and Franceschini 2006). 

A reading of the panel's findings indicates that most of the research on rural/urban 
differences in health in developing countries has considered issues that 
affect non-elderly populations, such as communicable diseases, infant mortality, 
reproductive health, and traffic-related injuries and deaths. In developing coun- 
tries, rural/urban discrepancies in health problems typical among older people 
have been almost completely ignored. The issue, however, is of importance and 
becoming more so for a number of reasons, not the least of which is the rapid 
population aging occurring in much of the developing world. A rise in older adults 
means increasing proportions of budgets devoted to this segment of society, bring- 
ing their health needs to the forefront of policy (Zimmer 2006). Some seniors, by 
virtue of where they live, will have advantages over others. In the current study, 
we consider some of these advantages as factors that potentially explain urban vs. 
rural health differences among older adults. 


The Beijing Municipality in Context 


The Beijing municipality is both an interesting and important setting for this 
study. The city is the capital of the People’s Republic of China. The municipality's 
boundary is broader than that of the city, equivalent to a province in China's 
administrative structure, and encompasses an area of about 17,000 square kilo- 
meters. The municipality's population has been growing quickly. According to 
census data, it was about 11 million in 1990 and a little less than 14 million in 
2000. It was estimated at about 17 million in 2007 (China Data Online 2008). 
Although small in relation to the entire population of China, the municipality is 
nonetheless more populous than two-thirds of the world’s countries. In addition, 
it holds special significance given China's recent political and economic history, 
its centrality within the country, and the reality that it sets the pace for the rest of 
the country with respect to social and economic change. 

Specific categorizations for rural and urban within the municipality, and indeed 
for the entire country, are complicated by a system of definitions that differ across 
agency and have changed over time (Montgomery 2008; Chan 2007; Chan and 
Hu 2003). For example, there is an official national government system of rural and 
urban registration, which differs from the National Bureau of Statistics’ definition 
used in censuses and surveys. The NBS system is based largely on population density 
and was newly altered in 2000 (Chan 2007). The official government hukou system 
has, since the establishment of the People’s Republic of China in 1949, been the 
legal division of rural vs. urban residents (Chan and Zhang 1999). The government 
system requires individuals to be registered as belonging to the “agricultural” or 
“non-agricultural” population depending on the official location of their household. 
What makes this legal system important is that agricultural and non-agricultural 
populations are subject to discrete economic, population, social and health policies. 
There are several administration models for city regions (Chan 2007). The Beijing 
municipality is fairly typical in this regard, with administrative definitions for rural 
and urban based on the division into 18 areas, officially classified as city districts 
(shiqu) and rural counties (xian). Shiqu are urbanized areas with high population 
densities and very few if any people actually working in agriculture. Xian are com- 
paratively sparsely populated areas that include a large share of individuals working 
in agriculture. The population of shiqu is mostly registered as being non-agricultural, 
although substantial rural to urban migration from the xian into the core of the city 
means that some people living in a shiqu, and classified as living in an ‘urban’ area 
when it comes to the census definition, are officially registered with the government 
as being part of the agricultural population. These individuals are officially classified 
as “temporary” residents of the city and they do not necessarily receive access to the 
health care services available to the official city residents. 

In contrast to much of the developing world, it is in rural China where health 
care has become more privatized and monetized. Moreover, China’s well-publi- 
cized transition to a market economy has been accompanied by a widening of 
the divide between urban and rural health services (England 2005). Historically, 
China witnessed improvements in health after the establishment of the PRC in 
1949 (Cook and Dummer 2003). Life expectancy within the Beijing Municipality 
is now on par with many developed countries. Much of this progress was at- 
tributed to a system of cooperative medical care and free basic and preventive 
treatment in areas designated as rural by the government (Shi 1993). However, 
the reform era ushered in a decline in cooperative medicine and a subsequent 
increase in privatized fee-for-service practices. Subsequent deleterious effects have 
been well-documented (Cook and Dummer 2003; Liu et al. 2002; Meng, Liu and 
Shi 2000). While access to health service progresses slowly in rural areas of the 
Beijing municipality, urban residents maintain their subsidized care, have access 
to better qualified medical personnel, and are able to draw upon a larger array 
of health resources within closer proximity than their rural counterparts (Beach 
2001; Chan 2007). The data employed for the current study cover a segment of 
the 1990s when urban/rural discrepancies on a social level were widening. The 
city of Beijing was developing rapidly; modern buildings and roads were quickly 
completed, contemporary and rapid transportation expanded, private sector ac- 
tivities picked up, and access to advanced medicine, quality health practitioners 
and technology increased. In contrast, rural areas that surround the city were 
largely involved in agriculture, maintained lower incomes and, by most accounts, 
witnessed deterioration in access to health services. 

Older adults represent a particularly important sub-population within this 
region. China, which has the world’s largest older population, is aging at an ex- 
ceedingly rapid rate. The 1982 census indicated that the proportion of those 55 
and older in the Beijing Municipality at that time was 12 percent, while by the 
2000 census it had increased to 16 percent, and by 2005 it was nearly 20 percent 
(National Bureau of Statistics China, 2010). Although specific projections for 
the municipality are not easy to come by, extrapolation from country-wide data 
suggests the proportion will nearly double to over 35 percent within the next 
30 years. An increasingly larger proportion of functional health problems and 
mortality will be concentrated amongst the older population, which will, in turn, 
have implications for health services and related costs. Hence, understanding the 
mechanisms underlying urban/rural health discrepancies can be important for 
informing policies that can influence the health of elders. 


Methods 


Data 


Data come from the 1992 and 1997 waves of the Beijing Multidimensional Longitudinal 
Study of Aging conducted at the Capital Medical University (formerly the Capital 
University of Medical Sciences) in Beijing (Department of Social Medicine 1995; 
Jiang et al. 2004; Kaneda, Zimmer and Tang 2005; Tang, Jiang and Futatsuka 
2002). The original BMLSA was funded by UNFPA and implemented by the 
Beijing Geriatric Clinical and Research Center at the Capital Medical University 
in conjunction with the Center for Aging Studies at Flinders University of South 
Australia. The 1992 baseline involved an age and sex stratified sample of older 
adults 55+ living in 3 of 18 official administrative regions within the municipality. 
Sampling was conducted by a three-stage stratified random clustering technique 
that was geared towards ensuring representativeness of the older population 
of the Beijing municipality. The first stage was the selection of three regions able 
to represent the municipality with respect to social, economic, demographic and 
geographical characteristics based on information from the 1990 national census. 
Xuan Wu, the third largest urban district (shiqu) by area, is located near the center 
of the city of Beijing. Da Xing is one of the rural counties (xian) located in the 
rural agricultural plains area to the south. Huai Rou is the second rural county 
(xian) located in the mountainous northern part of the municipality. The second 
sampling stage involved the random selection of strata consisting of streets in the 
urban area and towns in the counties. The final stage involved a random target 
selection of 3,614 individuals from a roster of elderly residents from within strata 
with a relatively even distribution of men and women in five-year age cohorts 
(55-59, 60-64, 65-69, 70-74, 75-79 and 80+), and an oversampling within Xuan 
Wu. Of the target sample, 3,257 valid cases were interviewed, resulting in a total 
response rate of 90 percent. Weights are applied to account for the sample 
design and ensure that results are representative of the districts in that year. 

Table 1 further describes the regions. It shows demographic 
information and provides a comparison with the total Beijing municipality and 
the country of China. Xuan Wu's urban character is made clear by a very high 
population density of well over 20,000 persons per square kilometer and virtually 
nobody employed in agriculture. In contrast, Da Xing and Huai Rou have low 
population densities (Huai Rou has only 139 persons per square kilometer) and 
much larger proportions of workers listing agriculture as their industry of employment 
(almost 44 percent in Da Xing). Other indicators shown in Table 1 are consistent 
with what would be expected of rural vs. urban regions. Moreover, the National 
Bureau of Statistics’ definition for urban, used for the 2000 census, defines 100 
percent of Xuan Wu residents as living in an urban metropolitan area compared 
to 28 percent of residents in Da Xing and 39 percent in Huai Rou (Chan 2007). 

Table 1: Demographic Information 

                                        Beijing 
                             China      Municipality   Xuan Wu   Da Xing   Huai Rou 
Population (in 100,000)      1,242.1    HoDUn          53        6.7       3.0 
Sex ratio                    106.3      108.9          107.0     109.3     106.1 
Population density           129        808            21,691    648       139 
Average household size       300        3a             2.9       3.7       3A 
% Single person households   91.5       teal           14.6      Tae       9.7 
% Han Chinese                91.5       95.7           92.6      96.2      92.2 
Birth rate                   11.4       6.0            4.0       7.0       TZ 
Death rate                   5.9        52             6.2       4.9       5.8 
Total fertility rate         ice        A              6         1.0       8 
% Ages 0-14                  22.9       13.6           10.9      18.1      Ties 
% Ages 55+                   14.2       16.3           20.3      13.9      15.0 
% Illiterate ages 15+        9.1        4.9            3.9       7.1       9.6 
% Working in agriculture     64.4       13.0           fl        43.5      30.6 


The 1997 follow-up involved returning to original households. The response rate 
for the 1997 wave was 90 percent. While response rates are respectable by traditional 
survey standards, higher non-response in Xuan Wu is a prominent feature of the 
BMLSA and deserves explanation. The number of interview refusals was small. 
Therefore, loss to follow-up was almost entirely a function of not being found, which 
was mostly due to moving. Compared with younger adults, older adults in China are less 
likely to move from rural to urban areas. The reason for higher non-response in Xuan 
Wu was therefore an elevated rate of movement within the city. Much of this was 
a result of city reconstruction. In all likelihood, the Xuan Wu residents remained 
in urban Beijing, moving in with other family members. In order to adjust results 
for differential non-mortality attrition, an additional weight was calculated using a 
regression procedure that determined the probability of being a respondent according 
to the characteristics age, sex, residence and health status at baseline. The initial 
weight that makes a case representative within the district at baseline was multiplied 
by a function that represented the inverse of the probability of being a follow-up 
respondent, which is to say that respondents with characteristics typical of 
non-respondents were given greater weight. Because residence is one of the characteristics, 
follow-up respondents from Xuan Wu are weighted upwards. 
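
The attrition adjustment described above can be illustrated with a minimal sketch. It assumes the response probabilities have already been predicted by some regression on age, sex, residence and baseline health; the function name, the example weights and the example probabilities are all hypothetical and are not taken from the BMLSA.

```python
def attrition_adjusted_weight(base_weight, p_respond):
    """Multiply the baseline weight by the inverse probability of follow-up response."""
    if not 0.0 < p_respond <= 1.0:
        raise ValueError("p_respond must lie in (0, 1]")
    return base_weight / p_respond

# Hypothetical respondents: (district, baseline weight, predicted response probability).
respondents = [
    ("Xuan Wu", 1.2, 0.80),   # urban case with a lower predicted chance of follow-up
    ("Da Xing", 1.0, 0.95),
    ("Huai Rou", 0.9, 0.96),
]

# Cases that resemble non-respondents (low p_respond) end up with larger final weights.
final_weights = {d: attrition_adjusted_weight(w, p) for d, w, p in respondents}
print(final_weights)
```

Under these made-up inputs the Xuan Wu case is weighted upwards the most, which mirrors the direction of the adjustment described in the text.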


Estimating Life Expectancy and Active Life Expectancy Using a Multistate 
Life Table Approach 


Mortality as a health outcome among older adults is important for obvious reasons. 
But, functional health is equally critical because it provides an indication of the 
degree to which older adults can conduct daily activities and, consequently, the 
degree to which they require assistance. To facilitate examination of health differ- 
ences across mortality and functional health, the current study considers measures 
of life and active life expectancy. The latter has become a widely used tool for as- 
sessing health across population groups in international research (Mathers 2002; 
Saito, Crimmins and Hayward 1999; Saito, Qiao and Jitapunkul 2003). It allows 
for the quantification of differences in health in terms that are not only easily 
understood but easily translated into future health care needs. Relatedly, active life 
expectancy has the advantage of combining information on both mortality and 
morbidity into a single summary indicator. It does this by partitioning years of 
life into states of health, for example, expected number of remaining years of life 
with and without functional limitation. 

Active life expectancy calculations require division of samples into functional 
health categories that represent active and inactive states. We consider someone to 
be functionally healthy or independent if they can conduct usual physical movements 
without help from others, and can complete tasks that are necessary for 
independent living. In the BMLSA, respondents were asked about a series of tasks 
that can be categorized as either activities of daily living, that is, tasks necessary for 
self-maintenance, or more general physical movement tasks that may be necessary 
for independent living (Katz et al. 1963; Nagi 1965). We combine the two types 
of tasks, using six individual items (eating, dressing, getting on and off a bed, 
bathing, walking 300 meters, and walking up and down a flight of stairs), and 
define the active state as the ability to perform all six without help. The inactive 
state is defined as requiring help in performing at least one. 
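
The active/inactive definition can be written as a short coding rule. This is only a sketch: the task labels follow the text, but the data layout (a dictionary mapping each task to whether help is needed) and the function name are assumptions made for illustration.

```python
TASKS = (
    "eating",
    "dressing",
    "getting on and off a bed",
    "bathing",
    "walking 300 meters",
    "walking up and down a flight of stairs",
)

def functional_state(needs_help):
    """Return 'active' if none of the six tasks requires help, otherwise 'inactive'.

    needs_help maps each task name to True (help required) or False.
    """
    missing = [task for task in TASKS if task not in needs_help]
    if missing:
        raise KeyError(f"missing responses for: {missing}")
    return "inactive" if any(needs_help[task] for task in TASKS) else "active"

# Hypothetical respondent who needs help only with stairs.
example = {task: False for task in TASKS}
example["walking up and down a flight of stairs"] = True
print(functional_state(example))
```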

For the baseline state (i), individuals are coded as active or inactive based on 
responses to the six functional health tasks. The follow-up state (j) is based upon a 
combination of survival status and the response to the six functional health tasks at 
follow-up. Specifically, when interviewers returned they determined whether the 
respondent was still alive through interviews with those living in the household 
and others nearby. If still living, the follow-up interview was conducted, and ques- 
tions about the ability to perform the six functional health tasks were repeated 
exactly. Like the baseline state, those who survived are coded as active or inactive 
based on whether they report needing help. Those who did not survive are simply 
coded as being deceased. As such, for each originating state there are three possible 
follow-up outcomes, resulting in a total of six possible transitions (ij). 
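
As a small illustration of the state space, the sketch below enumerates the six possible transitions and tallies them for a handful of records. The record layout and the example records are hypothetical.

```python
from collections import Counter
from itertools import product

ORIGIN_STATES = ("active", "inactive")
FOLLOW_UP_STATES = ("active", "inactive", "deceased")

# Two originating states times three follow-up outcomes gives six transitions (ij).
TRANSITIONS = list(product(ORIGIN_STATES, FOLLOW_UP_STATES))

def follow_up_state(survived, needs_help):
    """Code the follow-up state from survival status and reported need for help."""
    if not survived:
        return "deceased"
    return "inactive" if needs_help else "active"

# Hypothetical records: (baseline state, survived to follow-up, needs help at follow-up).
records = [
    ("active", True, False),
    ("active", True, True),
    ("active", False, False),
    ("inactive", True, False),
    ("inactive", False, False),
]
counts = Counter((origin, follow_up_state(s, h)) for origin, s, h in records)
for transition in TRANSITIONS:
    print(transition, counts[transition])
```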

Several methods have been suggested and compared for the estimation of active 
life or health expectancies, and software is now available that can derive 
these estimates (Laditka and Hayward 2003; Lièvre, Brouard and Heathcote 
2003; Mathers 2002; Saito, Crimmins and Hayward 1999; Wolf and Gill 2007). 
Because of its flexibility in the use of covariates, we adopt a method that applies 
hazard rate estimates as probabilities for inputs into multistate life tables 
(Crimmins, Hayward and Saito 1994; Hayward and Grady 1990). This method 
requires a two-step procedure. The first is the estimation of the rates of transition 
from each originating state (i) to each follow-up state (j), conditional upon a set 
of covariates. We begin with a base model that adjusts urban/rural transition rates 
by age and sex, and is denoted as follows: 


$$m_{ij}(x) = \exp(\beta_{0ij} + \beta_{1ij}\,x + \beta_{2ij}\,\mathrm{Sex} + \beta_{3ij}\,\mathrm{Urban}) \tag{1}$$

where $m_{ij}(x)$ designates the hazard of making a transition to state j for an individual 
beginning in state i within an age interval x. The model provides for 
one estimate of the transition rate for each combination of covariates, meaning 
that the transition rate conditional on individual characteristics is assumed 
to be constant over time. We tested for a variety of functional forms of age 
dependence for all the transitions in the model but in no instance did a more 
complex functional form improve the model’s fit compared to a model where 
the age effect was specified as linear. 
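
To make equation (1) concrete, the sketch below evaluates the active-to-inactive hazard for a rural and an urban woman aged 65. The coefficients are read from the base model reported in Table 4; because that table is damaged in this copy, the assignment of the intercept to this transition is an assumption, and treating x as age in years is likewise an assumption. Function and variable names are illustrative.

```python
import math

# Active-to-inactive coefficients as read from Table 4.
# Assumption: the intercept -8.039 belongs to this transition.
COEF = {"intercept": -8.039, "age": 0.064, "female": 0.594, "urban": -0.943}

def hazard(age, female, urban, coef=COEF):
    """Log-linear hazard of equation (1): exp(b0 + b1*age + b2*Sex + b3*Urban)."""
    return math.exp(
        coef["intercept"]
        + coef["age"] * age
        + coef["female"] * female
        + coef["urban"] * urban
    )

rural = hazard(65, female=1, urban=0)
urban = hazard(65, female=1, urban=1)

# In a log-linear model the urban/rural hazard ratio is exp(b3) at every age.
hazard_ratio = urban / rural
print(round(hazard_ratio, 3))
```

Because the model is log-linear, the ratio does not depend on the age or sex chosen for the comparison; only the urban coefficient matters.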
In order to move to the second step of the estimation, the calculation of 
multistate life tables, the hazard rates ($m_{ij}$) need to be converted to probabilities 
for use in standard life table functions. Hazard rates, which indicate likelihood of a 
transition for individuals within an age group, can be converted to the probability 
that an individual with a given set of characteristics who is an exact age will make 
a transition from i to j between time x and x + n by: 


$$p_{ij}(x,n) = \left[1 + \tfrac{n}{2}\,m_{ij}(x)\right]^{-1}\left[1 - \tfrac{n}{2}\,m_{ij}(x)\right] \tag{2}$$


This calculation provides a matrix of probabilities from which to apply the survi- 
vorship function: 


$$l_{ij}(x+n) = l_i(x)\,p_{ij}(x,n) \tag{3}$$


$l_{ij}(x+n)$ indicates the number of persons in state i at age x who end up in state j at 
age x + n. This function is determined by the product of the number of persons 
in state i at age x and the probability of transition from state i at age x to state j at 
x + n, with probabilities being derived from the hazard rate models. Assuming that 
hazard rates are constant over n, we can assume that a movement from active to 
inactive, inactive to active, or from either originating state to death occurs at the 
exact mid-point of the period, and therefore the total number of person-years lived in 
state i for individuals aged x to x + n is: 


$$L_i(x,n) = n\left(.5\left(l_i(x) + l_i(x+n)\right)\right) \tag{4}$$


The sum of $L_i(x,n)$ beyond age x (denoted as $T_i(x)$) indicates the total number of 
person years lived in state i by those that survive to age x. Thus, the years expected 
in state i at exact age x, indicated as $e_i(x)$, for an individual with a particular set 
of characteristics, is: 


$$e_i(x) = T_i(x) / l(x) \tag{5}$$
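
The chain from hazards to expectancies in equations (2) through (5) can be sketched in a few lines of plain Python. Several choices here are assumptions rather than the authors' implementation: equation (2) is applied in its usual matrix form, P = (I + n/2 M)^-1 (I - n/2 M), with death as an absorbing state; hazards are treated as annual rates with n = 1; the life table is closed at age 100; everyone is assumed to start in the active state; and the rural-male coefficients are read from the damaged Table 4, so the assignment of intercepts to transitions is a guess.

```python
import math

def invert(a):
    """Gauss-Jordan inverse of a small square matrix given as a list of lists."""
    n = len(a)
    aug = [list(map(float, row)) + [float(i == j) for j in range(n)]
           for i, row in enumerate(a)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(aug[r][c]))  # partial pivoting
        aug[c], aug[p] = aug[p], aug[c]
        piv = aug[c][c]
        aug[c] = [v / piv for v in aug[c]]
        for r in range(n):
            if r != c:
                f = aug[r][c]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[c])]
    return [row[n:] for row in aug]

def matmul(a, b):
    """Product of two matrices given as lists of lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def hazards(age):
    """Rural-male hazards from equation (1); intercept placement is an assumption."""
    return {
        "ai": math.exp(-8.039 + 0.064 * age),  # active -> inactive
        "ad": math.exp(-9.577 + 0.097 * age),  # active -> deceased
        "ia": math.exp(-0.803 - 0.036 * age),  # inactive -> active
        "id": math.exp(-4.333 + 0.038 * age),  # inactive -> deceased
    }

def transition_matrix(m, n=1.0):
    """Matrix form of equation (2). States: 0 active, 1 inactive, 2 deceased."""
    M = [[m["ai"] + m["ad"], -m["ai"], -m["ad"]],
         [-m["ia"], m["ia"] + m["id"], -m["id"]],
         [0.0, 0.0, 0.0]]
    plus = [[(i == j) + n / 2 * M[i][j] for j in range(3)] for i in range(3)]
    minus = [[(i == j) - n / 2 * M[i][j] for j in range(3)] for i in range(3)]
    return matmul(invert(plus), minus)

def expectancies(start_age=55, end_age=100, n=1.0, radix=(1.0, 0.0)):
    """Return expected (active, inactive) years at start_age."""
    l = list(radix)            # persons alive in each living state at age x
    alive_at_start = sum(l)
    T = [0.0, 0.0]
    age = start_age
    while age < end_age:
        P = transition_matrix(hazards(age), n)
        nxt = [sum(l[i] * P[i][j] for i in range(2)) for j in range(2)]  # eq. (3)
        for i in range(2):
            T[i] += n * 0.5 * (l[i] + nxt[i])                            # eq. (4)
        l = nxt
        age += n
    return T[0] / alive_at_start, T[1] / alive_at_start                  # eq. (5)

active_years, inactive_years = expectancies()
print(round(active_years, 1), round(inactive_years, 1))
```

The sum of the two returned values is total life expectancy at the starting age. Since the radix, interval width and closing age are illustrative, the output should not be expected to reproduce the published estimates exactly.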


The base model shown in equation (1) will display an urban advantage in 
mortality and functional health. Follow-up models will adjust for characteristics 
that represent domains of socio-economic status, access, social support, behaviors 
and disease profile. As various sets of characteristics are added to the base, we 
expect coefficients that describe discrepancies in transition rates across rural and 
urban regions to be reduced and thus expected years of life and active life should 
converge. We conduct this stage of the analysis by first adding covariates from each 
domain separately, then testing an intermediate model that includes a selection of 
covariates that have a likely causal association with health outcomes, and finally 
a full model that adds all covariates simultaneously. In order to show the urban 
advantage that is accounted for by covariates represented by specific domains, we 
construct a series of multistate life tables for urban and rural areas based on transi- 
tion probabilities derived when adjusting for age, sex, plus significant covariates 
within each domain separately. We also construct life tables that adjust for the 
covariates across domains that are statistically significant in the full model. 
This technique for estimating life and active life expectancies makes several 
assumptions, in particular that only one transition takes place over the interval and 
that this transition occurs midway through the interval. These assumptions can result in 
an underestimation of the number of transitions (Laditka and Hayward 2003). 
However, the underestimation should not considerably influence estimates among 
survivors if hazards are constant over the time period and there is equal underes- 
timation of transitions from the active to inactive state as from the inactive to the 
active state. More importantly, estimates of covariates are less likely to be biased, 
thus our urban/rural effects are likely to indicate valid influences on hazard prob- 
abilities. As confirmation of this, the hazard rate estimates that are found in the 
current analysis conform qualitatively to those from an earlier analysis that consid- 
ered a multinomial logit model for transition probabilities (Zimmer et al. 2004). 


Covariates 


Social Support 

First, marital status is coded as married or not married. The next two items are 
more qualitative. An item on how often individuals are consulted when making 
family decisions is employed to indicate perceived involvement in family deci- 
sion making. Those consulted “most of the time” are considered to be involved. 
Additionally, whether an individual has a confidant on whom they can rely in 
times of need is determined with a survey question that asks, “Do you have anyone 
that you feel very close and intimate with?” 


Socio-economic Status 

Education is measured as having any vs. no formal education. Occupation codes are 
difficult to determine. However, we employ one item that classified “occupation most 
of one’s life” in broad categories and a second that asked whether the work the individual 
has done most of their lives involved heavy labor. Occupation is coded as white 
collar and non-heavy labor vs. non-white collar and/or heavy labor. Respondents 
were allowed to identify housework as their occupation, and 14 percent did so. They 
are coded according to their response to the second item. Having financial difficulty 
is considered a proxy for income. Specific income or consumption information was 
not collected in the survey, and income measures that may be available would be 
difficult to interpret among an older population in China. Respondents were asked 
whether their income meets the needs of their daily living expenses. We code those 
who report they have enough money as having no financial difficulty. 


Behaviors 
First, respondents are coded into ever smokers or non-smokers. Second, they are 
coded into drinkers of at least one drink a day vs. those who drink less. Third, questions 
on diet asked about eating fruits and vegetables daily. We code those who 
answered “yes” as having a healthy diet. 


Access 

A questionnaire item asked who pays for medical expenses. Individuals responded 
in terms of the percent of costs they themselves cover. Individuals reporting that 
they personally pay for less than 100 percent are likely covered by some type of 
insurance, and having insurance is the first indicator. Second, individuals were 
asked whether they received a medical check-up over the past year, which is an 
indication of obtaining regular health service. Third, they were asked whether 
they consider the cost of medical expenses to be a burden. Although this item is 
correlated with the one on insurance (r = .361, p < .01), the correlation is not as 
strong as might be expected, indicating that financial burden is separate. Fourth, 
two questions are combined to indicate time costs related to access. Individuals 
were asked whether they have a problem either with the time it takes to get to the 
nearest medical facility or the time it takes waiting to be seen by a doctor at their 
medical facility. Those that respond in the negative for both are considered to have 
no excess time costs when it comes to accessing health care. 
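
The four access indicators can be expressed as a simple coding function. This is a sketch under assumed variable names and response formats; the questionnaire's actual coding scheme is not given in the text.

```python
def code_access(percent_self_paid, had_checkup, cost_is_burden,
                travel_time_problem, wait_time_problem):
    """Derive the four access indicators described in the text.

    percent_self_paid: share of medical costs the respondent covers (0 to 100).
    The remaining arguments are booleans taken from yes/no items.
    """
    return {
        # Paying less than 100 percent is taken to imply some insurance coverage.
        "insured": percent_self_paid < 100,
        "checkup_last_year": bool(had_checkup),
        "expenses_not_burden": not cost_is_burden,
        # No excess time costs only if neither travel nor waiting time is a problem.
        "no_excess_time_costs": not (travel_time_problem or wait_time_problem),
    }

# Hypothetical respondent who pays 20 percent of costs and reports long waits.
print(code_access(20, True, False, False, True))
```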


Disease 

If respondents report ever having coronary heart disease, lung disorders including 
chronic bronchitis, emphysema or tuberculosis, stroke, diabetes or cancer, they 
are coded as having a life-threatening disease. If they report ever having hyperten- 
sion, asthma, ulcers, migraine headaches, arthritis, glaucoma or cataracts, they are 
considered to have a debilitating disease. 


Results 
Basic Urban/Rural Variations 


Table 2 presents mortality and functional independence at follow-up by baseline 
status. Health outcomes are dependent on baseline status. Mortality is also a func- 
tion of baseline status. In both urban and rural areas, the percent dying was several 
times higher among the inactive group. Initial findings suggest urbanites have an 
advantage. A higher percent of those in urban areas active at baseline were active 
at follow-up (about 83 vs. 70 percent), while a lower percent died (about 11 vs. 
17 percent). Among those inactive at baseline, the association is not as transparent 
and is not statistically significant. 

Table 3 presents urban vs. rural distributions for covariates. All are measured 
at baseline. Results illustrate drastically different characteristics among urban vs. 
rural residents. With the exception of sex, differences are statistically significant. 
Differences in some indicators are particularly dramatic. For example, 73 percent 
of urbanites have formal educations compared to 27 percent for those in rural 
areas. Variation in access confirms vast differences in formal health care opportunities. 
Differences in social support and behaviors are also significant. Despite the 
apparent advantage of urbanites across other domains, rural elders are less likely 
to report a life threatening or debilitating chronic disease. 
Table 2: Functional and Mortality Status 

                        State at Baseline 
Follow-up State         Active        Inactive 
Rural                   (N=811)       (N=244) 
  Active %              70.4          15.8 
  Inactive              13.1          30.2 
  Did not survive       16.5          54.0 
  Total                 100.0         100.0 
Urban                   (N=1,529)     (N=255) 
  Active %              83.0          14.1 
  Inactive              5.9           24.2 
  Did not survive       11.1          61.6 
  Total                 100.0         100.0 


Life Expectancies and Active Life Expectancies 


Results of the base model are shown in Table 4. Living in an urban area signifi- 
cantly increases the hazard of remaining active relative to inactive and remaining 
active relative to dying. This means that urban residents classified as active at 
baseline are advantaged: they remain active for longer before becoming either 
inactive or dying. In contrast, residence has little impact on transitions from the 
inactive state. Age and sex have expected effects. 
Figure 1 shows resultant life expectancy and active life expectancy estimates. The 
urban life expectancy advantage presented in the left-hand chart is robust. The order 
from highest to lowest life expectancy is: urban females, urban males, rural females 
and rural males. For example, at age 55 an urban woman can expect to live about 
23 more years, until the age of 78. Her rural counterpart can expect about 19 more 
years, meaning that the urban woman has a four-year advantage. The net difference 
between urban and rural declines with age, but the urban advantage remains evident. 
Interactions between gender and residence were tested and were insignificant. 
Urbanites also have an unambiguous active life expectancy advantage. At age 
55 a man or woman living in an urban area can expect five to six more active 
years than their rural counterparts. The gap remains apparent with advancing age, 
and even at age 85 the urban man or woman can expect two more active years. 
Active life does not differ much between men and women. But because women 
live longer than men, women are inactive for a greater proportion 
of their remaining years. 


Accounting for the Urban Advantage 


Results from the intermediate and full models are shown in Table 5. The effects 
of residence are generally not as strong in the intermediate and full model as in 
the base model. Living in an urban area decreases the hazard of moving from ac- 
tive to inactive, but other urban estimates are insignificant. The magnitude of the 
significant coefficient has been reduced from the base model (-.943 to -.555 and 
-.578). It appears then that some of the urban advantage is explained by the covariates, 
particularly those entered first in the intermediate model. How this translates 
precisely into differences in years of life and years of active life will be shown shortly. 


Table 3: Distribution of Covariates 

                                                Rural     Urban 
Mean age                                        65.4      64.6 
% Female                                        49.0      50.5 
Social Support 
  % Married                                     72.6      81.0 
  % Consulted                                   45.9      53.4 
  % Having a confidant                          67.2      82.0 
Socio-economic Status 
  % Having education                            27.2      73.4 
  % White collar/ Non-heavy labor occupation    3.8       40.2 
  % No financial difficulties                   38.4      65.3 
Behaviors 
  % Never smoked                                50.5      D010 
  % Not drinking daily                          60.1      ioe 
  % Eats fruits and vegetables daily            28.3      67.2 
Access 
  % Having insurance                            10.0      91.5 
  % Had check-up in last year                   1.4       2146 
  % Medical expenses not a burden               42.9      14.7 
  % No excess time costs                        13.4      17.0 
Diseases 
  % Life-threatening disease                    24.6      42.2 
  % Debilitating disease                        22.9      45.8 

Rural N = 1,055 
Urban N = 1,784 

Age and sex effects are generally significant and in expected directions. Similar 
to the base model, older individuals are more likely to die, transition to the inactive 
state and remain in the inactive state. Women are less likely to die, but more likely 
to become inactive or remain inactive. Social support variables generally relate to 
a decreased hazard of dying. There are substantial socio-economic effects with 
those with higher status having decreased hazards of moving to the inactive state 
or dying. Socio-economic effects among those originating in the inactive state are 
insignificant. Behaviors have limited influence with the exception that not smoking 
delays death among those who are active. There are some effects of access. Having 
insurance and no excess time costs increase the hazard of remaining active relative to 
dying, while the latter has a strong positive impact on the hazard of moving from inactive 
to active and a negative impact on the hazard of dying from the inactive state. 


Table 4: Parameter Estimates 

                                        Transitions 
                      Active to    Active to    Inactive to    Inactive to 
                      Inactive     Deceased     Active         Deceased 
Residence (1=urban)   -.943**      oii          -.097          -.013 
Age                   +.064**      +.097**      -.036*         +.038** 
Sex (1 = female)      +.594**      -.370**      +.588*         -.304* 
Intercept             -8.039       -9.577       -.803          -4.333 

By subtracting estimated years in rural from urban areas across ages, Figure 2 
shows the remaining life expectancy advantage of urbanites when adjusting for 
specific domains and for all domains simultaneously. At age 55 the urban advantage 
determined by the base model is 4.6 years. To be specific, life expectancy for a 
55-year-old in the urban area is estimated to be 22 years, compared to 17.4 years for 
their rural counterparts. With increasing age, the net urban advantage decreases, 
but a distinction remains. 

Although the multi-state life tables based on rates adjusting for covariates do 
not completely eliminate this advantage, some domains have an influence. At 
age 55 the urban advantage when adjusting for access decreases to 2.7 years. In 
other words, if 55-year-old urban and rural residents had the same level of ac- 
cess, as defined with measures available to us, the urban resident could expect 
to live 2.7 years longer than their rural counterpart, implying that the other 1.9 
years is accounted for by the access measures at our disposal. When adjusting for 
socio-economic covariates, the life expectancy advantage declines to 3.3 years. 
Adjustments for social support and behavior do little to change the urban advan- 
tage. Adjustments for diseases, in contrast, increase the life expectancy advantage 
of urbanites, suggesting that disease differences are benefiting rural residents and 
suppressing a potentially larger urban advantage. This is due to the higher prevalence 
of disease in urban areas (as shown in Table 3). Also shown in the figure is the 
difference when adjusting for all covariates simultaneously. These results indicate 
that a 55-year-old urbanite can expect to live 2.5 more years than his or her rural 
counterpart and the advantage is reduced to one year by age 85. 
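
The share of the life expectancy advantage accounted for by each adjustment follows directly from the figures quoted above. The short calculation below is only a worked restatement of that arithmetic; the variable names are illustrative.

```python
BASE_ADVANTAGE = 4.6  # urban minus rural life expectancy at age 55, base model (years)

# Remaining urban advantage at age 55 after each adjustment (years), from the text.
ADJUSTED = {
    "access": 2.7,
    "socio-economic status": 3.3,
    "all covariates": 2.5,
}

def share_explained(base, adjusted):
    """Fraction of the base advantage that disappears after adjustment."""
    return (base - adjusted) / base

shares = {
    name: round(100 * share_explained(BASE_ADVANTAGE, value))
    for name, value in ADJUSTED.items()
}
print(shares)
```

This gives roughly 41 percent for access and 28 percent for socio-economic status, in line with the approximate 40 and 30 percent reductions cited in the Discussion, and about 46 percent when all covariates are entered together.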

The right-hand side of the figure shows results for active life. The base advantage 
at age 55 is 5.5 years. By age 85 it drops to 2.2 years. Adjustments for covariates 
from all domains simultaneously reduce the advantage to 3.3 years for those age 
55 and 1.6 years for those age 85. Access and socio-economic status each accounts 
for about 1.8 years of the advantage at age 55, and about .4 years at age 85. Social 
support and behavior differences explain little and diseases have a suppressing effect. 


Discussion 


Our results indicate that the mortality and functional health experiences of older urban 
and rural adults in the Beijing municipality diverge, with urbanites in the Xuan 
Wu district maintaining a robust advantage in both life expectancy and expected 
years of active (or functionally independent) life in comparison to those living in 
the Da Xing and Huai Rou rural areas. Urban and rural elders differed in other 
ways. Urban elders were far more likely to have health insurance, for example. 
Not only does the inequality in resources favor urbanites, but, as many have commented, 
the weakening of cooperative medicine and subsequent introduction of a 
fee-for-service system slowed advances in availability in the rural areas in the 1990s, 
the time during which the current data were collected (Beach 2001; Cook and 
Dummer 2003). However, differences in the health service environment, troubling 
as they may be, cannot completely explain health disparities. As has been shown 
with some certainty in the United States, availability across urban and rural com- 
munities does not translate neatly into health advantages in one area over another 
(Hayward et al. 1997). A range of additional influences may determine how those 
in different regions negotiate their health environment. For example, health service 
may be more readily available in urban areas, but expenses related to health care may 
mean that the urban poor are unable to afford treatment or treatment of reasonable 
quality (Dussault and Franceschini 2006; Montgomery et al. 2005). 

We attempted to account for the urban advantage by adjusting life expectancy and 
active life expectancy using a series of covariates that reflect differences in the social 
composition of urban and rural areas. Controlling for access variables but no others, 
the urban advantage in life expectancy was reduced by 40 percent. Controlling for 
socio-economic status variables but no others, the urban advantage was reduced 
by 30 percent. While the full model indicated that some of this explained variance 
is shared, several access and socio-economic status factors have strong independent 
influences and their causal connections with health outcomes can be fairly well 
defended, pointing to the impact of education and job opportunities as well as the 
benefits of a publicly funded health service program that includes broader health 
insurance and better access to health facilities and practitioners. In contrast, social 
support and health behavior explained very little of the discrepancy, results that 
differ somewhat from those found in the United States. One possible reason for a 
lack of association in the Chinese context is that normative ideals of filial piety that 
translate into high levels of support for older adults in China occur across places of 
residence (Bian, Logan and Bian 1998; Whyte 2003). 

Our findings bring up several additional issues that deserve further discussion. 
First, we found that controlling for diseases had the effect of augmenting the 
urban advantage. Urbanites were in fact more likely to report life threatening and 
debilitating diseases, a result that may be a function of several factors, including 
the above mentioned behavioral differences, longer survival in urban areas, differ- 
ence in the tendency to report diseases, which itself may be a function of access 
to health service, and even effects of pollution on incidence of chronic conditions. 

Second, the urban advantage in mortality and functional independence is 
primarily a function of the impact of urban residence on favorable transition rates 
among those originating in an active state, while urban residence has very little 
impact on transitions originating from an inactive state. Moreover, while socio- 
economic effects were strong when predicting transitions from the active state, 
they were less substantial and not significant when predicting transitions from the 
inactive state, results that support earlier research across a number of countries 
that has shown the impact of socio-economic status on functional health to be
dependent on originating state (Grundy and Glaser 2000; Liang, Liu and Gu 2001;
Zimmer and House 2003). This suggests the possibility of different processes for 
onset vs. progression of functional disorder. This idea is not new and has been 
discussed in other studies with respect to the impact of socio-economic inequal- 
ity. Borrowing from this discussion, an explanation could be that urban residence 
results in delay of the onset, but when urbanites do become functionally unhealthy 
the severity of their disorder is great, and recovery and survival not probable. 

Third, our results contrast with an earlier study by Zeng et al. (2002), who
showed rural residents across 22 states of China to be in better functional health
than their urban counterparts, a pattern they speculate is due to agricultural
labor offering health advantages to the rural population. The difference in our
findings may be a function of several factors, but we believe that a probable one is 
our focus on the Beijing municipality. Specifically, the differences in environments 
between rural and urban areas in the Beijing municipality are particularly stark, 
and rural elders living around but not in Beijing are particularly disadvantaged in 
comparison to their urban counterparts. This was clearly seen in our descriptive 
statistics (Table 3). Therefore, it is uncertain whether rural/urban discrepancies of 
the magnitude shown also exist in other provinces around the country or even in 
other regions of Asia, although the current study does provide a baseline against 
which to compare these differences. 

There are several limitations worth noting. A study of the effect of residence on
health outcomes is generally unable to determine causal direction. It is often uncertain
whether lower socio-economic status is caused by living in rural areas or, alter- 
natively, if lower socio-economic status individuals are somehow drawn to rural 
areas. However, in the case of the Beijing municipality, this limitation is con- 
stricted first by limited migration among older adults and second by the fact that 
living in an area classified by the government as rural kept people from obtaining 
high socio-economic status in some ways, for example, by limiting their access to 
non-agricultural employment and education. The very low non-response rates in 
our own data attest to the limited movement of older people across areas. Higher 
non-response in urban areas is likely a function of within-region movement, in 
turn a consequence of a rapidly changing urban environment, rather than urban 
to rural migration. Of greater concern is the potential overlap that exists between 
indicators in different domains. It is possible that socio-economic status explains 
some of the effect of health care access on health. While it is beyond the scope of 
the current study to examine these interactions in much detail, we did find that 
there are fairly robust independent effects. 

The greatest limitation to the multistate life table technique employed here 
lies in the assumption that only one transition is completed over an observation 
period. While this assumption is not critical with respect to the effects of covari- 
ates, it will result in an underestimation of the number of transitions (Laditka and 
Hayward 2003; Wolf and Gill 2007). Although the underestimation should be
minimal for those who remain alive, given that hazard rates are likely to be fairly 
even throughout the period, it may be a greater problem in the case of death. This 
is because those who are in the active state at time of origin and die before
follow-up may have experienced a period of inactivity before death, which would not
be captured in this data. Clearly, frequent follow-ups and a shorter time interval 
make estimates more reliable, although such frequency is rarely available in surveys 
of older adults, particularly in China and other developing societies. 

In conclusion, our data capture conditions in the early to mid-1990s within a 
limited but important region of China. China and the Beijing municipality are in 
some ways unique in that, in contrast to much of the developing world, increased 
privatization of health care has resulted in its monetization in rural rather than 
urban areas, which has negatively affected access for rural residents. Analyses based 
on more recent data are needed to test whether the widening of health discrepancies
is continuing, and data from a wider geographic sample will certainly be necessary to 
test whether our findings can be generalized to other regions. In this sense, our study
can serve as a baseline against which to compare and monitor trends. Moreover, our
results may provide a sense of what the future holds for the rest of China as the
country continues its rapid development and indeed for other developing countries 
in Asia. In this respect, we conclude that the main finding of our analysis is a strong 
mortality and functional health benefit that accrued to those in urban areas, some 
of which is explained by better access to service for individuals living in urban areas 
and higher socio-economic status among urbanites, both factors which themselves 
are important social determinants of health. Therefore, the implications of this for 
China, and possibly other countries in the region, are that investments in rural areas 
with respect to health infrastructure and actions that lead to greater socio-economic 
equity among individuals can be important for reducing health inequalities. 

For those who wish not only to understand the world, but to change it, the key
question about social movements of the oppressed is why they (sometimes) suc- 
ceed, and what can be done to make future movements more likely to succeed. 
This is not the question at the center of most academic social movement literature, 
but it is the focus of Marshall Ganz’s analysis of the 1960s’ California farm worker 
movement, their battles with growers and Teamsters, their strikes and boycotts. 

The other prominent book that poses the question of why movements succeed 
is Frances Fox Piven and Richard A. Cloward’s Poor People’s Movements: Why They
Succeed, How They Fail (Vintage 1979); interestingly, both Ganz and Piven-Cloward 
were activists in the movements they studied. Evidently significant involvement in a 
movement shapes what social scientists see as the most important, and most interest- 
ing, question. Ganz’s answer is in some sense diametrically opposed to that of Piven 
and Cloward. Piven and Cloward argued “that efforts to build organizations are 
futile;”(xxi) in opposition to this, Ganz writes that “the UFW succeeded, while the 
rival AFL-CIO and Teamsters failed, because the UFW’s leadership devised more 
effective strategy, in fact a stream of effective strategy.”(8) 

Ganz seeks to show not simply that the UFW leadership devised more effec- 
tive strategy, but also why it was able to do so. His answer, presented as a general 
theoretical argument in the introduction, and consistently developed through a 
fascinating and compelling case study, focuses on the social structure of leadership 
and strategy. Certain social structural conditions make it much more likely that 
a movement will develop effective strategy; understanding those conditions can 
guide future movements. 

One crucial element of success is relying on a leadership team, whose delibera- 
tions are open for discussion and disagreement, such that learning takes place 
in the course of the exchanges. The contrast is with a hierarchical or centralized 
operation where no one has the room to disagree with the leader. 

A second strength is a team that includes and fully incorporates people with 
a range of different perspectives and structural circumstances, which leads to 
informed consideration of the full range of options and sensible evaluation of the 
effects on, and responses from, varying constituencies. 

Third is the level of motivation and commitment by those involved, not simply
in terms of self-expressed motivation, and not even in levels of time and sacrifice 
(although that is important), but also in the sense of having no option but to go 
forward. Spanish conquistador Cortes burned his boats so his army would have 
no possibility of retreat; Ganz argues that social movement leaders in similar 
circumstances are more likely to find creative strategies for success. 

Finally, and in some sense the underpinning to all the other factors, move- 
ments succeed if they incorporate and rely on those they are trying to organize, 
not simply as bodies to show up at demonstrations, but as the source of whatever 
funds and resources will be used. The AFL-CIO lost, at least in part, because it 
received ample resources from central headquarters, and thus was answerable to 
George Meany in Washington, D.C. Chavez and the farm workers won, at least 
in part, because they lived on the same strike pay as the workers, and depended on 
the meager contributions that farm workers could provide. Aldon Morris makes 
a similar point in The Origins of the Civil Rights Movement (Free Press 1984): 
ordinary blacks, not foundations, funded the early movement. 

Why David Sometimes Wins asks a specific question: why did this group succeed,
and not another? It makes no sustained effort to explain why farm worker organizing
succeeded in the 1960s and not in the 1930s, or why farm workers are far less union- 
ized than auto workers, or any of a range of similar questions that could be asked. 

For an observer of today’s labor scene, there are fascinating parallels between 
the 1960s-70s battles of the Teamsters and farm workers, and today’s battles 
between SEIU and the breakaway National Union of Healthcare Workers. The 
Teamsters then, and SEIU today, emphasize the strength, resources and power 
of their unions; both also operated by making deals with employers. The farm 
workers union then, and NUHW today, had far fewer resources, but much more 
committed members and a greater reliance on a bottom-up approach. (On the 
other hand, the Teamster leadership then was essentially all white men, and SEIU 
leadership today is far more diverse.) The comparison is all the more poignant 
in that Eliseo Medina, a young farm worker activist then, is today an Executive
Vice-President of SEIU charged with stopping the NUHW insurgency.


Battle Cries: Black Women and Intimate Partner Abuse 
By Hillary Potter 


New York University Press. 2008. 304 pages. $75 cloth, $23 paper. 
Reviewer: Eve Waltermaurer, State University of New York, New Paltz 


The rise of the feminist movement resulted in increased attention to intimate
partner abuse both in research and practice. However, despite clear evidence that black
women are at a higher risk of domestic violence than white women, much of the 
progress toward preventing this violence against women, attributable to the
feminist movement, has been made by white, primarily middle-class women for white,
primarily middle-class victims. As a result, most of our understanding about the 
specific experience of domestic violence for black women has been theoretical 
and many of our efforts to prevent domestic violence have failed this population. 
In Battle Cries: Black Women and Intimate Partner Abuse, author Hillary Potter
provides an analysis of in-depth interviews with 40 black women who have expe- 
rienced intimate partner abuse. This study offers important information that has 
been lacking regarding the nature, reactions and implications of intimate violence 
as experienced by this specific group of women who must navigate through issues 
of race, gender and often social class. 

For Battle Cries, Potter’s research participants were drawn from a purposive 
sample of women who responded to newspaper advertisements about the study 
and a snowball sample in which the women who contacted Potter were asked if 
they knew of other appropriate participants. This second method provided Potter 
with the fortunate opportunity of interviewing four mother-daughter pairs, provid- 
ing insight on the cycle of abuse in a highly intimate manner. From these in-depth 
interviews Potter defines the abused black women within the parameters of what she 
terms “dynamic resistance.” She finds from her interviews that women do not see 
themselves as victims of, but rather as resisters to, the violence they endure. In addi- 
tion to resisting further partner violence, these women also see themselves needing 
to resist family members, religious leaders, non-black partner violence victims and 
the criminal justice system, a unique position due to their race and culture. 

Potter applies a feminist framework to guide her research where individual 
experiences and patterns in those experiences are extracted from the women’s 
stories. Her approach provides us with a unique insight into what is driving the 
consistent quantitative findings regarding risk factors of intimate abuse. The book 
looks beyond the risk of age, race and marital status into how and why these fac- 
tors are continuously identified as risks. For example, her interviews help explain 
the nuances to the statistical findings that black women are more likely to “fight 
back.” As a result of a number of subjects detailing their willingness to physically 
respond to their abusers, Potter writes, “The semantics of viewing the abuse as
fights and battles redefines the circumstance of ‘woman battering’ for battered 
Black women, making abuse equal to being solicited into physical conflict with 
a stranger.” This redefinition influences both how the women and how outsiders, 
such as those in the criminal justice system, view their abuse. 

The author believes that to understand how black women experience intimate
partner violence, we must hear their stories while considering the unique role they 
play in our society. This comes across quite clearly with the exception of two key areas. 

The chapter, Surviving Childhood, stood out as the least developed overall. 
While it is unquestionable that the experiences these women had as black, female 
children tie very much into their perceptions of both their gender and race, this 
chapter is missing a significant gender or race analysis. The author’s argument for 
the uniqueness of the black women’s experience is weakened by her argument
for the generalizability of her findings, that “in a similar manner, other women 
of color, such as Latinas, Native American women, Asian American women and 
immigrant women (of color) can easily be placed alongside Black women in this 
analysis.” (20) If we accept Potter's argument that the experiences of black females 
play a significant role in how they see themselves in terms of victimization, it 
would seem to follow that we should explore other ethnic groups in a similar fash- 
ion. Clearly, these other women have unique cultural experiences that potentially 
affect how they view themselves relative to victimization. 

Battle Cries is well written, providing a resource for students new to the fields 
of violence against women, criminology, social problems, and race and ethnicity 
while also serving as an excellent tool for those who have been actively engaged 
in intimate partner violence research or intervention. This book provides clear 
guidance about how to better respond to the experiences of partner violence in 
the lives of black women, while at the same time it illustrates the need for a similar 
analysis of abused women from other cultures and races. 





The Catholic Social Imagination: Activism and the Just Society in
Mexico and the United States 
By Joseph M. Palacios 
University of Chicago Press. 2007. 320 pages. $65 cloth, $25 paper. 


Reviewer: Gary Adler, University of Arizona 


How does a religious tradition, characterized by hierarchical authority and doc- 
trinal tightening, translate into strikingly different initiatives for social justice in 
two neighboring countries? This is the provocative question that Palacios addresses 
by comparing the history, institutional structures and cultural meanings of the 
Catholic social imagination in Mexico and the United States. Conceptually, the 
Catholic social imagination is a “bounded cultural space” (5) defined by the stra- 
tegic and normative orientations of theological doctrine but shaped by lived ex- 
perience. Much like the particularized Catholic social imagination itself, though, 
the path of this argument accents some facets, underplays others, and struggles to 
become a coherent whole. 

This nationally comparative project begins with a symbolically and historically
loaded turning point: the “Jubilee Year” of 2000, during which the Catholic 
Church focused on bringing its social doctrine into the limelight at the exact 
moment that publicly religious presidential candidates were elected in both coun- 
tries. At similar celebration events in the two countries, differences abound in 
theological content, the demographics of participants, visions of social reform 
and legitimate social strategies. Before moving to case studies to explore the roots 
of these differences, Palacios provides an overview of official Catholic social
doctrine that is comprehensive, if not confusing, in its lists of principles, norms, and
authoritative documents. The multiplicity of messages and logics internal to the
doctrine deserves more analysis than it receives here.

The two national case studies use organizations and activists to show ways of
“implementing” Catholic social doctrine. In Mexico, a history of laicidad (strict 
limitations on the public practice and influence of religion) couples with a na- 
tional religious hierarchy focused on non-political pastoral work to produce a 
social imagination that is deeply Catholic, but hardly civic. Local Catholics who 
grasp the social imagination tend to respond by following one of two routes: into 
public activism that sheds connections with local parish life or into local charitable 
work limited to the parish boundaries. With few exceptions, activists simply do 
not have the organizations and cultural tools to be both Catholic and civic at the 
same time. In the United States, many more options abound for Catholics pursu- 
ing social justice. Given a history of open civil society, strong local congregations 
and traditions of public Catholic activism (think Dorothy Day and Cesar Chavez), 
Catholics may engage in a range of activities, from professional social ministry to
neighborhood organizing in the tradition of Saul Alinsky. All this activity is aided
by an Americanized Catholic social theology, in contrast to Mexico’s, which
is “more Roman than Rome.”(12) American Catholics seem to have more religious 
space for developing alternative theological interpretations and more civic space 
for developing an array of organizational responses. 

A strength of this book is tackling a puzzle that might animate more study of 
international religious groups: how national traditions can simultaneously share 
similar cultural roots but look so different on the ground. Unfortunately, beyond 
the details of the two cases, the theoretical framework presented here may not 
help. A recurring concept of “social opportunity structure” attempts to deal with 
the cultural elements that its social movement cousin does not engage, but the 
extensive fieldwork yields too few insights into how levels of culture work to en- 
able or constrain action. A schema offered for understanding variation in social 
justice activities relies on two axes, one concerning the normative focus of change 
(integral vs. structural) and one concerning the arena of change (church vs. world). 
This classification structure generally works for locating social justice activities that 
are found in the two countries, but it comes with a few hitches. First, the author 
suggests that the intersection of these axes is where an authentic Catholic social 
justice locus would be; however, that center is more an ideal than it is a useful type. 
Second, this categorization scheme does not link with the institutional, historical 
and cultural forces that are mentioned throughout the book. How did particular 
social justice activities develop over time, and did their development shift the 
field of social imagination? It is not surprising that differences in national history, 
institutional structures and theology matter, but we would like to know which 
elements matter more and when in national history each element came to matter. 
Third, the schema does not deal with a lurking question important to both social 
movement and religion scholars: why do so many Catholics in both countries fail
to participate in any social justice activity, opting for a more privatized faith that 
ignores the social doctrine altogether? The Catholic social imagination may exist, 
but given the many meanings and activities it purports to cover, we might wonder 
whether the concept itself needs re-imagining. 





The Plot to Kill God: Findings from the Soviet Experiment in Secularization 
By Paul Froese 
University of California Press. 2008. 264 pages. $55 cloth, $21.95 paper. 


Reviewer: Ian Conlon, University of North Carolina at Chapel Hill 


The rise and fall of the Soviet Union offers a valuable case study for sociologists, 
especially those interested in religion. The Soviet regime, in power for nearly 70 
years, embarked on an unprecedented social experiment: the attempted extirpa- 
tion not only of organized religion but of the religious impulse altogether. 

How did the experiment fare? Not so well, according to a new book by Paul 
Froese. In The Plot to Kill God: Findings from the Soviet Experiment in Secularization, 
the author explores the unyielding and often brutal efforts undertaken by the 
Soviets to smother the religious impulse. While largely successful in their assault 
on religious supply, the Soviets failed miserably to extinguish religious demand, 
which despite repression and stigmatization, persisted under Soviet rule and ex- 
perienced a resurgence thereafter. 

The Soviet Secularization Experiment began in the 1920s with a sustained, and
largely successful, assault on religious supply. Soviet officials instituted numerous
regulations on religious institutions. The Russian Orthodox Church bore
the brunt of the assault; its resources were seized, its buildings destroyed, and its 
leaders arrested. As a result, 97 percent of Russian Orthodox churches operating 
in 1916 were shut down by 1940. The dismantling of religious supply, however,
failed to quash levels of theistic belief, which proved resilient.

When it became clear that disrupting religious supply alone would not suffice,
Soviet officials turned their attention to religious demand. If Soviet citizens
would not relinquish their religious identities, Soviet officials thought, perhaps 
the object of their worship could be changed. To this end, the League of Militant 
Atheists was born. In operation until 1941, the league was a propaganda arm of 
the Soviet regime whose primary directive was to convince the citizenry that reli- 
gion was scientifically falsifiable. In addition, they were to spread the alternative, 
state-sanctioned doctrine of “scientific atheism,” which combined the gospel of 
socialist utopianism with the mandate to spread the message of atheism. Religious 
holidays were co-opted by the regime. Communist luminaries were deified. In the end,
however, the regime's atheistic crusade, complete with its attendant homily and 
liturgy, failed to resonate with the Soviet citizenry. 

By any metric, the Soviet Secularization Experiment was a failure: religious
belief persisted despite disrupted supply and stigma; few Soviet citizens bought 
into the dogma of scientific atheism; and following the fall of the Soviet Union, 
rates of religious affiliation increased in every region of the country. Froese argues 
that the failure tells us two things about religion. First, the failure of the supply- 
side approach is testament to the ubiquity and resilience of religious demand, even 
in the face of great adversity. As Froese puts it, “religious demand is elastic, but 
only to a point.” Second, the failure of the League of Militant Atheists to convert 
the Soviet population to scientific atheism demonstrates the inadequacy of the 
doctrine to satisfy the existential need for meaning and purpose. 

As historical narrative, The Plot to Kill God is provocative, informative and
rewarding; as sociological polemic, however, it is uneven and unconvincing. Froese
frames his analysis as an indictment of secularization theory, and as presented by 
Froese, secularization theory is thoroughly debunked. Unfortunately, his assault 
deals with a caricature of secularization merely as decline and eventual disappear- 
ance of religion, not with secularization as most in the sociology of religion now 
understand it. First, there is not merely one “secularization thesis” but rather sev- 
eral theories belonging to the “secularization paradigm.” Second, as Gorski (2000) 
notes, “only a few [theorists] posit a historical trend toward religious decline.” 
Fewer sociologists, if any, posit a trend toward ultimate disappearance, yet this is 
precisely the definition Froese employs and summarily dismisses in his analysis. 

Moreover, the forced secularization of society has never been proposed as a 
way to expedite the process, and this leads to the biggest weakness of Froese’s con- 
tribution. He concludes that the failure of the Soviet Secularization Experiment 
demonstrates the fundamental deficiency of atheism as a belief system. Froese 
downplays the possibility that scientific atheism failed, not because of its inherent 
deficiency as a doctrine or because of theism’s irresistible pull, but rather because 
religious affiliation and theistic belief became powerful acts of defiance against an 
oppressive and unpopular regime. Similarly, Froese fails to account for the fact 
that the experiment lasted only 70 years. By comparison, the Christianization of 
Russia, according to Froese, “depended on the success of thousands of Christian 
messengers over hundreds of years.” 

Theoretical weaknesses aside, The Plot to Kill God is a provocative account of a
unique case study. To be sure, the Soviet Secularization Experiment has much to 
teach sociologists of religion, and Froese’s contribution sheds further light on this 
fascinating episode in history. 

Waves of Protest: Popular Struggle in El Salvador, 1925-2005
By Paul D. Almeida 
University of Minnesota Press. 2008. 298 pages. $75 cloth, $25 paper. 


Reviewer: Paul Y. Chang, Yonsei University 


The core of the canon in social movement theory is based on case studies of move- 
ments occurring primarily in the United States and Western Europe. As Almeida 
notes, however, patterns of mobilization in democratic nations can often take 
distinct forms that are not generalizable to movements in developing nations
and/or authoritarian contexts. As such, Almeida’s book provides an important as- 
sessment of social movements in El Salvador, arguably one of the most important 
cases in Latin America. 

Almeida argues that the key mechanism motivating “the outbreak and forms of 
waves of popular protest” in El Salvador is the changing political environment.(169)
The El Salvadorian state abruptly shifted its policies towards mobilizing
groups, from tolerance to extreme forms of repression, at two significant histori- 
cal junctures. The first occurred when after three years of political liberalization, 
President Romero Bosque enacted in 1930 “new restrictive laws” that effectively 
illegalized various forms of mobilization.(43) This, in turn, facilitated the radical- 
ization of social movements in El Salvador and the 1932 revolt that ended in “one 
of the largest acts of state-sponsored repression in the twentieth century in the 
Western Hemisphere.”(46) Similarly, after a 10-year period of political liberaliza- 
tion (1962-1972), during which time El Salvador’s civil society had ample time to 
(re)develop and nurture movement organizations, “a dramatic shift took place in 
state-civil society relations”(120) when the state pursued a strict repressive policy 
that revolutionized activist communities and ultimately gave rise to civil war in 
the country. 

Utilizing extensive protest event data and rich interviews, and building on 
Political Opportunity Structure arguments (Meyer 2004), Almeida contributes 
to the social movement literature by offering a general theoretical framework for 
understanding the dynamics of protest waves in an authoritarian context. He 
shows that the liberalization of the state facilitated the founding and development 
of social movement organizations and that these organizations tended towards 
less disruptive protest tactics (what he calls “liberalization-induced mobilization”). 
When the state reverted back to extreme forms of repression, however, civil society 
groups drew upon the organizational capital developed in the liberalization period 
(the “organizational holdovers”) to mobilize protests that took on much more 
radical forms (“intimidation-induced mobilization”).

One curious dimension to the structure of Almeida’s argument has to do 
with the agency of actors to influence the actions of contending groups. Because 
changes in state policy are the central factor driving movement dynamics, Almeida’s
historical narrative raises the question of what is motivating the state’s sudden
shifts towards repression. Almeida seems to suggest that international politi- 
cal and economic forces were behind the changes in the El Salvadorian state’s 
policies towards civil society. For example, the “rapid economic downfall” due to 
“the global economic depression” sparked the “repressive political environment”
from 1930 through 1932.(41,209) Also, in the 1960s, Almeida suggests that the 
“United States placed external pressure for political reforms” and this facilitated 
the “military government's liberalizing efforts.”(71) This period of liberalization 
lasted until the early 1970s when “the world petroleum crisis” refocused blame 
on the state for economic grievances which galvanized the brutal repression of 
social movement and labor groups.(98) The war with Honduras also provided an 
external motivation to discipline El Salvador’s civil society as “tens of thousands 
of land-starved peasants from Honduras placed mounting pressure on the central 
government for agrarian reform.” (113) 

In short, Almeida’s explanation of what drove the waves of protest in El 
Salvador is predicated upon the agency of the state and its ability to mould the 
characteristics of social movements. In turn, Almeida suggests that international 
factors were the main causes of the shifts in the state’s policies from tolerance to re- 
pression. Following this logical order, one wonders if social movements possessed 
any agency of their own and what their role was in bringing about political and 
economic reforms in El Salvador. While Almeida clearly demonstrates the ability 
of the state to influence the character of social movements, there is considerably 
less exploration of the ability of movements to effect change in the state. While
this might not be one of the driving research questions of his study, without an 
answer, we are left with a unidirectional deterministic picture of social movement
evolution.



From Black Power to Black Studies:
How a Radical Social Movement Became an Academic Discipline
By Fabio Rojas
The Johns Hopkins University Press. 2007. 304 pages. $45 cloth, $25 paper.


Reviewer: Robert A. Rhoads, University of California, Los Angeles 


In From Black Power to Black Studies: How a Radical Social Movement Became an 
Academic Discipline, sociologist Fabio Rojas effectively explores the key role that 
black nationalism played in the initiation, implementation and development of 
black studies as an academic discipline. Rojas argues that “black studies grew out
of the disillusionment with the civil rights movement and the subsequent surge
of black cultural nationalism.” (207) He offers extensive evidence to support his 
claim, including analysis of historical documents and archival records as a means 
to explore case studies of the Third World Strike at San Francisco State College 
as well as the emergence of black studies at three universities: the University of 
Illinois at Chicago, the University of Chicago and Harvard University. He collects 
national data about black studies programs and their professors, and also conducts 
an extensive examination of Ford Foundation records in order to assess its role 
in supporting the development of black studies. Although vulnerable to minor 
criticism — such as his reliance on New York Times and Los Angeles Times coverage 
to assess incidents of black student protest (What about the many incidents not 
covered by these newspapers or by the media in general?) — overall, the empirical 
inquiry behind his book is quite rigorous. And as a read, portions of the book 
are outright compelling, including Rojas’ chapters on the early influence of black 
nationalism on establishing the socio-political context for the eventual emergence 
of black studies (“The Road to Black Studies”) and his detailed portrait of the 
Third World Strike (“Revolution at San Francisco State College”). 

From Black Power to Black Studies offers significant advances to scholarly understanding
in two major areas: (1) details and insights about the emergence and
development of black studies and ties to black nationalism (black power), and
(2) sociological understanding of the connections between social movements and
organizational change in the academy. In terms of the former, Rojas offers impor- 
tant observations about the key role students played in advancing the idea of black 
studies, including their role in promoting black nationalism by organizing black 
unions and developing tutoring and outreach programs in urban communities. 
Black student organizers also were adept at embracing innovative organizational 
structures, such as the Experimental College, and establishing the basis of a black 
studies curriculum through a variety of student-initiated courses. 

In terms of broader sociological contributions, Rojas points to the challenges 
university faculty and staff faced in seeking to address social movement demands, 
often coming from black student activists, while also reconciling political pres- 
sures linked to various university constituencies, including legislative and govern- 
ing bodies. He posits that universities are directly tied to their social environments 
and that external pressures often bear on administrative decisions and organizational
outcomes. He further argues that, “By definition, movement participants
challenge what is accepted by society, thus causing problems for managers. For 
example, the black studies movement challenged the idea that the academic dis- 
ciplines in 1968 were offering an education relevant for black students.”(213) 

Although social movements such as black studies seek to challenge and essen- 
tially alter university norms and practices, they are more likely to succeed if they 
reflect some elements of the culture of universities. In this regard, Rojas suggests 
that “movement-inspired organizational forms are often hybrids combining new
politics and old values.”(214) He supports such a conclusion by noting that successful
black studies programs build on a model of rigorous academic work and
research as academic black studies, in contrast to a community education model of 
black studies that targets the needs of minority communities and seeks to involve 
them in program-related decision making. The latter model presented a significant 
challenge to university elitism and traditional models of academic governance. As 
Rojas explains in discussing early efforts to advance black studies at two Chicago 
universities, “Community education failed at both campuses because it was in- 
compatible with the organizational culture of both universities. At the University 
of Chicago, administrators rejected any concept of black studies not consistent 
with its mission as an elite university. University of Illinois administrators were 
more sympathetic, but they could not accept external community control over 
academic activities.”(108) Consequently, what emerged from the black studies
movement was an interdisciplinary program grounded in a traditional model of 
academic rigor, promoted to some extent by powerful mainstream organizations 
such as the Ford Foundation, which, Rojas points out, was more likely to support
less radical programs oriented toward “racial integration, not separatism.” (141)

In the end, Rojas succeeds in capturing the complexities of movement politics 
and the challenges of organizational change in academe. I highly recommend this 
book for any student or scholar of social movements and university life. Indeed, 
I teach an undergraduate course at UCLA on the history of student activism 
(inspired by Howard Zinn’s A People’s History of the United States, 1492 to Present), 
and I intend to add a chapter or two from Rojas’ important work. 





Courting Change: Queer Parents, Judges, and the Transformation
of American Family Law
By Kimberly D. Richman 
New York University Press. 2008. 288 pages. $39 cloth, $22 paper. 


Reviewer: Tey Meadow, New York University 


How should we understand an Indiana court’s prohibition of the adoption by a 
gay man of a severely disabled little girl, ordering social services instead to sepa- 
rate her from her brothers and place her in an abusive foster family, when many 
other states have active programs to encourage foster and adoptive parenting by 
gay and lesbian adults? What sociological sense ought we to make of a California 
court’s refusal to recognize as a parent a member of a long-term lesbian couple 
who participated in the planning, conception and rearing of a child, when dozens 
of other state courts have come to the exact opposite conclusion? In this time 
when same-sex couples are choosing to jointly conceive, adopt and raise children 
in every state in the nation and in more than 96 percent of counties (Sears et al. 
2005), their relationships are governed by a dense, conflicted and inconsistent web
of family law and court decisions. It is this incoherent body of law that forms the
data for Kimberly Richman’s engaging, and occasionally heartbreaking, study of 
legal indeterminacy in the past half-century of case law concerning the rights of 
LGBT parents. 

From the Legal Realist tradition in the early 20th century onward, socio-legal
scholars have debated the implications of the inconsistency and irrationality of 
law. Richman intervenes in these discussions with a novel approach: she views in- 
consistency as a necessary component of an area of law whose continued relevance 
depends on its ability to adapt to changes in social organization of family and sexu- 
ality.(2) She argues that the dominant “indeterminacy thesis,” an empirical and 
normative critique of law and of formal rationality focused largely on discussions 
of bias and injustice, actually prevents us from fully exploring the ways “openness” 
can facilitate social change and provide opportunities for litigants and attorneys 
to “work” ambiguity in the interest of progress.(6-7) Richman consciously exploits 
the inconsistencies in the body of appeals court decisions in child custody, adoption
or visitation cases involving at least one known LGBT parent (316 in all,
occurring from 1952 through 2004) to demonstrate the transformations of meaning
that can occur in the working through of legal norms and concepts. 

As sociologist Judith Stacey (1996) notes, major innovations in the structure of
family life are rare. The evolution of LGBT family life has caused courts to rethink 
legal notions of family and sexuality. Early cases focused on breaking down the 
cultural assumption that “homosexual” and “parent” are mutually exclusive iden- 
tity categories; newer cases confront demands to untangle the historical conflation 
of legal and biological parenthood. And all of this poses a poignant challenge to 
liberal rights paradigms to move from privileging the atomistic rights of individu- 
als towards a framework for the legal recognition of the distinctly relational nature 
of family bonds. 

These innovations are not linear, nor are they always an uncompromising victory
for LGBT parents. As legal scholar and author Kenji Yoshino (2006) notes,
the law places “covering” demands on LGBT people across contexts, requiring 
them to mute crucial aspects of their identities to secure collective rights. Nowhere 
is this more poignantly illustrated than in the cases Richman depicts: of parents 
required by courts to choose between their children and their same-sex partners, 
or in its most pernicious forms, to acquiesce to demands that they present them- 
selves as “non-practicing” gays, to disavow political involvement in LGBT issues 
or communities, or to hide or mask the parts of their own personalities that read 
too “queer” for the comfort of legal institutions. 

In perhaps the least successful portion of the book, Richman theorizes the 
productive power of judicial dissent. “The minority,” she argues, “have a curious 
concurrent jurisdiction over the future.” (127) Dissenting opinions are, in her view, 
a window into the active negotiations over restructuring meaning systems that 
occur between judges. They also function symbolically as a formal appeal for rehearing
of legal issues by the court at a later time. Although she argues these points
persuasively, she is less convincing in showing that her data support those claims.

Richman’s contribution is an analysis of the way legal indeterminacy itself does
social work, by inviting legal actors to reframe debates, reformulate legal concepts
and push for innovations in legal thinking. In this way, it is not concrete law
that is constitutive of social change, but it is the legal arena and its invocation of
dialogue and disagreement that provides a frame for the “revision, negotiation and
eventual sedimentation” (18) of new ways of defining and legalizing families. This
is certainly an important intervention in our evolving understanding of the relationship
between law and social change. And, if Richman is right, her work might
illuminate the ways legal actors engage in a dialectic process with law, watching it
for signals of evolving logics, for invitations to reframe concepts and arguments,
for cracks in the veneer of precedent where inroads and innovations can be made.
For this reason, I wish she had done more work to establish how the cases she
surveys actually affected the evolving strategies of the LGBT family movement.
Though she interviews judges and lawyers, she seems mainly to reflect on their
understanding of and reactions to specific court decisions, and less on the ways in
which those understandings may have changed what they did next.





Imperial Citizens: Koreans and Race from Seoul to LA
By Nadia Y. Kim 
Stanford University Press. 2008. 328 pages. $65 cloth, $25 paper. 


Reviewer: Jiannbin Lee Shiao, University of Oregon 


Imperial Citizens recently won 2009 book awards in the race/ethnicity and in the 
Asia and Asian America sections of the American Sociological Association. Author 
Nadia Kim conducted an ethnographically enabled interview study in two coun- 
tries to illuminate how Korean conceptions of race/ethnicity precede and exceed 
the lives of Koreans in the United States. For applying a transnational approach 
to Korean Americans, a group often set aside as a model minority, the awards 
are well deserved. However, the exhaustiveness of Kim’s research raises questions 
about the reach of transnationalism in understanding U.S. racial/ethnic dynamics.
Imperial Citizens argues that a transnational approach is essential to fully understand
how immigration intersects with race relations. It is decidedly not about the
assimilation of Korean Americans, but seeks to encompass both their lived experi- 
ences and the transnational circulation of racial discourse. Kim convincingly argues 
against the assumption that the racial/ethnic consciousness of U.S. ethnics is born 
with immigration and matures mainly in second-plus generations. However, it re- 
mains unclear how directly imperialism shapes the experience of U.S.-born Koreans. 

The book’s main contribution is to distinguish the nativistic racism that dis- 
tances Asian Americans from social citizenship, from the racism that constrains 
blacks from upward mobility. Kim also extends political scientist Claire Kim's 
theory of racial triangulation with its two-dimensional space of racial status to 
the international stage of the U.S. military occupation of South Korea. In brief, 
she grounds anti-Asian racism in the continuing power of the United States over 
Asians in their ancestral homelands. 

After Chapter 1’s ambitiously dense theoretical foundation, Chapter 2 provides 
an introduction to the South Korean version of U.S. race/ethnicity: tanil minjok 
or ethnonationality. This chapter establishes the distinctiveness of ethnonationality
and explains its many roots from Confucian hierarchies to reactions to Japanese
colonialism. Chapters 3 and 4 trace the development of ethnonationality after
the Korean War into the attitudes that post-1965 emigrants have brought to 
the United States. Under the U.S. occupation, South Korean ethnonationality 
changes as it becomes redefined within a racial triangle that includes white and 
black military personnel. 

Chapter 5 examines how living in Los Angeles changes the attitudes of the new 
arrivals. These shifts largely confirm the contact hypothesis: a new appreciation of 
white heterogeneity, recognition of black individuality, and surprise at the pres- 
ence of Latinos, developing into a sense of empathy. Their new social interactions 
also make being categorized as Asian a personally salient reality. Chapters 6 and 7 
analyze how Korean immigrants respond to their new situation. Their pre-migrant 
triangulation makes their situation familiar; however, they experience this status 
more directly in the United States where they are a minority within a minority. 
Furthermore, Kim argues that they contend with discrimination and subordina- 
tion as visible foreigners and foreign model minorities, and rather than oppose 
oppression or seek assimilation, they opt to take pride in their relative class status 
and seek to redefine the mainstream to include Korean Americans, becoming
“approximate Americans.”

Kim reports very similar perspectives among her second-generation respondents 
in Chapter 8. Given the extensive documentation of generational differences in 
attitudes, her emphasis on similarities seems incomplete especially when one of her 
subjects states, “It’s different than coming here as an adult [when] your identity is 
basically formed, you're basically trying to survive... whereas younger people have 
to try to figure out how to fit.”(235) Chapter 9 completes the transnational circuit
with the social remittance of migrant narratives of their U.S. experiences. Kim finds
that migrant narratives have little cachet in South Korea; instead non-migrants idealize
opportunities in the United States, regard emigrants as traitors, and essentially
disregard their experiences as whining. In brief, the homeland is neither empty of
difference before migration nor a passive vessel when migrants return.

In the final chapter, the strengths and weaknesses of Kim’s argument for trans- 
nationalism come into focus. Clearly U.S. institutions have exported American 
racial concepts, and any account of immigration begins too late if it begins when 
immigrants arrive. But does this approach change how sociologists understand 
variation in the immigrant experience, or does it mainly push back its genesis? 
Kim gives scant attention to the racial consciousness of the immigrant generation, 
but non-transnationalists have documented her same second-generation findings. 

Sociologists who seek to bring transnationalism to the study of U.S. race/ 
ethnicity face a challenge in how to theorize imperialism in ways that differentiate 
new patterns in immigrant adjustment, rather than simply enriching the same 
substantive patterns. For raising this and other important questions, Imperial 
Citizens is an excellent contribution to the sociology of race/ethnicity, migration 
and the Asian American experience. Its style is well suited for capstone seminars,
honors classes and graduate seminars. It uniquely complements research on ethnic 
assimilation and will provoke timely debate on the future of race and ethnicity. 





A Nation of Emigrants: How Mexico Manages Its Migration 
By David Fitzgerald 
University of California Press. 2009. 264 pages. $55 cloth, $21.95 paper. 


Reviewer: Robyn Rodriguez, Rutgers University 


Mexican immigration to the United States has been a key topic of research across 
the social sciences. Scholars have examined questions such as: what determines 
migratory flows from Mexico to the United States; how are these flows sustained 
in the face of increasingly restrictive U.S. immigration laws; to what degree do 
Mexican migrants assimilate or are they living their lives more transnationally; 
is migration from Mexico, particularly undocumented migration, reshaping the 
form and substance of American citizenship? 

Interestingly, scholars have been less attentive to the Mexican side of the im- 
migration equation. The ways national belonging and citizenship may be trans- 
forming in Mexico as it is affected by processes of emigration have not been fully 
explored. Indeed, studies about the impact of international migration on the state 
and society of labor-sending countries continue to be scarce. Fitzgerald’s book, 
therefore, is a vital contribution to the scholarship on Mexican migration as well 
as broader scholarly debate about nationalism, transnationalism and citizenship
under conditions of heightened global mobility.

Based on a mix of methods including archival, interview, participant observation
and survey research, Fitzgerald traces how the Mexican state has long
been concerned with — and has attempted to regulate — processes of emigration. 
During the early 20th century, emigration, characterized as “demexicanization,”
was opposed and actively prevented by the central government with limited suc- 
cess. The uneven and sometimes contradictory implementation of restrictive 
emigration policies was due in part to the Mexican state's lack of internal cohe- 
sion. As well, policies introduced by Mexico’s more powerful northern neighbor 
undermined attempts by the Mexican state to control the out-flow of its citizens. 
In the 1980s, the Mexican state’s orientation towards emigration was practically 
reversed. Shifting from coercive strategies of emigration control, the state began 
to ultimately encourage out-migration as the remittances generated by overseas 
migrants became more and more appealing to the central government. Indeed, 
Mexican states have launched their own initiatives to engage with, and benefit 
from, overseas Mexicans. For example, the state of Zacatecas, since 2003, has 
allowed Zacatecanos overseas to run in the state’s congressional and county elections.
Meanwhile in Jalisco, the state government has found novel ways to funnel
migrants’ remittances into business investment and infrastructural development.

Despite years of tensions, the Mexican state would ultimately partner with and 
build upon the Catholic Church’s infrastructure and methods for sustaining its 
reach over its mobile population. Indeed, Fitzgerald suggests that the model of 
what he calls “citizenship à la carte” or “emigrant citizenship” that has emerged
most recently in Mexico is drawn largely from the kind of pastoral care engaged 
in by the Mexican Catholic Church to maintain its hold over its flock. As the 
Mexican church’s flock moved northward, its reach did too. Hometown associations
(HTAs) have been a key mechanism by which the Church, and now the state,
maintains its ties with overseas Mexicans. Indeed, much of the scholarship on
Mexican transnationalism has focused on the central role of HTAs in facilitating
and sustaining Mexican immigrants’ linkages to their communities of birth. HTAs,
however, as Fitzgerald reveals, are not a recent invention. Hometown associations
were actually first encouraged by the church to sustain internal migrants’ linkages 
with their hometowns. In Arandas, the site of focus for Fitzgerald, domestic HTAs 
were crucial to local economic development. In fact, domestic HTAs in the mid-20th
century appear to have had a far greater developmental impact than more
contemporary trans-border ones. Nevertheless, today’s trans-border HTAs are 
important both economically and politically to emigrant-sending communities. 

While the state and the church have a more favorable stance toward emigration 
and migrants’ continued homeland ties, migrants’ transnationalism, at least from 
the perspective of some of their compatriots, is often unwelcome. While U.S. 
pundits may decry Mexican immigrants’ refusal to assimilate into the mainstream 
of American life, the fact is, these very migrants are often perceived by their communities
of origin as having become too culturally dissimilar from locals. The term
“norteños,” used to describe those who have lived and worked in the United States,
has a negative connotation. Norteños are believed to infect their communities
with crime, vice and other bodily and social ills. Neither the state nor the church
is ready to revert to its earlier position on emigration, however, and both have
instead attempted to valorize what they consider to be ideal migrants.

Fitzgerald’s book complicates conventional understandings of assimilation and 
transnationalism and is essential reading for immigration scholars. Just as importantly,
it is a book for scholars interested in understanding the consequences
of globalization on nation-state formation. Fitzgerald argues that with increasing 
emigration, the “fusion of a territory, a government and a people has cracked 
apart.”(154) This, as Fitzgerald convincingly argues in this study of Mexico, has 
not led to the dissolution of the nation-state, but rather its reconfiguration. 


Restructuring the Philadelphia Region: Metropolitan Divisions and Inequality 
By Carolyn Adams, David Bartelt, David Elesh and Ira Goldstein with 
Joshua Freely and Michelle Schmitt 

Temple University Press. 2008. 248 pages. $74.50 cloth, $29.95 paper. 


Reviewer: Philip Nyden, Loyola University Chicago 


Restructuring the Philadelphia Region takes a fresh look at a social landscape that 
is no longer easily broken down into city vs. suburbs, or even older suburbs vs. 
newer suburbs. Examining opportunity structures in employment, housing and 
education, Adams and her co-authors argue that traditional categories do not 
work. Instead they suggest five community types: Urban Centers, Stable Working 
Communities, Established Towns, Middle-Class Suburbs and Affluent Suburbs. 

The authors use a variety of theoretical frames and data sources to provide a 
comprehensive new look at a predominantly older urban region. In updating 
Philadelphia: Neighborhoods, Divisions, and Conflict in a Postindustrial City, a 1991 
book written by many of the same co-authors, Adams et al. continue to focus on 
how development patterns, private investment and government policies produce 
the “inequalities of life chances.” However, in this book they expand the focus to 
the entire Philadelphia metropolitan area. Adams et al. effectively recognize and 
draw from urban research orientations as diverse as “uneven development,” the 
“LA School's poly-centered model” of regionalism, and New Urbanism, in explaining
the realities of 21st-century Philadelphia.

The chapter on housing opportunity was particularly valuable not only in 
understanding the Philadelphia region, but in guiding anyone interested in the 
challenges facing most American cities. Although written before the national 
housing market meltdown of 2008, the authors’ documentation of the inequities 
in the housing market as well as the increasingly risky investments being made by 
developers, banks and middle-class homeowners speaks to the perspicacity of the
book. Perhaps it underscores the fact that policy makers and regulators should be
paying much more attention to the on-the-ground research than to the promise 
of a perpetual cornucopia of rising housing values as the way the middle class can 
expand its wealth and the lower class can pull itself out of poverty through home 
ownership. The authors document how mortgage discrimination and overreliance
on sub-prime loans in lower-income communities negatively affect community 
economic and social health. 

In the next chapter on educational opportunity, the authors do an excellent 
job in, first, pointing out how educational attainment is central to the health of 
the region, and second, analyzing how educational theory and innovation meet
practice in schools throughout the Philadelphia region. For example, they docu- 
ment how the overall educational attainment of a community is a better predic- 
tor of school performance than income. At the same time, they document the 
continuing racial gap in education, with black students consistently performing 
more poorly; this is apparent even after researchers control for district differences
in property taxes, per student expenditures and other factors. The analysis
of the impact of school choice, charter schools and private schools (the Philadelphia
region has a particularly high percentage of students attending private schools) is
especially informative.

The analysis of the “value proposition,” or the extent to which households are
rational in their decisions to move to or stay in particular communities, is also 
provocative. There appears to be a disconnect between what prospective movers 
say they are looking for in a new community and the type of community into which
they ultimately move. Although “good schools” are high on many home shoppers’ 
wish list, analysis of actual moves shows those same buyers do not necessarily 
move to communities with higher average SAT scores or school expenditures per 
student. Although part of this may be a product of limited detailed knowledge, 
Adams et al. make a strong case that household moves are most heavily “influenced 
by expectations regarding a return on one’s investment.” (165) 

A concluding chapter focuses on “Who Takes Responsibility for Addressing 
Inequality?” Here the authors focus on the “Third Sector,” including business
improvement districts, neighborhood development organizations, community 
development financial institutions, foundations and quasi-public corporations. 
While helpful in understanding some of the policy actors in Philadelphia, the 
reader gets the feeling that he is experiencing the policy equivalent of window 
shopping along a city street. Only those who can afford to set up shop and present 
the nice displays are present. What is missing are the people and institutions from 
the disenfranchised low-income or working class communities. Details of the 
struggles and victories of neighborhood groups to get rid of abandoned housing 
are absent. The work of ecumenical alliances to develop faith-based employment 
programs is not prominent. The grassroots, more human texture of social change
movements is not included in the discussion of regional organizations and
more official government agencies. This may very well be beyond the scope of this 
well-written and well-researched book, but it is a side that needs to be considered 
in order to understand urban life in Philadelphia or in any city in this first decade 
of the 21st century.

Restructuring the Philadelphia Region is unquestionably a valuable contribution
to the understanding of contemporary trends and policy in Philadelphia and
in all cities. While it is effectively a case study, its lessons reach far beyond this 
mid-Atlantic metropolitan area. It also represents a great updating of our view of 
metropolitan Philadelphia which has been the focus of much past writing, from 
Jane Jacobs’ extensive use of Philadelphia as an example in her Death and Life of 
Great American Cities to Sam Bass Warner’s history of Philadelphia, The Private
City. Whether used as an addition to a research scholar’s bookshelf or as a college 
classroom text, this is an accessible, useful contribution to the field. 





Faith Makes Us Live: Surviving and Thriving in the Haitian Diaspora 
By Margarita A. Mooney 
University of California Press. 2009. 296 pages. $55 cloth, $21.95 paper. 


Reviewer: Melissa J. Wilde, University of Pennsylvania 


Faith Makes Us Live is the result of a carefully designed and exhaustively executed 
internationally comparative ethnography of the everyday lives, struggles and re- 
ligious beliefs of Haitian immigrants. Wanting to understand the ways that the 
Roman Catholic Church is helped or hindered by its relationship with the state in 
its attempts to minister to Haitians, author Margarita Mooney learned Creole and 
French and then spent four to six months in Miami, Montreal and Paris attending 
religious services, singing in the parish choir, interviewing and participating in the 
daily routines of immigrants and community and Church leaders in each location. 

Mooney found that the Church and state collaborate in Miami to create a 
vigorous ministry that provides a host of essential social and spiritual services to 
Haitians. In comparison, the Church and state conflict in Montreal and Paris in 
ways that continually stymie the Church's efforts in both places. 

As a result of its comparative design, Faith Makes Us Live provides a number 
of contributions. First, as a study of the resources and struggles of the dually 
disadvantaged Haitians, who must deal with the stigma of African descent and 
having among the lowest educational attainment and occupational skills of most 
immigrants today, the book provides an original and compelling account of an 
often overlooked group. Furthermore, the book contributes to scholarship on 
immigration more generally by emphasizing the advantages provided by a cooperative 
relationship between religious institutions and the state and by underlining 
religion’s importance, not just in terms of material resources, but also in terms of 
emotional and spiritual support, for immigrants. 

Overall, Faith Makes Us Live is impressive in both its breadth and depth. Mooney 
succeeds at painting rich and compelling pictures of Haitians in each location while 
never losing sight of the ways in which they are differently enabled or constrained by 
the broader social and political environments in which they are embedded. 

However, as with any book that is really path-breaking, it leaves the reader with 
many questions, most of which have to do with Mooney’s conclusion that Haitian 
immigrants in Montreal and Paris would be much better off if the Quebecois and 
French governments provided more support for the Church. 

The first factor that raises questions is that while the United States clearly has 
the friendliest Church-state relationship of the three countries, it is also without 
a doubt the most religious. Whereas immigrants tend to become more religious 
upon arriving in the United States, Mooney’s data suggests that this is not the case 
for the Haitians who arrive in France.(190-92) While this finding is a contribu- 
tion in and of itself, if immigrants in Paris and Montreal are falling away from 
the Church after immigration, would giving money to the churches there be as 
effective as it is in Miami? 

The second question raised by the book has to do with the fact that in Miami 
the Church benefitted from its decision to build an ethnically-based Haitian parish 
within a Haitian ethnic enclave. While both of these are absent in Montreal 
and Paris, isn’t the creation of an ethnic parish well within the capabilities of the 
Church in both countries, even if that parish would lack a nearby ethnic enclave 
(given the public housing policies that intentionally discourage their formation 
in both Montreal and Paris)? 

Because I think the answer to this question is yes, part of the story about why 
the RCC is less capable of helping Haitians in Montreal and Paris is about the 
ways in which the hierarchy has been affected by the much less friendly political 
and cultural environments in each city (of which those Church-state relationships 
are both cause and effect). While Mooney alludes to this a number of times, this 
argument often gets lost in her focus on the state. 

Finally, additional questions are raised by the way in which Mooney drew her 
sample, by first going to the parishes which most intensively ministered to Haitians 
in each of her cities, and then picking their most active lay leaders to interview 
and observe. While this sampling strategy allows her to paint rich portraits of 
immigrant religiosity throughout the book, how representative is her respondents’ 
intense religiosity among Haitian immigrants, particularly in the less religious 
Paris and Montreal? Secondly, and even more importantly, given that drugs, gangs, 
early childbearing and prostitution are becoming more prevalent among second- 
generation Haitians in all three cities, and given recent research that shows that 
sexually active women (and one would assume especially prostitutes) avoid ac- 
cepting help from religious congregations because they fear they will be judged 
(Crawford Sullivan 2009), it seems necessary to ask: Would giving more money 
to the RCC really be an effective strategy to combat these social ills? 

Despite these questions, Faith Makes Us Live does convince the reader that the 
Church provides Miami's Haitians with invaluable resources that those in Paris or 
Montreal simply must do without. Furthermore, and perhaps most importantly, 
it does so by richly describing the religious beliefs and practices that play an im- 
portant role in that process. In these ways, the book is an important contribution 
that will be a touchstone for scholars of immigration in the United States, France 
and Canada, and for that matter, any country that finds itself struggling to adapt 
to the different cultural, political and religious views of immigrants. 





Rave Culture: The Alteration and Decline of a Philadelphia Music Scene 
By Tammy L. Anderson 
Temple University Press. 2009. 240 pages. $79.50 cloth, $25.95 paper. 


Reviewer: Allison McKim, New York University 


Tammy Anderson's Rave Culture is based on several years of intensive, multisite 
ethnographic research on the twilight era of raves in Philadelphia. This insightful 
and ambitious book gives one of the fullest pictures to date of the American rave 
scene, which centers on all-night dance parties featuring electronic dance music. 
A new fan of the music, Anderson was disappointed to find the culture was los- 
ing popularity and its grassroots, antiestablishment ethos. Using informants at 
an EDM record store, ethnographic observation of parties and interviews, she 
sets out to understand the forces behind this decline. Anderson argues that the 
intersection of five forces (generational schisms, musical fragmentation, formal 
social control, excessive deviance and commercialization) worked to strip raves 
of loyal participants and shift party promoters and DJs to more mainstream 
cultural practices. 

Anderson’s investigation of the sources of cultural change provides a welcome 
break from much of the prior work on youth subcultures, which defines scenes in 
terms of either drug abuse or working class resistance. The book’s great strength is 
its subtle and complex empirical account of the rave scene, including the overlap- 
ping social groups, parties and the scene's vague, contested borders with other 
forms of nightlife. Conflicts over scene borders and their symbolic markers are 
a common but under-researched feature of many youth music cultures. Chapter 
2 lays out the fragmentation and decline of raves by explaining the range of different 
parties, tastes and practices that produce social distinctions within the 
scene. Placing these on a continuum of commercialization, she charts the vast gray area 
between illegal, “underground” raves and the glitzy, highly sexualized club scene. 
Chapter 3 describes six ideal-type rave-goers, ranging from “loyalists” to outsiders 
who “spillover” from mainstream bars. While each category of people and parties 
is not equally fleshed out, Anderson’s complication of the dichotomy between 
underground and commercialized is an important contribution. 

Chapter 4 lays out Anderson’s argument about the forces of “alteration and 
decline” and is the heart of the book’s analysis. She raises several important ques- 
tions about the standard analysis that grassroots cultural production eventually 
loses out to homogenized commercialized success. Anderson shows how other 
forces shape commercialization. The intensive policing and numerous federal 
laws that targeted raves in the early 2000s led party promoters and musicians to 
renounce rave culture, move EDM into mainstream nightclubs and restrict admis- 
sion to those over 21. While many ravers aged out of the culture and their own 
drug use, the shift to nightclub venues failed to recruit new, young fans into raves. 
Furthermore, Anderson argues that the scene declined for reasons unrelated to 
commercialization, because the music fragmented by genre. While it is not clear 
why this is unrelated to commercialization, Anderson makes a convincing case 
that commercialization does not guarantee scene expansion, but in this case actually 
undermined the scene and the market for EDM. 

Chapter 5 looks at the cultural work done by people, especially cultural pro- 
ducers, to adapt to these changes, restore elements of the rave culture, or preserve 
what they have. Anderson is able to further complicate the overly-simplistic story 
of commercialization by showing how these strategies reveal multiple different 
relationships between music and the other cultural elements of rave. In Chapter 
6 she applies her theory of scene alteration and decline to two places where EDM 
has thrived, London and the Spanish island of Ibiza. Using a few short stints of 
fieldwork, Anderson finds that rave commercialized but this resulted in expansion. 
The model does not account for EDM’s continued popularity, but it illuminates 
how greater tolerance for hedonism and lesser social control allowed commercialization 
to sustain a diverse, class- and age-stratified scene. 

Despite the complexity with which Anderson describes the current rave scene, 
her characterization of the scene during its height is thin — the product of relying 
on her informants’ idealizations of “back in the day.” Without a clear understand- 
ing of the presumably purer rave scene of yesteryear, it is hard to assess decline or 
specify how or why the scene has changed since the late 1990s. Anderson tries to 
resolve the problem of defining authenticity by relying on her subjects’ operating 
definition of “real” raves. However, she does not consider that the earlier rave 
culture also had commercialized elements and contained internal divisions over 
authenticity. In addition, her key informants are mostly white men who are highly 
involved in music and party organization. This produces too static a picture of 
the early and peak rave scene and obscures changes that are the product of earlier 
conflicts or the decreasing quality of parties. 

Nonetheless, Anderson's book is an important contribution to the literature on 
youth subcultures and among the most comprehensive examinations of American 
rave culture. Its model of scene change will provide useful tools for those with interests 
in collective identity, cultural change and cultural criminology. 





Memphis and the Paradox of Place: Globalization in the American South 
By Wanda Rushing 
University of North Carolina Press. 2009. 272 pages. $59.95 cloth, $21.95 paper. 


Reviewer: Japonica Brown-Saracino, Loyola University Chicago 


Wanda Rushing’s Memphis and the Paradox of Place: Globalization in the American 
South fills a number of voids in urban sociology. Relying on a case study approach 
and drawing on the strengths of urban, cultural and historical sociology, Rushing 
attends to place, historical processes and public spaces in an understudied city and 
region. Her attention to these features unites typically disparate traditions, and as 
a result the book breaks new ground by challenging established approaches to the 
study of place, cities and globalization. 

First, as the title suggests, Rushing is far more attentive to place than most soci- 
ologists and urban scholars. Rushing not only attends to the factors that influence 
the specific place she studies, but also, following Thomas Gieryn’s (2000) call for 
scholars to acknowledge place’s agentic qualities, her book treats place as an in- 
dependent variable. For example, instead of merely documenting globalization’s 
impact on Memphis, Rushing works to demonstrate how the meanings that local 
and extra-local actors assign to Memphis influence its role in the global economy 
and residents’ response to the pressures and opportunities that globalization presents. 

Second, Rushing’s book calls our attention to an underexplored city in an un- 
derstudied region. As urban scholars frequently note, too much of our knowledge 
of American cities is predicated on studies of Chicago and, secondarily, on other 
large metropolises such as New York and Los Angeles. Rushing’s book helps to fill 
this void in our understanding of American cities — not simply by attending to 
Memphis but also by demonstrating how the broader region’s economic, political, 
cultural and demographic characteristics have influenced the city’s development. 

Third, Memphis and the Paradox of Place is highly attentive to historical processes. 
Better than most, Rushing demonstrates that the contemporary dynamics 
her book captures are rooted in a long and varied history. She reveals how events, 
from the Civil War and yellow fever outbreaks to the assassination of Dr. Martin 
Luther King, Jr., continue to influence Memphis. Furthermore, Rushing demon- 
strates not only how the meanings actors assign to Memphis have evolved, but also 
the relationship between contemporary meanings and older narratives. 

Finally, public spaces, such as parks, have a central place in Rushing’s book. 
Rushing does not treat them, as many might have, simply as the spaces in which 
events occur, but instead grants them their own analytic weight. They at once 
convey and help determine the meaning residents and other actors assign to the 
broader city. These meanings, Rushing argues, are of real consequence, defining, 
for example, the content of public spaces as well as which spaces are preserved. 

The book’s complex rendering of Memphis and its fusion of disparate analytic 
traditions are facilitated in large part by Rushing’s reliance on “a narrative case-study 
analysis,”(10) or what she describes as “an in-depth, holistic, historical-sociological 
investigation.” The data she relies on include archival materials, literary and his- 
torical works, an oral history collection and her own experience and interactions 
as a longtime Memphis resident.(30-31) This methodological approach facilitates 
the book’s empirical breadth and enables Rushing to pose a set of wide-ranging re- 
search questions, from tracing the evolution of a single park to locating Memphis’ 
position in a global hierarchy. 

However, the approach comes at the cost of some empirical depth and, as a 
result, in some cases it is unclear which actors attribute a given meaning to a place 
or whether sides taken in a particular historical debate accurately reflect the full 
complexity of the work either side conducted. In the vein of other interpretive 
approaches to the study of cities, the book provides a portrait of Memphis, its 
residents and institutions that depends largely on the author's interpretive analysis 
of outcomes and secondary sources. 

Of course, attending more closely to micro-level meaning making would have 
required Rushing to sacrifice the book’s key strengths: its empirical and historical 
breadth. As a result of that breadth the book provides an overarching understand- 
ing of Memphis’ development that will likely invite others to attend to the micro- 
level conditions of place construction in Memphis and beyond. 

In sum, Rushing’s book fills a void in urban studies and pushes sociologists 
in a variety of subfields to attend to a number of under-examined empirical and 
analytic categories. In its attention to disparate histories, institutions and facets 
of Memphis — from the ecological to the economic — Memphis and the Paradox of 
Place serves as a model for application of the case study approach to the research of 
cities, particularly those devoted to local history. Finally, the book is a must read 
for those interested in globalization’s influence on cities, for Rushing’s account of 
the multi-directional relationship between material and symbolic global and local 
conditions moves this line of inquiry in an intriguing and productive direction. 
In short, many features of Memphis and the Paradox of Place will be of great value 
to a variety of scholars, most notably its insights on Memphis and the American 
South, place, space and globalization. 


Reference 


Gieryn, T. 2000. “A Space for Place in Sociology.” Annual Review of Sociology 26:463-96. 





Migration Miracle: Faith, Hope and Meaning on the Undocumented Journey 
By Jacqueline Maria Hagan 
Harvard University Press. 2009. 238 pages. $29.95 cloth. 


Reviewer: Sarah J. Mahler, Florida International University 


The immigration literature is a very crowded field and growing more crowded each 
day. How can anyone read even the yearly volumes published in this area, let alone 
the hundreds if not thousands of new journal articles, reports, blogs, etc.? The 
sheer mass gives the impression that there is little if anything original to say. Yet, 
periodically a new publication comes along and makes even the seasoned academic 
shake her head and wonder, “Now why hasn't anyone published on this before?” 
Such was my experience reading Jacqueline Hagan’s new book, Migration Miracle. 

This infinitely readable volume fills a gap in the U.S. migration field’s extensive 
literature — in often vivid narrative prose — by documenting the importance faith 
plays in immigrants’ decisions to migrate and their ability to survive harrowing 
journeys, particularly those of undocumented Central Americans. A growing 
literature engages the importance of faith, and religion more narrowly, in the 
acculturation processes of immigrants, particularly the volumes published as the 
outcome of research underwritten by Pew Charitable Trusts — the Gateway Cities 
project of which I was a part. And conferences on religion and migration abound. 
Hagan knows this work, but she quite correctly argues that there is far more to 
the importance of immigrants’ faith than just whether or not it aids their adaptation. 
What she sets out to do, and achieves very effectively, is to communicate the 
intimacy of faith in the migration process, from prayers and promesas as people 
contemplate and plan their trips, to seeking human counsel and divine interven- 
tion when inevitable problems arise and barriers appear that thwart their travel. 
Hagan knows this terrain well; she has long researched Guatemalan migration and 
has published previously on how important faith is to embarking on migration. 
Over five years of transnational fieldwork, she and her team did copious amounts 
of data collection for this book, including some 250 interviews with immigrants 
and nearly 100 with religious leaders from Mexico and Central America. They also 
observed religious festivals, visited sacred spaces and traced migration pathways 
through the desert, finding in the latter harrowing pleas for help alongside inspir- 
ing affirmations of faith inscribed into the harsh landscape. 

After an introductory chapter in which Hagan spells out her objectives and 
methodology, she develops chapters that mirror the migration experience. Chapter 
2 explores the religio-psychological preparation migrants undergo as they plan to 
emigrate. Included in the planning are the promesas, the covenants they make to the 
divine for help in crossing. The axis mundi of the book, Hagan explains this tran- 
scendent reciprocity system in which successful migrants return the favors through 
devotional activities that express their gratitude, fortifying their faith in the process. 
In Chapter 3 the stories shift from those of migrants to those of the many faith 
leaders who advocate for, and provide sanctuary to, the modern-day Pilgrims during 
their arduous ordeals. Chapter 4, titled “Miracles in the Desert,” recounts the 
travails of migrants navigating unfamiliar and harsh lands in prose reminiscent of 
familiar refrains from Exodus. Difficulties test faith among today’s migrants — as they 
have since time immemorial — but faith accompanies them, inspires their determina- 
tion, and miracles do happen. No wonder Hagan dedicates the entirety of Chapter 5 
to narrating immigrants’ promesas. These pledges are made before the journey begins, 
are fortified during the journey, and are repaid after the pilgrim reaches the Promised 
Land. However, full reciprocity can only occur, Hagan writes, when migrants return 
to their homelands and “pay homage to the sacred figure in the original shrine in 
which the pledge was made.”(154) Promesas, then, are as transnational and as cycli- 
cal as the migratory life itself, inextricably bound together to sustain and nourish 
hope. As much or more than promises made to migrants’ families, these spiritually 
sealed covenants inspire migrants to continue their journeys despite overwhelming 
difficulties and odds. It is this faith, Hagan underscores in the book’s conclusion, 
that we social scientists typically miss when we emphasize viewing migration in 
largely economic and political cost-benefit analyses. Among migrants themselves 
the language is more saturated with belief, faith and miracles. She is right to push us 
in emic directions; this point brings me to my only real criticism, however. A book 
so devoted to expanding our understanding through people's subjective experience 
contains many stories about, but very few voices of, the migrants themselves. So 
much data was collected that elicited quotations; why not include more of this 
original data rather than the paraphrases that dominate the book? Perhaps as an anthropologist 
I’m echoing a disciplinary bias, but I do feel that people's own voices tend 
to tell their stories best. Nonetheless, Migration Miracle will captivate readers with 
its sensitive, nuanced portrayal of human fortitude fueled by faith. 





Redesigning Social Inquiry: Fuzzy Sets and Beyond 
By Charles C. Ragin 
University of Chicago Press. 2008. 240 pages. $45 cloth, $18 paper. 


Reviewer: Stephen L. Morgan, Cornell University 


In Redesigning Social Inquiry: Fuzzy Sets and Beyond, Charles Ragin has offered to 
the social sciences a robust appeal for renewed commitment to small-N comparative 
research, pursued with a toolkit of analysis techniques that he has developed. The 
book contains three arguments that are developed simultaneously: (1) case-oriented 
explanation remains vital to the social sciences; (2) fuzzy-set-based Qualitative 
Comparative Analysis (fsQCA) offers tools that enable the construction and evaluation of 
complex causal explanations; (3) prevailing forms of quantitative analysis cannot 
elucidate causal complexity and are therefore ill-suited to small-N research. 

I do not know of a single sociologist who would disagree with Ragin’s position 
that case-oriented explanation is central to the sociological enterprise. This wide- 
spread agreement is, to no small degree, the result of Ragin’s 30 years of eloquent 
writing on this point, complemented by longstanding admiration for the highest 
quality ethnographic and historical work in the sociology canon. In Redesigning 
Social Inquiry, Ragin reminds all of us why we feel this way. 

Although the specific details of fsQCA are well beyond a short review of this 
type, Redesigning Social Inquiry succeeds in its goal of explaining that the utility 
of fsQCA is “to explore evidence descriptively and configurationally, with an eye 
toward the different ways causally relevant conditions may combine to produce 
a given outcome.”(141) The demonstration proceeds in orderly fashion through 
all 11 chapters, with a sufficient number of clear, detailed examples. 

In Redesigning Social Inquiry, Ragin also moves QCA and fsQCA away from 
reliance on notions of sufficiency and necessity in defining causal effects and 
instead embraces counterfactual dependence as well. This was a major weakness 
(by my reading) of the foundational literature on QCA. Too many researchers 
came to believe that simple notions of sufficiency and necessity could be inferred 
from near-invariant associations in small-N research, which thereby cut them off 
from dominant currents of thought on causality in both philosophy and quantita- 
tive methodology. Ragin and co-author John Sonnett show how counterfactuals 
fit nicely within both QCA and fsQCA, and this is an important development. 

I expect that Redesigning Social Inquiry will also increase the usage of fsQCA 
among comparativist researchers. One often hears the relevance of fsQCA chal- 
lenged among those most at risk of deploying it. Is it worth the effort? Does 
the Boolean algebra kill off the scholarly impulse? Does the formality constrain 
creativity? Does it suppress narrative detail? Does it give sufficient scope to com- 
plex narrative time? In Redesigning Social Inquiry, Ragin takes on these whispered 
critiques with full force, not by direct confrontation, but through convincing 
demonstrations of the power of fsQCA. I came away from the book with a deeper 
appreciation for the power of fsQCA to help build subtle causal explanations, and 
I suspect this will be true for all readers. 

No review is complete without at least some critical content, and in this regard 
there is low-hanging fruit in Redesigning Social Inquiry. It is perhaps common in 
pioneering methodological work to conjure up an opposition, but the enmity in 
this book is excessive. According to Ragin, “conventional quantitative research” 
(1) is obsessed with correlations to the exclusion of more fundamental joint distributions, 
(2) ignores set relations, relying only on linear associations between interval-scaled 
variables, (3) universally condemns any examination of cases selected on the 
outcome variable, (4) adopts linear additive models of causation, only paying lip 
service to nonlinear and interactive effects, and (5) has no interest in understanding 
the joint distributions of the causal variables that it implores should be included 
in “net effects” regression models.(cf., 16, 22, 29, 149, 157) Even when moving 
beyond such “conventional” practice, with references to the work in counterfactual 
causal analysis that has developed since the 1990s, the commentary is both out of 
date and incorrect. This less conventional, but increasingly important, literature 
gives substantial attention to modeling of causal effect heterogeneity, interactions 
and mechanisms, all within a framework that usefully formalizes counterfactual 
dependence. There is much common ground that could have been discussed when 
extolling the necessity of considering causal complexity through fsQCA. Instead, 
Ragin offers comments on this literature only in the service of explaining its purported 
irrelevance to small-N research, offering shallow statements on the necessity 
of large-Ns and the necessity of manipulability. 

In the end, Redesigning Social Inquiry would be easier to recommend if its criti- 
cism of conventional quantitative research had been replaced with a deeper engage- 
ment of state-of-the-art quantitative causal analysis. As it stands, I can therefore 
recommend only two-thirds of the book. But for this majority of pages, it is a 
hearty recommendation. 





Portable Communities: The Social Dynamics of Online and Mobile Connectedness 
By Mary Chayko 
SUNY Press. 2008. 318 pages. $74.50 cloth, $24.95 paper. 


Reviewer: Danah Boyd, Microsoft Research and Harvard Berkman Center 


There is little doubt that networked technologies have enabled people to connect in 
new ways. From the earliest days of the internet, scholars have worked to make sense 
of online communities and how people leverage technology to engage in a vast array 
of everyday practices. In that tradition, Mary Chayko’s Portable Communities seeks 
to explore “the social dynamics of online and mobile connectedness.” 

The primary contribution that Chayko tries to make concerns the framing of 
this historical trajectory. She introduces the phrase “portable communities” in op- 
position to Howard Rheingold’s “virtual communities” to challenge assumptions 
embedded in the latter. Chayko argues that there is a tendency to view that which 
is virtual as “imaginary” and “not real” (although she does not take up an alterna- 
tive reading: “not physical”). She rightly points out — and highlights throughout 
the book — that mediated connections are very much real. As such, she believes 
that labeling such communities as “virtual” sends a “damaging message.” (10) 

While her critique of “virtual” is well-taken, her choice of “portable communi- 
ties” creates unnecessary confusion. She defines portable communities as “groups 
whose members connect via online and mobile technologies whether they meet 
face-to-face frequently, occasionally, or never.”(8) The goal of such a move is to 
get away from frameworks that focus explicitly on the online component of con- 
nections or those that create an unnecessary dichotomy. Yet, both “communities” 
and “portable” send mixed signals. Although she introduces “communities” as collections 
of people and uses “portable communities” in this vein, she also employs 
“online communities” to refer to online spaces where people gather. Furthermore, 
because the term “portable” is typically used in relation to mobile devices, this 
word carries its own baggage. 

It’s also difficult to understand why portability is the most salient shift to high- 
light. Communities have never been wholly co-present and, as such, they've always 
been partially imagined. Taking the community with you is not as significant as 
being able to access it regardless of physical location. As such, mediating technolo- 
gies (and especially mobile ones) erase the need to think of location when thinking 
of community, making portability seem like an odd choice as it is explicitly about 
location and movement. What Chayko may really be getting at is the emergence 
of ever-present communities thanks to mediating technologies. 

Chayko also introduces the phrase “sociomental” in opposition to “virtual” to 
emphasize the imagined or cognitive construction of social exchanges. “Socio- 
mental space is the cognitive analog to physical space. It is a kind of mental habitat 
where portable communities “gather” and where cyberspace can be said to be 
situated.”(22) In laying out this argument, she focuses on the in-betweenness of 
mediated social life. She argues that technology alters people's sense of presence or 
togetherness, creating a cognitive connectedness that feels quite distinct. 

With these conceptual frameworks in place, Chayko then turns to consider dif- 
ferent dynamics of communities. She covers a broad terrain, providing a contem- 
porary map of the central issues that commonly emerge when scholars examine life 
online: intimacy, connection and support; hanging out and playing; socializing, 
flirting and dating; privacy, control and availability; identity and self-expression; 
and inequality and power. The bulk of the book weaves together the perspectives 
of 87 informants, the public accounts of others through blogs and traditional 
media, and the voices of other scholars’ informants as found in their publications. 
Chayko’s informants are primarily American adults drawn from convenience and 
snowball sampling; her interviews are conducted online. Rather than using these different 
voices to anchor her own claims, Chayko primarily draws on others’ words to 
shed light on, and highlight the contributions made by, previous scholars investigating 
mediated sociality. In essence, the voices are exemplary glue that helps craft a 
contiguous story from the claims made about mediated interaction over the years. 

While the voices Chayko uses enrich the narrative, her choice to label her informants 
by what they do (e.g., MobileUser5) dehumanizes their voices. Although 
background on each informant is provided in the appendix, this move makes it 
hard to get a sense of the people behind the voices. This is not to say that names 
alone provide rich information about informants, but it feels peculiar to boil 
people down to what Chayko sees as their salient mediated practice. While the 
naming choice creates unnecessary distance, the voices themselves are quite 
compelling and work well alongside the literature-driven claims. Reading these 
alone sheds light on contemporary mediated practices. 
Although some of the linguistic and stylistic choices are puzzling, this book 
successfully documents the current state of thinking surrounding mediated so- 
ciality, using personable voices to flesh out what it means to live in “portable 
communities” today. From looking at how people connect during an emergency 
to the voyeuristic habits of participants, Chayko provides a taste of a wide swath 
of practices in mediated life. 
Unanticipated Gains: Origins of Network Inequality in Everyday Life 
By Mario Luis Small 
Oxford University Press. 2009. 312 pages. $29.95 cloth. 


Reviewer: Michael McQuarrie, University of California, Davis 


To date, sociological research on social capital has mostly treated it as an individual 
attribute that is the product of the number and type of connections an individual 
has with other individuals. While generally successful, this research also developed 
in ways that were enabled by social network analysis, sometimes at the expense 
of other dimensions of social capital such as the contexts and settings that shape 
its development. Mario Luis Small’s new book on childcare centers in New York 
corrects this bias and will decisively reorient sociological research on social capital 
away from network connections among individuals to the organizational and 
institutional settings that enable and constrain network connections. He calls this 
the “organizational embeddedness perspective.” 

To address the gap he identifies in social capital research, Small takes a meth- 
odologically pluralistic approach that utilizes different methods and data sources 
to address the key question: the significance of organizations and institutions in 
the production of social capital. For individual-level data he utilizes the Fragile 
Families Survey in combination with in-depth interviews of parents who make use 
of the centers he studied. For organizational data he utilizes a survey of 300 centers 
alongside case studies of 23 centers. The choice of childcare centers is admirable. 
The organizational and institutional variety encompassed by childcare centers and 
the constant nature of client interest enables Small to capture significant variety 
and range within a nonetheless usefully delimited object of analysis. Small’s bril- 
liance as a sociologist shines through in the research design which yields data that 
are straightforward and convincing. As a consequence, Small has no need to yield 
ground to conceptual imprecision or interpretive license. This book is an excellent 
example of the positive benefits of skillful research design, especially in combina- 
tion with multiple methods. 

The analytical content of the book is straightforward and clear. Each chapter 
considers a key question contained in the subtitle. Often the chapters open with 
questions that emerge from qualitative interview data which are then precisely 
answered using quantitative data. Small finds that the well-being of mothers was 
higher when they had children in daycare, and that such mothers were less likely 
to be socially isolated. This well-being is closely tied to the friendships mothers 
make in childcare centers, friendships that are, in part, produced by the way 
centers reconfigure how mothers use their time. Centers encourage participation 
and cooperation while also enabling time-constrained mothers to pursue multiple 
goals at once. 

In addition to these useful findings, Small discovers that strong ties predomi- 
nate at the center, but they are also “domain-specific,” that is, they tend not to 
carry over to settings beyond childcare. This combination of the characteristics of 
“weak” and “strong” ties is probed by Small to reveal that in forming these types 
of ties mothers are taking advantage of the relationship-sustaining “work” that is 
accomplished by the organization. 

Usefully for people with more of a background studying organizations, Small 
goes beyond interpersonal ties to consider inter-organizational ties. Here he is on 
much less firm ground and makes much less use of the existing literature, but he 
nonetheless establishes his key point. Namely, childcare centers act as “brokers” 
that connect mothers not just to other mothers but to other organizations and 
their resources. These ties do not simply produce more of what social ties produce, 
but add distinctive value to the utility of the centers. 

One can quibble with aspects of Small’s treatment, but most of these issues are 
not central to the analysis. There is an issue with the framing of the book since, 
for many, social capital is a concept that is specifically designed to highlight the 
importance of organizations (though the salience of organizations is usually as- 
sumed in this literature rather than demonstrated). Small mobilizes his findings 
to question Wilson's deinstitutionalization thesis even though he does not have 
the longitudinal and comparative data necessary to address it. An issue of greater 
analytical salience is the lack of attention to research on organizations. This inat- 
tention enables a treatment of organizations and fields as unproblematic contain- 
ers of resources rather than socially productive entities that mobilize resources in 
ways that can have a variety of sociologically salient effects. 

Nonetheless, these are quibbles. The point of the book is to assess the signifi- 
cance and theoretical salience of organizations and institutions in the production 
of social capital. Small addresses this question decisively, demonstrates a model 
research design for meso-level analysis of the connections between micro processes 
and macro structures and, in doing so, he charts a new direction for research on 
both social capital and contemporary cities. There are gaps in Small’s consider- 
ation, but he is not attempting a total analysis of childcare centers. He set out 
to answer a research question and, at the same time, help reorient the agenda of 
urban and social capital researchers. This combined goal is ambitious enough, and 
Unanticipated Gains fulfills it admirably. 
Trends in Global Gender Inequality 


Shawn F. Dorius, University of Michigan 
Glenn Firebaugh, Pennsylvania State University 


This study investigates trends in gender inequality throughout the world. Using data 
encompassing a large majority of the world’s population, we examine trends in re- 
cent decades for key indicators of gender inequality in education, mortality, political 
representation and economic activity. We find that gender inequality is declining in 
virtually all major domains, that the decline is occurring across diverse religious and 
cultural traditions, and that population growth is slowing the decline because popu- 
lations are growing faster in countries where there is the greatest gender inequality. 


This study investigates whether, globally, women and men are converging or di- 
verging with respect to key dimensions of welfare such as educational attainment, 
economic activity, length of life and representation in national legislatures. Several 
studies find evidence of cross-country or global convergence—or at least the halting 
of divergence—on a number of welfare indicators, including education (Morrisson 
and Murtin 2007; Goesling and Baker 2008), income (Firebaugh 2003; Firebaugh 
and Goesling 2004; Sala-i-Martin 2006), fertility (Dorius 2008; Wilson 2001) 
and the Human Development Index (Crafts 2002), leading one development 
scholar to declare that “nearly everything that matters is converging.” (Kenny 
2005:1) The question is whether Kenny’s declaration applies to gender inequality 
as well. 

Gender inequality differs from other types of inequality in significant ways, 
so we cannot tacitly assume that gender inequality traces the same path as other 
inequalities. Gender inequality exists when men (or women) enjoy a dispropor- 
tionately large share of some valued good such as political power or long life. Men 
and women can differ in any number of domains, so gender inequality intersects 
other types of inequality (educational, economic, political and so on). Women 
may be gaining on men in some domains and falling further behind in other 
domains. In a few domains, such as life expectancy, women have the advantage, 
so “declining gender inequality” in this instance refers to men gaining on women. 
These features of gender inequality imply that it is important to examine the 
domains of gender inequality separately. 

Our objective in this article is to describe recent trends in global gender 
inequality as measured by convergence or divergence of indicators used by the 
United Nations to monitor progress toward the Millennium Development 
Goals (United Nations 2008). 


What is Global Gender Inequality? 


First consider what is meant by gender inequality. Assuming a population that 
consists of 50 percent women and 50 percent men (as is roughly the case for 
the world; see Appendix B), there is gender equality on some characteristic Y when 
females constitute half of those with Y. Gender inequality occurs, then, when 
women and men deviate from their 50 percent share. It is a simple matter to gauge 
inequality by the absolute value of the observed difference between men’s share 
and women’s share, that is, gender inequality = $|s_f - s_m|$, where $s_f$ (female share) 
is the number of women with Y divided by the total population with Y and $s_m$ 
(male share) is the number of men with Y divided by the total population with Y. 

It turns out that the absolute difference between $s_f$ and $s_m$ is exactly the Gini 
coefficient for gender inequality in the case where there is an equal population of 
men and women (Subramanian 2002). Of course, in the real world the population 
of men and women is only approximately the same, so to measure gender inequal- 
ity we use a more precise formula for the Gini for two groups (Nielsen 1994:658): 


$$\text{Gender inequality} = \sum_{g} |s_g - p_g| \qquad (1)$$


where $s_g$ is the share of Y for the $g$th group (here, women or men) and $p_g$ is the 
population share of the $g$th group. We use Equation 1, a Gini measure, because 
we know that Ginis meet the key requirements for measures of inequality (Allison 
1978). (Other measures that are sometimes used, such as female/male ratios, do 
not necessarily meet the axiomatic requirements for measures of inequality.) In ad- 
dition, with Equation 1 we can compare global gender inequality to other global 
inequalities, which very often are measured as Ginis. 
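
As a rough illustration of the two-group Gini described above, the sketch below reads Equation 1 as the sum, over women and men, of the absolute gap between each group's share of Y and its population share. That reading is an assumption, chosen because it agrees with the Table 1 calculation later in the article. The function name and the input counts are hypothetical.

```python
def two_group_gini(y_women, y_men, pop_women, pop_men):
    """Sum over the two groups of |share of Y - population share|.

    Assumed reading of Equation 1; all inputs are raw counts.
    """
    total_y = y_women + y_men
    total_pop = pop_women + pop_men
    gini = 0.0
    for y, pop in ((y_women, pop_women), (y_men, pop_men)):
        s_g = y / total_y        # group's share of the valued good Y
        p_g = pop / total_pop    # group's share of the population
        gini += abs(s_g - p_g)
    return gini


# Hypothetical counts. With equal populations the measure reduces to
# |s_f - s_m|: shares of .40 and .60 give .20.
equal_pop = two_group_gini(40, 60, 500, 500)

# With unequal populations (52 percent women) the same shares of Y
# give a larger value, because women's share is further below parity.
unequal_pop = two_group_gini(40, 60, 520, 480)

print(round(equal_pop, 3), round(unequal_pop, 3))
```

With equal populations the result matches the simple difference in shares; once the population split departs from 50/50, the population-share term is what keeps the measure anchored to parity.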

Because data for estimating gender inequality are available at the national 
level, researchers must make a determination among three competing methods 
of aggregating these national bundles of gender data to the world level. Each way 
corresponds to one of Milanovic’s (2005) three concepts of inequality, and each 
allows the researcher to describe fundamentally different types of cross-national 
or world inequality. One way is to take the simple average of nations’ gender 
inequality. For Equation 1, this is: 

$$\text{Gender inequality} = \frac{1}{N} \sum_{j} \sum_{g} |s_{gj} - p_{gj}|, \quad j = 1, 2, \ldots, N \text{ countries} \qquad (2)$$
Equation 2 is comparable to Milanovic’s Concept 1 inequality, which he calls 
“unweighted international inequality.” Concept 1 inequality would be used, for 
example, to address the question of whether gender inequality is rising or falling 
in the average country, where size of country is not taken into account (Klasen 
2005). Similarly, Concept 1 inequality would be useful for ranking nations on 
one or more gendered outcomes (see, for example, Hausmann, Tyson and Zahidi 
2007; United Nations 2000). 

Alternatively, we could calculate the population-weighted average of nations’ 
gender inequality: 

$$\text{Gender inequality} = \sum_{j} \pi_j \sum_{g} |s_{gj} - p_{gj}|, \quad j = 1, 2, \ldots, N \text{ countries} \qquad (3)$$

where $\pi_j$ is the proportion of the world’s population residing in country j. Equation 
3 is comparable to Milanovic’s Concept 2 inequality, which he calls “weighted 
international inequality.” In the case of gender, Concept 2 inequality would be 
used, for example, to determine the level of national inequality where the average 
person lives (Hausmann, Tyson and Zahidi 2007). 

Finally, instead of calculating $s_g$ and $p_g$ separately for each country, plugging 
the values into Equation 3, and summing over countries, we could add up the 
raw numbers in each country to calculate $s_g$ and $p_g$ for the world. We could, for 
example, use data on population size and gender-specific literacy rates for each 
country to calculate the number of literate men and women in the world, and 
calculate $s_g$ = number of literate men (women) in the world/total literate for the world. 
In this case the Gini is simply Equation 1 applied to the whole world: 


$$\text{Global gender inequality} = \sum_{g} |s_g - p_g| = |s_w - p_w| \times 2 \qquad (4)$$


where $s_w$ is women’s share of Y worldwide and $p_w$ is women’s share of the world 
population. 

Equation 4 is comparable to Milanovic’s Concept 3 inequality, which he calls 
“true world inequality.” In the case of gender, this type of inequality conceptually 
bears on the question of whether the world’s citizens are living in a world where 
men and women are converging. Concepts 2 and 3 are not the same. For the world 
(Concept 3) there are currently about as many women as men pursuing tertiary 
education (the global Gini is close to zero). That does not mean, however, that the 
average person lives in a country where gender inequality is zero (Concept 2 inequal- 
ity) because enrollments favor men in some countries and women in other countries. 
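
The difference among the three concepts can be sketched with a toy calculation. The two countries below are entirely hypothetical and are chosen so that enrollment favors men in one and women in the other; the class and function names are illustrative only, and the two-group Gini is taken to be twice the gap between women's share of Y and women's population share, as in the article's Table 1 example.

```python
from dataclasses import dataclass


@dataclass
class Country:
    name: str
    pop_women: float
    pop_men: float
    y_women: float  # women holding the valued good Y (e.g., enrolled students)
    y_men: float    # men holding Y


def two_group_gini(y_women, y_men, pop_women, pop_men):
    """Two-group Gini: with two groups the men's gap mirrors the women's,
    so the sum over groups equals 2 * |s_w - p_w|."""
    s_w = y_women / (y_women + y_men)
    p_w = pop_women / (pop_women + pop_men)
    return 2 * abs(s_w - p_w)


# Hypothetical data: Y favors men in A and women in B.
countries = [
    Country("A", pop_women=50, pop_men=50, y_women=10, y_men=30),
    Country("B", pop_women=150, pop_men=150, y_women=120, y_men=100),
]

ginis = [two_group_gini(c.y_women, c.y_men, c.pop_women, c.pop_men)
         for c in countries]

# Concept 1: unweighted average of country Ginis.
concept1 = sum(ginis) / len(ginis)

# Concept 2: average weighted by each country's share of world population.
world_pop = sum(c.pop_women + c.pop_men for c in countries)
concept2 = sum(((c.pop_women + c.pop_men) / world_pop) * g
               for c, g in zip(countries, ginis))

# Concept 3: pool the raw counts first, then compute one Gini for the world.
concept3 = two_group_gini(
    sum(c.y_women for c in countries),
    sum(c.y_men for c in countries),
    sum(c.pop_women for c in countries),
    sum(c.pop_men for c in countries),
)

print(round(concept1, 3), round(concept2, 3), round(concept3, 3))
```

In this constructed case the pooled (Concept 3) value is zero even though both country-level Ginis are positive, which is the same pattern the text describes for tertiary enrollment: the country gaps point in opposite directions and cancel once national borders are ignored.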

The defining feature of Concept 3 (global) inequality, Milanovic’s (2005:10) 
“true world inequality,” is that it refers to inequality over the world treated as a 
single piece. Global gender inequality refers then to gender inequality for the world 
as a whole, as if there were no national borders. In the case of global inequality, 
individuals are viewed as citizens of the world, and their location in a particular 
Table 1: Literate Adults in 1970 for 134 Countries 

                                        Estimated Population 
                                        (millions)              Women’s Share 
Total number of adults, age 15+         2,103 
Total number of adult women, age 15+    1,059                   .504 
Total number of literate adults         1,374 
Total number of literate adult women      615                   .448 


Notes: Literacy is defined as the percentage of females and males ages 15 and older 
who can, with understanding, read and write a short, simple statement about their 
everyday lives. See Appendix A for data sources. The women’s population share reported 
above (.504) varies slightly from the share reported in Appendix A (.507) because the 
latter is based on more than 134 countries. 


nation is relevant only to the extent that the data used to calculate global inequal- 
ity typically are collected at the national level. Concept 3 inequality is relevant, 
for example, to supra-national organizations such as the United Nations, which 
are interested in trends for the world, and not just trends in individual countries. 

Table 1 illustrates Concept 3 inequality using adult literacy. Complete male 
and female literacy data are available for 134 countries in 1970, representing 
approximately 91 percent of the world’s population. When we sum the numbers 
across the 134 countries we find that the approximately 2.1 billion total adults 
in those countries were nearly evenly split between women and men (50.4% 
women, 49.6% men). Among those who are literate, however, only 44.8 per- 
cent are women, indicating that women constitute a smaller share of the literate 
population than they do of the total population. On the basis of these data and 
Equation 4, global gender inequality on literacy in 1970 as measured by the Gini 
is .112, the difference between .504 and .448, times 2. 
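
The arithmetic in this paragraph can be checked directly from the counts reported in Table 1 (in millions). The variable names below are illustrative; the only formula assumed is the one the text states, namely twice the gap between women's population share and women's share of the literate.

```python
# Counts from Table 1, in millions.
adults_total = 2103
adult_women = 1059
literate_total = 1374
literate_women = 615

# Women's share of the adult population and of the literate population.
p_w = adult_women / adults_total
s_w = literate_women / literate_total

# Global (Concept 3) gender inequality on literacy: the gap, times 2.
gini_literacy_1970 = 2 * abs(s_w - p_w)

print(round(p_w, 3), round(s_w, 3), round(gini_literacy_1970, 3))
```

Rounded to three decimals, the shares come out at .504 and .448 and the Gini at .112, matching the figures in the text.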

It is Concept 3, or global gender inequality, that is our primary focus in this article, 
where we describe recent trends in global gender inequality across a broad range of 
indicators including adult literacy; enrollments in primary, secondary and tertiary 
education; total years of schooling completed; national parliamentarians; economi- 
cally active workers; adult mortality; and life expectancy at birth. It is important to 
stress that viewing individuals as citizens of the world is the defining feature of global 
inequality because that is what distinguishes Concept 3 inequality from concepts 
1 and 2. We want to know whether differences across important life domains are 
growing or declining when we compare the world’s women to the world’s men, and 
thus we ask, “Is global society becoming more or less equal with regard to gender?” 


Gender Inequalities: Unique Among Other Inequalities 


Unlike most other types of inequality, gender inequality is not additively decomposable; 
that is, total gender inequality is not the sum of within-region and between-region 
gender inequalities. This is the case because regional gender inequalities can 
offset one another. The possibility of offsetting effects is of most concern to the 
present study because it draws attention to the danger of inferring global gender 
inequality trends from average country trends (whether unweighted or weighted). 

To illustrate, if men’s literacy rates are higher in some regions of the world and 
women’s rates are higher in other regions, then global gender inequality could 
be zero despite the gender disparities within regions. Contrast this with income 
inequality, for example, where global income inequality exists if there is inequality 
in any part of the world (see formulas in Firebaugh 2003). 

Global gender inequality nonetheless is sometimes inferred from the simple or 
weighted average gender inequality of countries. Neither method reliably portrays 
the level of global inequality. Consider population-weighted averages first. In 
some regions of the world, college enrollment is now higher for women than for 
men; in other regions enrollment continues to be higher for men. A population- 
weighted average of the several within-region inequalities in college enrollment 
overstates the global level of inequality in enrollment by failing to capture the 
offsetting effects of the regional inequalities. 

Unweighted or “simple” country averages can be even more unreliable because 
unweighted averages calculated from country scores give undue weight to small 
countries and deflate the contribution of highly populous countries to world trends. 
Thus individual women and men of India, who comprise about 18 percent of the 
world’s adult population, have much less weight than individual women and men 
of Luxembourg, who comprise .007 percent of the world’s population. Table 2, 
where country averages are compared to global estimates with respect to adult sur- 
vival rates for women (the probability that a 15-year-old female will survive to age 
60), illustrates the danger of using simple country averages to analyze global trends 
in gender inequality. On the basis of country averages, we conclude that this prob- 
ability rose from .713 in 1960 to .792 in 2000, an increase of 11.1 percent. The use 
of global data (ignoring national boundaries) provides very different estimates of 
the probability that a 15-year-old female citizen of the world will survive to age 60. 
In 1960 that probability was .656; 40 years later it had increased by 27.6 percent 
to .837. In this case, the global female survival rate improved faster than female 
survival in the average country because country averages overstated female survival 
in 1960 and understated the gains for women in the ensuing 40 years. 
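
The percentage changes quoted above follow from the survival probabilities reported in Table 2. A minimal check, with an illustrative helper name:

```python
def percent_change(start, end):
    """Relative change from start to end, expressed in percent."""
    return 100 * (end - start) / start


# Female adult survival probabilities (ages 15 to 60) from Table 2.
country_avg_change = percent_change(0.713, 0.792)  # average country
global_change = percent_change(0.656, 0.837)       # world's women, pooled

print(round(country_avg_change, 1), round(global_change, 1))
```

The two results round to 11.1 and 27.6 percent. The gap between them reflects both the lower 1960 starting point and the higher 2000 endpoint of the pooled global figure.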

Much of what we know about global gender convergence to date comes from 
the averaging of trends across countries. Because such studies tell us more about 
gender inequality for the average country than they do about gender inequality 
for the world, it is important to verify the received wisdom about global gender 
inequality by using Concept 3 inequality. 

Hence the first aim of our study is to use the Table 1 method to provide more 
reliable estimates of gender convergence/divergence worldwide. We want to know 
whether, and how fast, women are catching up with men in various domains. 
Table 2: Change in Women’s Status in the Average Country vs. Change for the 
World’s Women: Survival Probabilities for Women, 1960 vs. 2000 

                   Probability of Survival, Ages 15 to 60 
                   1960      2000      % Change 
Country Average    .713      .792      11.1 
Global             .656      .837      27.6 

Notes: Change in women’s status as measured by change in the female adult survival 
probability, that is, the probability that a 15-year-old female will survive to age 60. See 
Appendix A for data sources. 


Our strategy is first to establish the general global contours of gender inequal- 
ity (so that we know what it is that we are trying to account for), then to turn 
to the country and regional patterns to assist in our understanding of the global 
trends. In accounting for the global trends, we hypothesize that (1) gender 
convergence is slowed globally by regional variation in population growth and (2) 
gender convergence is not a uniform process across the world’s regions. 


Three Hypotheses About Global Gender Inequality 


Hypothesis 1. Global gender inequality is declining across a wide variety of welfare 
measures. 


Reports such as the World Economic Forum’s “The Global Gender Gap Report” 
(Hausmann, Tyson and Zahidi 2007) suggest that women’s position relative to 
men’s has improved in the average country. We hypothesize that the direction of 
the trends (though not necessarily their magnitudes) will be the same for global 
gender inequality as it is for gender inequality in the average country. 

There are plausible theoretical grounds for this prediction. Both exogenous 
and endogenous change models point to declining global gender inequality. We 
refer to world polity/global culture theory as an exogenous model because it posits 
that institutional and cultural change at the nation-state level is driven by supra- 
national forces (Meyer et al. 1997; Boli and Thomas 1997). Proponents stress 
the importance of a network of international actors (non-governmental organiza- 
tions, international non-governmental organizations and, more recently, women’s 
international non-governmental organizations) that share deep similarities in 
organizational structure, institutional goals and values, and means of achieving 
institutional missions. This transnational network of actors, including nation-state 
member organizations such as the United Nations, comprises the world polity, a 
critical vehicle for the spread of a world culture (Boli and Thomas 1997). Evidence 
of the world culture is found in the proliferation of world conferences such as the 
Fourth World Conference on Women (Beijing 1995), and in initiatives by nation-state 
member organizations (the United Nations’ Millennium Development 
Goals and the Convention on the Elimination of All Forms of Discrimination 
Against Women) where universal goals of the world society are espoused. 

Beyond shared values, members of the world polity work to enact change 
within countries all over the world. The result is persistent exogenous pressure 
to conform to values and practices aligning with the world culture (Strang and 
Meyer 1993). According to proponents of the exogenous model, ideologies of 
progress and economic growth, social equality (including gender equality), and 
the political and human rights of the individual gradually diffuse across coun- 
tries regardless of cultural traditions, level of economic development or political 
orientation. The result is that countries increasingly adopt broadly similar goals, 
policies and practices. As Paxton et al. (2006:902) put it in their study of women 
in politics, “this pressure is universal, having an impact on all countries and all 
political milestones for women.” 

Within the exogenous model paradigm there is a growing body of work that 
focuses explicitly on gender, and it is this literature that makes the most direct 
case for Hypothesis 1. Beyond the general norms of equality, progress, ratio- 
nality and modernization through economic development, we have, in recent 
years, witnessed the proliferation of “national machineries for the advancement 
of women” across the globe (True and Mintrom 2001:30). The growing cadre of 
national and international women’s institutions shares the common goal of gen- 
der mainstreaming across all domains where emerging and longstanding gender 
inequalities exist. On at least one key measure of human welfare, representation 
in national parliaments, previous research indicates that world polity models 
exert considerable influence on improvement in women’s status around the world 
(Paxton et al. 2006; Ramirez et al. 1997). This research suggests a global trifecta for 
women, with nations, transnational networks of non-state actors (NGOs, INGOs 
and WINGOs), and nation-state member organizations (IGOs) contributing to 
change in gender policies that ultimately lead to more equal gender outcomes. 

Alternative theories lead to the same general conclusion, but place more em- 
phasis on endogenous sources as engines of change for gender inequality. Chief 
among these endogenous sources is economic growth. Modernization theory 
(what Forsythe, Korzeniewicz and Durrant 2000, refer to as the “Modernization- 
Neoclassical Approach”) has long argued that economic development will improve 
social conditions and lead to broad improvements in human welfare. At its peak, 
modernization theory led to the articulation of the Women in Development and 
the Gender and Development movements (see Koczberski 1998 for a review). 
Both movements emphasize the positive role of economic development in im- 
proving the position of women around the world. As industrialization spreads, it 
brings with it many of the trappings of the civil society, including “universalistic 
mechanisms for allocating people to jobs.” (Chang 2000: 1660) This line of reason- 
ing posits that economic growth and institutions common to industrial societies 


1948 © Social Forces 88(5) 


facilitate the decline in gender differences. The continued influence of modernization 
as a tool for social equality is evident in global economic institutions, such 
as the World Bank, that search for “business reasons for gender equity.” In this 
way the provision of resources for women’s education, family planning and micro-credit 
are all justifiable, ultimately, as serving market-based economic growth and 
efficiency norms (Goetz 2001). Insofar as gender equality has been successfully 
linked to the development project (Berkovitch and Bradley 1999), it appears to 
have garnered greater international support. 

Finally, the world is becoming more interconnected as the flow of informa- 
tion, communication, goods, services and people continues to expand. The in- 
terconnectivity of the world’s people through economic development points to 
the leavening effect of technological change, particularly as new technologies 
are adopted and spread (for evidence of the global spread of technologies key 
to cultural diffusion see Kenny 2005). The breadth of the flow of people and 
technologies across different regions and countries exposes the world’s citizens to 
new ideas, institutions and practices that directly or indirectly affect the status of 
women relative to men. Modernity in all its forms—institutional, ideational and 
structural — penetrates ever farther into the less developed world. The increasing 
adoption of modern contraceptive methods in some of the world’s poorest regions 
is a case in point (Casterline 2001; Zlidar et al. 2003). 


Hypothesis 2. The decline in global gender inequality is being slowed by country 
differences in population growth rates. 


Hypothesis 1 anticipates a general decline in global gender inequality across 
a broad range of welfare measures. Hypothesis 2 posits that the decline would 
have been even faster if populations had grown at the same rate for all countries. 
Instead, population has tended to grow the most rapidly in regions of the world 
where gender inequality is highest, and this has served to slow the rate of decline 
in gender inequality globally. 
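
The braking effect posited in Hypothesis 2 can be illustrated with a toy counterfactual. The sketch below is not the authors' estimation procedure; it uses two hypothetical countries with invented literacy rates, and compares the pooled (Concept 3) Gini under the actual population growth with the Gini that would result if both countries had grown at the same rate (which leaves their population shares unchanged).

```python
def global_gini(countries):
    """Pooled two-group Gini, 2 * |s_w - p_w|.

    countries: list of (pop_women, pop_men, rate_women, rate_men),
    where the rates are the proportions of each group holding Y.
    """
    y_w = sum(pw * rw for pw, pm, rw, rm in countries)
    y_m = sum(pm * rm for pw, pm, rw, rm in countries)
    pop_w = sum(pw for pw, pm, rw, rm in countries)
    pop_m = sum(pm for pw, pm, rw, rm in countries)
    s_w = y_w / (y_w + y_m)
    p_w = pop_w / (pop_w + pop_m)
    return 2 * abs(s_w - p_w)


# Hypothetical data. First tuple: low-inequality country; second:
# high-inequality country. The gender gap narrows in both over time.
start = [(50, 50, 0.90, 0.95), (50, 50, 0.30, 0.60)]

# Actual end point: the high-inequality country's population doubles
# while the low-inequality country grows by 10 percent.
end_actual = [(55, 55, 0.95, 0.97), (100, 100, 0.50, 0.70)]

# Counterfactual end point: same end-period rates, but equal growth in
# both countries, so the starting populations can stand in for the shares.
end_equal_growth = [(50, 50, 0.95, 0.97), (50, 50, 0.50, 0.70)]

gini_start = global_gini(start)
gini_actual = global_gini(end_actual)
gini_equal_growth = global_gini(end_equal_growth)

print(round(gini_start, 3), round(gini_actual, 3), round(gini_equal_growth, 3))
```

In this constructed example global inequality falls in both scenarios, but it falls further when populations grow at the same rate, because the more unequal country carries less weight in the pooled totals.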

Table 3 presents preliminary evidence on the potential braking effect of the 
world’s uneven population growth. To construct Table 3 we grouped the world’s 
countries into quartiles on the basis of their level of gender inequality and then 
calculated percentage increase in population from 1970 to 2000 for each quartile. 
Table 3 compares the population increases for the lowest quartile (the quartile 
containing the countries with the lowest levels of gender inequality on literacy, 
adult survival and economically active workers, respectively) with the quartile 
containing the 25 percent of countries with the highest levels of gender inequality 
on those indicators. As Table 3 shows, population growth is significantly higher 
in those countries where gender inequality is the greatest. 

The positive association between population growth and gender inequality 
seen in Table 3 is not surprising since poorer regions of the world tend to have 
Table 3: Recent Percentage Population Increase for Countries Grouped by Gender 
Inequality Quartile in 2000 

                              Gender Inequality 
                              Lowest Quartile    Highest Quartile 
Literate Adults                43.2              200.1 
Surviving Adults               77.0              166.7 
Economically Active Adults     81.7              191.6 


Notes: Gender inequality was estimated using male and female share among all literate, 
surviving and working adults, respectively. Statistics reported here weight countries 
equally. See Appendix A for data sources. 


more traditional attitudes about gender roles (Inglehart and Norris 2003) and 
rapid population growth. Nonetheless we doubt that economic underdevelop- 
ment accounts for all the association between greater gender inequality and faster 
population growth; part of the relationship between gender inequality and popu- 
lation growth most likely is causal (Riley 1997) because a number of studies show 
that—independent of income—fertility tends to be higher where women have 
less input in decisions about family size (Dodoo 2002; Mason and Smith 2000). 
To appreciate the potential importance of uneven population growth for the 
global trend in gender inequality, note that from 1960 to 2005 sub-Saharan 
African and MENA (Middle East and North Africa) nations saw their share of 
the world’s people rise from about 14 percent to about 22 percent. Indeed, at the 
dawn of the 21st century, 95 percent of all births occurred in less developed countries 
(United Nations 1999a). These facts suggest that global gender inequality is 
growing unless the decline in gender inequality within nations is rapid enough 
to offset the inequality-boosting effect of faster population growth in the more 
unequal nations (which we believe is the case, as indicated by Hypothesis1). 


Hypothesis 3. The decline in global gender inequality is also being retarded by slow
and halting change in gender inequality in some non-Western cultural traditions.


We anticipate that notable relative gains for women in some cultural regions of
the world are being offset in part by negligible relative gains for women (perhaps
even losses for women) in other regions of the world. In other words, the overall
downward trend in gender inequality may be masking sharp differences in rates of
change across the world’s regions and cultural traditions. Hypothesis 3 does not
contradict Hypothesis 1 (we still anticipate that women’s advances are outstripping
men’s advances across a variety of domains), but it does qualify Hypothesis 1 by
positing that women’s relative gains are not advancing in lockstep across the
world’s regions.

In our discussion of Hypothesis 1 we speculated on the positive correlation
between gender equality and the rise and spread of a world culture. But some 
standing cultural traditions may be more resistant than others to change with 
respect to women’s rights, so we cannot assume that women’s gains are universal.
According to Inglehart and Norris (2003:9, italics in original), “cultural change is
a necessary condition for gender equality: women first need to change themselves
before they can hope to change society.” On the basis of their analysis of survey
data representing more than 70 societies from 1981 to 2001, Inglehart and Norris
provide evidence of a rising tide of support for gender equality (Inglehart and
Norris 2003), but a tide that is rising unevenly. The tide is so uneven, in fact, that
it appears to be leading to a growing gap between Western views about
equality and gender, and traditional values espoused in poorer societies.
Lopez-Claros and Zahidi (2005:11) draw similar conclusions to those of Inglehart and
Norris when they state that “there is little doubt that traditional, deeply conserva- 
tive attitudes regarding the role of women have made their integration into the 
world of public decision making extremely difficult.” 


Measuring Global Gender Inequality 


By relying on aggregate data for the entire world, this study is among the first to 
investigate the global trend in gender inequality. Unfortunately we cannot do this 
for all measures of interest (there is, for example, no reliable data on “men’s income” 
and “women’s income” for the world, nor is there reliable longitudinal data on 
caloric consumption by sex). We do, however, have sex-specific data for key welfare 
indicators in the areas of education, representation in national assemblies, economic 
activity and health (for prior studies using similar indicators, see Ashford 2001, 
2005; Bradley and Khor 1993; Mason 1986; United Nations 1995, 2000). 
Because we are interested in change in global inequality, we must employ indi- 
cators that, first, cover most of the world’s population and, second, are available for 
multiple points in time. In addition, because we are interested in gender inequality, 
we focus explicitly on domains where men and women can be compared (thus 
we focus on education, for example, as opposed to fertility or maternal mortality). 
We begin with the four indicators the United Nations (2004) uses to monitor
progress toward the Millennium Development Goal of promoting gender equality
and empowering women:⁴

1. Ratio of literate women to men, ages 15-24.

2. Ratio of girls to boys in primary, secondary and tertiary education.

3. Proportion of seats held by women in national parliaments.⁵

4. Proportion of wage earners in the non-agricultural sector who are women.

We follow the U.N. indicators exactly, with one exception: Instead of restricting
the age range to 15-24 for literacy, we use a more comprehensive measure based
on literacy rates for those ages 15 and older. Doing so allows us to assess change
in gender inequality in literacy for women vs. men in general, not just for young
women vs. young men.⁶

We supplement the U.N. indicators by (1) adding a measure of total years of
schooling completed and (2) adding two measures based on gender disparities
in mortality. As Edwards and Tuljapurkar (2005:665) observe, “Inequality in
life span is arguably the most fundamental inequality that exists among human
populations.” Our first indicator of gender inequality in mortality is the difference
between female and male life expectancy at birth. Our second is the gender disparity
in survival rates from age 15 to age 60, that is, the probability of a 15-year-old
surviving to age 60 on the basis of current age- and sex-specific survival rates.
Regrettably there are no reliable world data on healthy life span, so we
rely on data for actual life span in line with prior studies of global health inequality 
(e.g., Goesling and Firebaugh 2004). 
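
The adult survival indicator can be illustrated with a short calculation. The sketch below is only an illustration under stated assumptions: the five-year survival probabilities are hypothetical values, not figures from the data sources used here, and the function name is our own. It multiplies the conditional survival probabilities for the nine five-year age intervals between 15 and 60 to obtain the probability that a 15-year-old survives to age 60.

```python
from math import prod


def survival_15_to_60(five_year_survival):
    """Probability of surviving from age 15 to age 60.

    `five_year_survival` holds the nine conditional probabilities of
    surviving each five-year interval (15-19, 20-24, ..., 55-59),
    given survival to the start of that interval.
    """
    if len(five_year_survival) != 9:
        raise ValueError("expected nine five-year intervals between ages 15 and 60")
    return prod(five_year_survival)


# Hypothetical sex-specific probabilities (illustrative values only).
female = [0.995, 0.994, 0.993, 0.991, 0.988, 0.984, 0.978, 0.968, 0.952]
male = [0.993, 0.990, 0.988, 0.985, 0.980, 0.973, 0.962, 0.945, 0.920]

female_survival = survival_15_to_60(female)
male_survival = survival_15_to_60(male)
print(f"female: {female_survival:.3f}, male: {male_survival:.3f}")
```

With these made-up inputs the female probability exceeds the male probability, which is the direction of the gender gap in adult survival described in the text.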

Appendix A describes our data and sources in more detail, and Appendix B details 
the world population coverage by indicator and year. Data are drawn from a number 
of sources, including the World Development Indicators Database (World Bank 
2004), the United Nations Women’s Indicators and Statistics Database version 4.0 
(United Nations 1999b), the United Nations Common Database (United Nations 
2005a) and the Barro and Lee education attainment data set (Barro and Lee 2000). 
Results are based on a listwise complete set of countries, although the set of countries 
varies across indicators. We use different sets of countries for each indicator in order to 
maximize the number of available cases, but this reliance on different samples creates 
no serious problems of comparability across indicators because our data cover more 
than 90 percent of the world’s population for six of the nine indicators, and more 
than 80 percent for all the indicators. Missing data is not our principal concern here; 
the greater concern is measurement error for the data we do have. 

The data cover the years 1960 to 2005 (the interval is shorter for some variables) 
and are measured in either 5- or 10-year intervals. The trends in gender inequality 
are generally monotonic (see Figures 1 and 2), so to save space our tables present
results only for the first and last year for each indicator. 


Recent Trends in Global Gender Inequality 
Results for Hypothesis 1: Is Global Gender Inequality Declining? 


Figures 1 and 2 present trends in global gender inequality over the past several 
decades for each of our nine welfare indicators (first- and last-year point estimates 
are provided in Table 4). The results in the figures highlight three features of global 
gender inequality. First, global gender inequality has declined for all nine welfare 
indicators (average change per decade in the Gini index ranges from -.001 for life 
expectancy to -.049 for tertiary school enrollment).

Second, gender convergence is occurring much faster in some domains than in
others. The biggest success story for women involves tertiary education where, by
the year 2000, women had eliminated the gap in post-secondary educational
enrollments. While it may seem counterintuitive that inequality in higher education
has declined faster than inequality in primary and secondary enrollments, consider
that, unlike primary and secondary schools, institutes of higher education are
distributed markedly unevenly, with most colleges and universities located
in advanced industrial or post-industrial societies. It is these societies that have 
experienced some of the biggest gains in gender equality in higher education. 

Third, the level of global gender inequality varies dramatically across domains.
Women are the least disadvantaged with respect to longevity (indeed, they hold 
a slight advantage over men in that domain, as is also the case with adult survival 
rates and now with higher education) and are the most disadvantaged with respect 
to representation in national legislatures. Women’s disadvantage in political rep- 
resentation is extreme (Gini = .79 in 1980 and .69 in 2005). To put inequality of 
this size in perspective, the Gini measure of inequality in pretax U.S. household 
income was .39 in 1998 (Jones and Weinberg 2000). For other nations the Gini 
coefficient for income inequality ranges from .25 for Denmark and Sweden to 
.63 for Botswana and .71 for Namibia (United Nations 2005b). Worldwide, then, 
women’s and men’s participation in national legislatures is as unequally distributed 
as is income in the world’s most unequal countries. 

Women are also disadvantaged with respect to economic activity. We estimate 
that the Gini for gender inequality on labor force participation declined from 
about .27 in 1960 to about .18 in 2000. As a point of comparison, the Gini 
coefficient for global income inequality over the same 40-year period was about 
.63 in 1960 and .64 in 2000 (see Figure 2). It is important to stress, however, that 
economic activity captures only part of the difference in women’s and men’s status 
with regard to work because economically active women generally receive lower 
wages and often endure poorer working conditions than do men. Thus the Ginis 
reported here serve as a lower limit on the actual level of global gender inequality 
in the economic domain. 

In short, with respect to political power and economic activity, the disparities 
between women and men worldwide remain sizeable. The good news is that, for 
all nine of our welfare indicators, global gender inequality is declining—and the 
decline is rapid in many cases. In general, school enrollments exhibited the most 
rapid declines in gender inequality. In the last two decades of the 20th century gender
inequality was eliminated for tertiary school enrollments and was reduced by half 
for primary enrollments and by nearly two-thirds for secondary school enrollments. 
Gender inequality in literacy was halved in the final three decades of the past century, 
and gender inequality in average years of completed schooling narrowed by nearly 
30 percent. Finally, gender inequality is low and declining negligibly in the case of 
our two longevity indicators: adult survival rates and life expectancy at birth.⁷

Figure 1. Change in Global Gender Inequality: Five Education Indicators

Note: First- and last-year point estimates are reported in columns 1 and 2 of Table 4.

Figure 2. Change in Global Gender Inequality: Four Other Welfare Indicators

Table 4: Observed Global Gender Inequality and Projected Inequality Under the
Counterfactual Condition of Constant Population Shares

                                                                      Population Shares
                                                   Observed             Held Constant
                                            First Year   Last Year        Last Year
Indicator                                      Gini        Gini             Gini
Literate Adults (1970-2000)                    .112        .056             .044
Primary School Enrollments (1980-2000)         .082        .039             .030
Secondary School Enrollments (1980-2000)       .118        .041             .030
Tertiary School Enrollments (1980-2000)        .085        .012             .029
Years of Schooling Completed (1975-2000)    [illegible]  [illegible]        .103
National Legislators (1980-2005)               .789        .687             NA
Economically Active Workers (1960-2000)        .271      [illegible]        .164
Surviving Adults (1960-2000)                   .061        .042             .045
Expected Years of Life (1960-2000)             .033        .028             .031


Notes: The simulation is a projection forward in time, that is, population shares (column 
3) are held constant at initial levels (1960, 1970, 1975 or 1980, depending on the 
indicator). Our conclusions would be the same if we projected backward in time. Italicized 
values indicate female advantage. 
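
The proportional declines cited in the text for literacy and for primary and secondary enrollments can be checked against the observed first- and last-year Ginis in Table 4. The short sketch below does that arithmetic. It assumes the table entries are read as three-decimal Ginis; the dictionary layout and the function name are our own and are used only for illustration.

```python
# Observed first- and last-year Ginis, read from Table 4.
table4 = {
    "Literate Adults (1970-2000)": (0.112, 0.056),
    "Primary School Enrollments (1980-2000)": (0.082, 0.039),
    "Secondary School Enrollments (1980-2000)": (0.118, 0.041),
}


def proportional_decline(first, last):
    """Share of the first-year Gini that had disappeared by the last year."""
    return (first - last) / first


declines = {name: proportional_decline(*ginis) for name, ginis in table4.items()}
for name, value in declines.items():
    print(f"{name}: {value:.0%}")
```

The results (about one half for literacy and primary enrollments, and just under two-thirds for secondary enrollments) agree with the summary given in the text.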


How Much is the Decline in Global Gender Inequality Being Slowed by Uneven 
Population Growth? 


From the evidence in Table 3, we can infer that uneven population growth across 
the world’s regions is serving to retard the decline in global gender inequality be- 
cause populations are growing faster in regions of greater inequality. It makes sense 
to ask, then, how much more gender inequality would have declined worldwide if 
the population shares of nations had remained constant. This question suggests a 
statistical exercise in which we assume, counterfactually, that population growth
rates were uniform across all countries in recent decades. To be more specific,
we project the level of global gender inequality in the last year, holding country 
population share constant at first-year levels while allowing gender inequality 
within countries to change at the observed rates. The key assumption here is that 
the observed change in Y (gender inequalities within nations) would have been 
the same had nations’ populations grown at the same rate.⁸
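
The logic of this projection can be sketched in a few lines of code. What follows is a minimal illustration, not the procedure actually used to produce Table 4: the two countries, their populations and their literacy-style rates are hypothetical, the function and key names are our own, and the two-group Gini is written as the sum of absolute differences between each sex's share of Y and its share of population, as in the worked example in note 1.

```python
def two_group_gini(y_female, y_male, n_female, n_male):
    """Sum over the two sexes of |share of Y - share of population|."""
    s_female = y_female / (y_female + y_male)
    p_female = n_female / (n_female + n_male)
    return abs(s_female - p_female) + abs((1 - s_female) - (1 - p_female))


def global_gini(countries, weight_year):
    """Last-year global gender Gini, weighting each country by its
    population in `weight_year` ("first" or "last")."""
    y_f = y_m = n_f = n_m = 0.0
    for c in countries:
        pop_f, pop_m = c[f"pop_{weight_year}"]
        rate_f, rate_m = c["rate_last"]  # observed last-year within-country rates
        y_f += pop_f * rate_f
        y_m += pop_m * rate_m
        n_f += pop_f
        n_m += pop_m
    return two_group_gini(y_f, y_m, n_f, n_m)


# Hypothetical data: (female, male) populations and last-year rates.
countries = [
    # low inequality, slow population growth
    {"pop_first": (50, 50), "pop_last": (60, 60), "rate_last": (0.90, 0.95)},
    # high inequality, fast population growth
    {"pop_first": (50, 50), "pop_last": (150, 150), "rate_last": (0.40, 0.80)},
]

observed = global_gini(countries, "last")    # actual last-year population shares
projected = global_gini(countries, "first")  # shares held at first-year levels
print(f"observed: {observed:.3f}, projected: {projected:.3f}")
```

In this toy case the projected Gini is lower than the observed Gini because the high-inequality country no longer gains population share, which is the pattern the projection is designed to detect.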

Table 4 reports the results of the projections. The overarching conclusion is 
that (as expected) faster population growth in high-inequality regions did indeed 
retard the decline in global gender inequality. This result is seen by comparing the 
estimates in the last two columns of Table 4. The next-to-last column reports the 
observed level of gender inequality, and the last column reports the projected level 
under the assumption of uniform population growth for all countries. The results 
are striking: In every instance where women are disadvantaged, the projected
Ginis indicate that female disadvantage would have declined even faster if
populations had grown uniformly across nations. In short, our results strongly confirm
Hypothesis 2: The decline in global gender inequality was slowed substantially by
unevenness in nations’ rates of population growth.


Is the Decline in Global Gender Inequality Being Retarded by Slow and Halting Change
in Some Non-Western Cultural Traditions? 


A number of empirical approaches have been used to capture macro-level variation 
in gender outcomes by cultural or religious traditions (see, for example, Forsythe, 
Korzeniewicz and Durrant 2000; Kenworthy and Malami 1999; Paxton et al.
2006). We use a coding scheme that classifies each country according to majority 
religion. Admittedly, religion is a crude indicator of cultural tradition, and reli- 
gious differences coincide with other historical, political and economic differences. 
But there are few alternatives in a global study, and Inglehart and Baker (2000) 
find broad coincidence between religious and cultural “zones” worldwide. We de- 
viate slightly from more conventional religious coding schemes in that we reduce 
the number of categories to just five (Buddhist, Chinese Universist, Christian, 
Hindu and Muslim) because these religious groupings capture a measurable share 
of the world’s population in each category.⁹ Appendix Table C provides details on
our coding scheme as well as a list of countries assigned to each religious tradition. 
In anticipating that culturally-entrenched barriers to equality in some regions 
of the world would slow the spread of gender equality, we had supposed that
resistance to change would be concentrated in non-Western cultures, with
predominantly Muslim cultures being particularly slow to change (for suggestive evidence,
see Forsythe, Korzeniewicz and Durrant 2000; Kenworthy and Malami 1999;
Paxton et al. 2006). Indeed, we would not have been surprised to find backlash ef- 
fects resulting in the further disadvantaging of women in some cultural traditions. 
Because we reasoned that religious/cultural traditions that resist (or facilitate) 
women’s relative progress in one domain would tend to resist (facilitate) women’s 
progress in other domains, we anticipated a pattern of change that might be highly 
uneven across cultural traditions, yet looked similar across all our indicators. 
Our findings fly in the face of those expectations. As Figure 3 shows, women's 
relative progress was highly uneven across cultural traditions (as expected), yet 
there is no single pattern to the unevenness. In line with the results of Inglehart
and Norris (2003), then, we find “uneven tides” in the relative progress of women 
from region to region, but the unevenness itself is uneven, varying across the 
educational, economic and political domains. In recent decades women’s relative 
advances in school enrollment and in adult literacy have been fastest under Hindu 
and Muslim traditions and slowest in majority Christian countries. Ceiling effects 
no doubt explain why the decline in educational inequality was slower for majority 
Christian regions (i.e., primarily the West); women in the West continue to gain 
faster than men in the educational area (tertiary schooling) where enrollment rates
have not yet approached the ceiling. Women’s representation in national legislatures
is increasing fastest in majority Christian regions and most slowly in Chinese
Universist regions. With respect to labor force participation, women are catching up
with men in four of the five traditions, with the greatest gains in Christian and
Muslim regions and the slowest in Buddhist and Hindu regions.

Notes: Countries were coded to a majority religion using the simple majority religion, where majority represents the religion with the largest number of
within-country adherents. We use the five-category religious coding scheme in order to capture a meaningfully large share of the world’s population in
each category, but admit that the high level of aggregation we employ here masks considerable variation within each category. “Chinese Universist” is the
term used by Barrett, Kurian and Johnson (2001) and refers to the amalgam of Taoist, Confucian and Buddhist religious traditions adhered to by many
Chinese and East Asians. A negative sign indicates that female share declined during the study period.

Sources: See Appendix A for gender inequality data. Religion data are drawn from the Association of Religion Data Archives (2005) estimates of religious adherents.

Looking at Figure 3 as a whole, we conclude that women’s
relative progress generally has been slowest in Buddhist,
not Muslim, societies. While the relatively slower rates of 
progress toward gender parity in economic activity in Asian 
countries may seem perplexing, cross-national research on 
female labor force participation suggests a possible explana- 
tion. In the early stages of industrialization, female labor 
force participation declines, but then begins to rise in the 
later stages of industrialization (Pampel and Tanaka 1986). 
During the study period, 1970-2000, much of Asia was in- 
dustrializing (declining female participation rates), the West 
was transitioning to post-industrialism (rising female rates), 
and the economies of the poorest countries were predomi- 
nantly pre-industrial. This covariance of stage of economic 
development with female labor force participation rates is 
in line with our findings in Figure 3. 

The most significant message in Figure 3, though, is that 
declining gender inequality is a pervasive phenomenon: For 
all our welfare indicators where women are disadvantaged, 
all religious traditions exhibit declining gender inequality in 
some domains; and there are no major regions of the world 
where gender inequality is rising rapidly. Among Christian, 
Hindu, Buddhist and Muslim religious traditions, the fe- 
male share among national legislators increased by more 
than 1.5 percent per year. Among college students, female 
shares increased by more than 2 percent per year for four of 
the five major religious traditions. And in the Muslim world 
the growth rate in the female share of economically active 
workers was greater than in any other region. 


Discussion 


This article documents a decline in global gender inequality 
across nine key welfare indicators. Though uneven, the de- 
cline is noteworthy on several fronts. Gender inequality has 
been one of the most enduring forms of inequality across all societies over the
course of human history, and the gender divide has been, as Epstein (2007) suggests,
one of the deepest and most resistant forms of inequality existing in the world.
Yet the world may be changing. Just over 100 years ago,
women the world over were denied even the basic right of citizenship, the right 
to vote, whereas today women have achieved near universal suffrage (Paxton et al. 
2006; Ramirez et al. 1997). Even in the domain where female disadvantage is the 
most extreme, in national legislatures, inequality has been eroding in recent years. 
Female disadvantage in higher education has been eliminated at the global level, and 
if the observed rates continue with respect to adult literacy and secondary school 
enrollments, the global gender gap will completely close in those domains as well 
in the next two decades. In historical perspective, then, we are witnessing a series of 
remarkable events—global “firsts.” Blau, Brinton and Grusky (2006) have dubbed 
the pervasive decline in gender inequalities in the United States “one of the most 
important egalitarian movements of our time,” and we would extend that conclu- 
sion to the world as a whole, at least for the key welfare indicators we measure here. 

In Destined For Equality, Jackson (1998:19) claims that “a pattern of declining 
gender inequality has appeared in all nations with modern economies and political 
structures. The timing, rate and form of specific changes have varied consider- 
ably, but the fundamental pattern has been similar. This consistency suggests that 
the essential causes of gender inequality decline must be conditions or processes 
intrinsic to the development of modern institutions.” Although we did not report 
the results, we tested for a correlation between economic growth and change 
in gender inequality across countries (weighted by population size) and found 
only weak and inconsistent correlations. This finding is consistent with Chang’s 
(2000:1661) assertion that “insofar as such changes are worldwide in nature and 
diffuse across nations of varied levels of development, the case can surely be made 
that this cultural egalitarianism is not always structural-functional in origin.” The 
results of our analysis, particularly our finding of a pervasive decline in gender
inequality regardless of cultural traditions and economic growth, suggest that the
diffusion of gender equality is not solely a function of socioeconomic change. The
pervasiveness of the global egalitarian trends, combined with the weak correlations
of these trends with economic growth and religious traditions, leads us to conclude
that exogenous change and cultural diffusion models may offer the most promising explanation
for the global patterns we have found here. Exogenous change models might also 
account for some of the “lumpiness” in gender inequality across nations, regions 
and religious traditions, where it has been noted that the diffusion of the world 
polity has itself been rather uneven (Beckfield 2003). 

Of course, the patterns we have found are not all the patterns that exist, and 
it is important to note that trends we found do not necessarily apply either 
to gender policies (Curtin and Devere 2006) or to other important domains 
such as the household division of labor, where significant gender inequalities 
persist (Charles and Grusky 2004). One nonetheless should not downplay the
significance of the gains for women across the range of domains we examined.
On the basis of current trends one can be cautiously optimistic about the future 
of gender equality at the global level. 

Optimism about the future of global gender equality must be cautious for two 
reasons. First is the obvious point that there is no guarantee that current trends will 
continue. Second, gender equality can be seen as a two-step process that can be 
summarized colloquially as “first get in the club, then attain equality within the club.” 
Most of the indicators we examine here focus on attaining membership: enrolling
in school, joining the economically active population, becoming a member of the 
national legislature. Gender parity on these indicators is only part of the story be- 
cause, to cite one example, men and women are entering highly sex segregated labor 
markets, at least in industrialized countries (Charles and Grusky 2004). The same is 
true for the educational domain, where Jacobs (1996) reminds us that educational 
inequalities are multifaceted, occurring not just in access but also in the quality of 
education and in the rewards to educational attainment (see also Bradley 2000). So 
even if women achieve parity with men in terms of enrollment rates, there is still 
the issue of gender parity among those enrolled. That issue alone provides ample 
motivation for further research on trends in global gender inequality. 


Notes 


1. Note that the Gini is zero (signaling perfect equality across groups) when s_j (a group’s
share of Y) equals p_j (a group’s share of population) for each group. The Gini reaches its
maximum value when one group has all the Y (i.e., s_j = 1 for one group and zero for the
other). The maximum for Gini is 1 + d, where d is the difference between the population
shares for the two groups. The maximum is greater than 1.0 when Y is monopolized
by the smaller group and less than 1.0 when Y is monopolized by the larger group.
(Example: If p_j = .49 for men, and men possess all the Y, the Gini [equation 1] is |1 - .49|
+ |.00 - .51| = 1.02. If women, the larger group, possess all the Y, the Gini is .98.) That
presents no problem here because d is virtually zero in our analyses. For all practical
purposes, then, the gender Ginis that we report range from 0 to 1.
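
The arithmetic in the example above can be reproduced directly. The snippet below is a small check under one assumption: the Gini is computed as the sum of absolute differences between each group's share of Y and its share of population, which is the form the example implies. The function name is ours.

```python
from math import isclose


def gini_from_shares(y_shares, pop_shares):
    """Sum of |s_j - p_j| over groups, the form implied by the note's example."""
    return sum(abs(s - p) for s, p in zip(y_shares, pop_shares))


pop_shares = [0.49, 0.51]  # men, women

men_hold_all = gini_from_shares([1.0, 0.0], pop_shares)    # smaller group has all the Y
women_hold_all = gini_from_shares([0.0, 1.0], pop_shares)  # larger group has all the Y
perfect_equality = gini_from_shares(pop_shares, pop_shares)

print(round(men_hold_all, 2), round(women_hold_all, 2), perfect_equality)
```

The three cases return the values stated in the note: 1.02 when the smaller group holds all the Y, .98 when the larger group does, and zero when each group's share of Y equals its share of population.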


2. Because our study is descriptive, we use the term hypothesis to indicate a priori
expectations about what the world looks like. We do not mean to suggest that we are 
testing causal propositions. 


3. A counterpoint to our assertion in Hypothesis 1 that global gender inequality is 
declining across a broad range of indicators is found in this same literature, where 
some research suggests that policies, in fact, have little effect on practices (Hathaway 
2002; Hafner-Burton and Tsutsui 2007). 


4. Some prior studies of gender inequality rely on summary measures of gender inequality 
such as the gender empowerment index or the gender-related development index 
(Apodaca 1998; Dijkstra 2002; Ogwang and Abdou 2003; United Nations 2003), 
but the GEM and GDI have been criticized on theoretical and technical grounds 
(Bardhan and Klasen 1999, 2000; Charmes and Wieringa 2003). In addition, gender 
inequality is multidimensional (Bradley and Khor 1993; Mason 1986), and these 
dimensions do not always move in tandem, nor do the dimensions necessarily respond 
to the same causes. Because composite indexes might mask divergent trends among
their component dimensions, it is important to begin with separate indicators. Finally,
summary indexes such as the GEM place greater demands on the data. Because 
complete, or nearly complete, data for all indicators comprising an index are often 
available only for richer countries, the use of such indexes may severely reduce and 
bias the population coverage. 


5. Note that number of national legislators is not a constant fraction of population size 
because small countries sometimes have more legislators than large countries do. The 
concept of global gender equality remains the same: We say there is global equality 
when the number of female legislators in the world equals the number of male 
legislators in the world. To calculate global gender inequality for national legislators, 
then, we do not weight by country’s population, but simply add up the number of 
women and men legislators. This strategy enables us to remain true to Concept 3 
inequality, whose defining feature (as noted earlier) is that it refers to inequality over
the world treated as a single piece. 


6. While not reported here, we also measured gender inequality in literacy among 15-24 
year olds, in line with the U.N. indicator. As one would expect, the decline in gender 
inequality is more rapid among those in the 15-24 age range than it is among those 
in the 15+ age range we report in this research. We report the more conservative 15+ 
estimates primarily because this age range encompasses a larger share of the world’s 
literate people. 


7. The question of why men have narrowed the gap on survival rates and life expectancy 
is beyond the scope of our study. Perhaps worldwide rates of smoking and smoking- 
related illness and death are rising faster for women than men (see Preston and Wang 
2006). Perhaps also HIV/AIDS has disproportionately affected women. 


8. Simulations of this sort constitute only first order effects because they ignore the 
interaction of change in population with change in gender inequality (White and 
Preston 1996). It is possible that if populations had not grown at a faster rate in the 
countries with high gender inequality, the rate of change in gender inequality would 
have been different. Thus, the results might overstate or understate the growth effect. 
The exercise nonetheless is useful because it does indicate the first order effects, where 
the growth simulation can be seen as an extreme (Heuveline 1999) or upper bound
for change in global gender inequality.


9. For example, only a single country, Israel, has a majority Jewish population. “Chinese
Universist” is a term used by Barrett, Kurian and Johnson (2001) to refer to the 
amalgam of Taoist, Confucian and Buddhist religious traditions adhered to by many 
Chinese and East Asians.

Appendix C. Country Coding of Majority Religious Tradition

Countries By Religious Tradition (population shares in parentheses)

Buddhist (.056): Bhutan, Cambodia, Japan, Myanmar, Sri Lanka, Thailand, Vietnam.

Chinese Universist (.208): China, Singapore.

Christian (.366): Angola, Argentina,
Armenia, Aruba, Australia, Austria, Bahamas, Barbados, Belarus, Belgium, Belize, 
Bolivia, Botswana, Brazil, Bulgaria, Burundi, Cameroon, Canada, Cape Verde, Central 
African Republic, Channel Islands, Chile, Colombia, Congo, Congo (Dem Rep), Costa 
Rica, Croatia, Cuba, Cyprus, Czech Republic, Denmark, Dominican Republic, Ecuador, 
El Salvador, Equatorial Guinea, Estonia, Ethiopia, Fiji, Finland, France, French Guiana, 
French Polynesia, Gabon, Georgia, Germany, Ghana, Greece, Grenada, Guadeloupe, 
Guam, Guatemala, Guyana, Haiti, Honduras, Hungary, Iceland, Ireland, Israel, Italy, 
Jamaica, Kenya, Korea (Rep), Latvia, Lesotho, Lithuania, Luxembourg, Macedonia, 
Madagascar, Malawi, Malta, Martinique, Mexico, Micronesia, Moldova, Namibia, 
Netherlands, Netherlands Antilles, New Caledonia, New Zealand, Nicaragua, Nigeria, 
Norway, Panama, Papua New Guinea, Paraguay, Peru, Philippines, Poland, Portugal, 
Puerto Rico, Reunion, Romania, Russian Federation, Rwanda, Saint Lucia, Saint Vincent 
& Grenadines, Samoa, Sao Tome & Principe, Slovakia, Slovenia, Solomon Islands, South 
Africa, Spain, Suriname, Swaziland, Sweden, Switzerland, Tanzania, Timor-Leste, Togo,
Tonga, Trinidad & Tobago, Uganda, Ukraine, United Kingdom, United States, Uruguay,
Vanuatu, Venezuela, US Virgin Islands, Zambia, Zimbabwe.

Hindu (.175): India, Mauritius, Nepal.

Muslim (.18): Afghanistan, Albania, Algeria, Azerbaijan, Bahrain, Bangladesh,
Bosnia & Herzegovina, Brunei, Burkina Faso, Chad, Comoros, Djibouti, Egypt, Eritrea, 
Gambia, Guinea, Indonesia, Iran, Iraq, Jordan, Kazakhstan, Kuwait, Kyrgyzstan, Lebanon, 
Libya, Malaysia, Maldives, Mali, Mauritania, Morocco, Niger, Oman, Pakistan, Palestine, 
Qatar, Saudi Arabia, Senegal, Sierra Leone, Somalia, Sudan, Syria, Tajikistan, Tunisia, 
Turkey, Turkmenistan, United Arab Emirates, Uzbekistan, Western Sahara, Yemen. 
Notes: The Association of Religion Data Archives provides estimates on the number of 
within country adherents for the following religions: Bahai, Buddhist, Chinese Universist, 
Christian, Confucianist, Ethno-religionist, Hindu, Jain, Jew, Muslim, Shintoist, Sikh, 
Spiritist, Taoist, Zoroastrian. From these religions, countries were coded to a “majority 
religion”, where majority represents the religion with the largest absolute number of 
within-country adherents. This yielded 7 majority religion codes, which we further 
reduced to five by coding Israel, which is a Jewish majority, to Christian due to the 
similarity in Ethno-religious and economic traditions. Finally, a small number of ethno- 
religionist majority countries were removed from the analysis because they comprised 
such a small share of the world’s people. 
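
The coding rule in these notes can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names and the adherent counts in the example are hypothetical and are not drawn from the Association of Religion Data Archives; only the three steps (largest absolute number of adherents, recoding the Jewish-majority case to Christian, dropping ethno-religionist majorities) come from the notes above.

```python
# Illustrative sketch of the majority-religion coding described in the notes.
# All names and numbers here are hypothetical, not the authors' code or data.

RECODE = {"Jew": "Christian"}      # Israel's Jewish majority is coded to Christian
DROPPED = {"Ethno-religionist"}    # these majority countries leave the analysis


def majority_religion(adherents):
    """Return the religion with the largest absolute number of adherents."""
    return max(adherents, key=adherents.get)


def code_country(adherents):
    """Apply the majority rule, then the recode and exclusion steps."""
    label = majority_religion(adherents)
    if label in DROPPED:
        return None  # country removed from the analysis
    return RECODE.get(label, label)


# Made-up adherent counts (in thousands) for three imaginary countries.
example = {
    "Country A": {"Christian": 900, "Muslim": 300, "Hindu": 50},
    "Country B": {"Jew": 500, "Muslim": 150},
    "Country C": {"Ethno-religionist": 700, "Christian": 200},
}
coded = {name: code_country(counts) for name, counts in example.items()}
```

Note that "majority" here means plurality: the largest group need not exceed half of a country's population.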


The Shifting Supply of Men and Women to Occupations: 
Feminization in Veterinary Education 


Anne E. Lincoln, Southern Methodist University 


A confining limitation for the occupational sex segregation literature has been the in- 
ability to determine how many persons of one sex would have entered an occupation 
had the other sex not successfully entered instead. Using panel data from all Ameri- 
can colleges of veterinary medicine (1976-1995), a fixed-effects model with lagged 
independent variables finds support for the concurrent effects of many hypothesized 
feminization mechanisms. Declining relative earnings and policies aimed at increas- 
ing production of graduates affect applications from men and women similarly, but 
feminization is driven by the decline in men’s college graduation and their avoidance 
of fields dominated by women. The findings demonstrate the relative contributions 
and interdependence of supply and demand to occupational sex composition and the 
job search process more broadly. 





Discussions about the dearth of women in science and engineering inevitably 
stoke the larger and perpetually-smoldering social debate concerning the origins 
and remedies of the differential distribution of women and men across occupa- 
tions. Occupational sex segregation is a core issue in the study of gender inequality 
because it accounts for much of the persistent wage gap that exists between women 
and men (Cotter et al. 1997; Roos and Gatta 1999). A burgeoning literature 
documents the contributory aspects of the phenomenon, typically conceptualized 
in terms of supply (workers’ characteristics) and demand (the characteristics of em- 
ployers, workplaces and jobs). These complementary literatures have contributed 
substantially to our understanding of the forces that shape the sex composition 
of occupations; however, data limitations have restricted integration of the two 
components primarily to theoretical grounds. As a result, the important question of 
how much employer discrimination contributes to an occupation’s sex composition 
and how much stems from employees’ decisions remains unresolved. Expectations 
for the supply-demand point of conjunction have been gleaned from workers’ job 
search strategies (Drentea 1998; Granovetter 1985), and otherwise inferred from 
the supply-side evidence for individual preferences, aspirations and attributes (Shu 
and Marini 1998), discriminatory employer recruitment, hiring and promotion 
practices on the demand side (Kmec 2005; Reskin and McBrier 2000), and the 
sex-segregated employment outcomes evinced by labor statistics (e.g., Reskin and 


I thank the Association of American Veterinary Medical Colleges and the Washington State Univer- 
sity School of Veterinary Medicine for making the data available. Special thanks go to Michael P. 
Allen. Appreciation goes to Julie Kmec and Tom Rotolo. Direct correspondence to Anne E. Lincoln, 
Department of Sociology, Southern Methodist University, P.O. Box 750192, Dallas, TX 75275- 
0192. E-mail: lincoln@smu.edu. 
Roos 1990). However, the key linkage of supply-side factors to occupational sex 
segregation remains open to empirical verification. As Coventry (1999) points out, 
the degree to which women and men actually furnish themselves—both successfully 
and unsuccessfully—to different occupations remains largely unknown. 
Researchers are working to bridge this gap. The sex composition of the occupa- 
tions to which youths aspire is predictive of the sex composition of their actual jobs 
14 years later (Okamoto and England 1999), but the substantial disjuncture be- 
tween specific occupational aspirations and the actual occupation entered (Jacobs 
1989) indicates the presence of additional intermediary sorting processes. To that 
end, recent experimental research testing the mechanisms of the career choice 
process demonstrates that cultural influences produce gendered self-assessments 
of ability (Correll 2004), which in turn contribute to the selection of gendered 
educational tracks by high school and post-secondary students (Correll 2001). 
Other research suggests that men avoid college majors and graduate academic 
fields that are 24 to 54 percent female (England and Li 2006; England et al. 2007). 
Insofar as gendered educational trajectories are a factor in the sex composition of 
skilled labor pools (Granovetter and Tilly 1988), these findings clarify the way 
that macro-level belief structures about gender can produce what are ostensibly 
preferences and prompt the differential accumulation of career-relevant human 
capital by boys and girls. However, the portion of occupational sex segregation 
explained by aspirations and educational specialization relative to demand-side 
forces remains unresolved without the crucial information about how individuals 
then act on these accrued characteristics in the context of employment seeking. 
This article presents one of the first efforts to empirically disentangle the con- 
tribution of multiple supply-side factors relative to demand-side factors at a point 
where individuals can act on their preferences and accumulated human capital. 
Unique organization-level data linked with state-level public data sources illumi- 
nate the gendered sorting process at the entry point of the requisite professional 
education track of one scientific field, veterinary medicine. Longitudinal data 
on the applicant pool to all American veterinary medical colleges demonstrates 
decline and growth in men’s and women’s attempts to enter the profession dur- 
ing a period of substantial compositional change, feminization. Compositional 
change emphasizes the gendered nature of the assortative matching process that 
can be otherwise obscured in occupations that exhibit more stable sex composi- 
tions. Such transformations therefore present the opportunity to elucidate the 
concomitant factors that contribute to both fluctuation and, by inference, stabil- 
ity in the sex composition of occupations and inform the worker-job matching 
process more generally. These data permit tests of four hypotheses advanced in 
the sex segregation and feminization literatures regarding the concurrent effects 
and relative strengths of occupational expansion, male flight, declining earnings 
and occupational competition on the supply of aspiring veterinarians over the 
course of two decades. 




The difficulty in obtaining suitable data must be mentioned. In addition to the 
rarity of this kind of data, the structure of veterinary medical graduate education 
presents an unusual opportunity to test previously hypothesized feminization 
mechanisms. Like several firm-level case studies of hiring (Fernandez, Castilla and 
Moore 2000; Fernandez and Sosa 2005; Fernandez and Weinberg 1997; Petersen, 
Saporta and Seidel 2000), the present analysis exchanges breadth for depth by 
examining compositional change in the professional education track of this oc- 
cupation. The limitations to generalizability posed by case studies should not 
dissuade researchers from using such data to delve deeply into specific cases to test 
conjectured relationships, refine or qualify theory, and put forth new hypotheses 
on the basis of the findings. 


Stability and Change in Occupational Sex Composition 


Meritocratic assumptions figure prominently in supply-side accounts of labor mar- 
ket disparities and occupational sex composition. However, sociological research 
demonstrates the power that self-perceptions of merit have to influence career- 
relevant decisions (Correll 2001; Fiorentine 1987),¹ and that these self-perceptions 
can be manipulated when cultural beliefs about differences in ability between the 
sexes are invoked (Correll 2004). Consequently, to more fully understand supply- 
side contributions to occupational sex composition, it is imperative to determine to 
what extent women and men act on their accumulated socialization, abilities and at- 
tributes, and what factors influence women and men in this process (Marini 1989).² 

Some recent accounts of occupational sex segregation emphasize the work- 
family nexus as the primary contributor to an occupation’s sex composition, over 
conventional explanations of socialization and discrimination (Polachek 2006). A 
derivative of human capital theory in economics and the status attainment tradi- 
tion in sociology, these explanations maintain that individuals engage in personal 
cost-benefit analyses when planning their eventual entry into the labor market. 
More specifically, women’s lower level of labor force participation relative to men’s is 
argued to stem from differential commitment planned in order to accommodate 
childbearing and family responsibilities. Accordingly, women and men differen- 
tially invest in education or on-the-job-training (Becker 1985), though gender 
differences in average lifetime labor force commitments are converging (Polachek 
2006). Consequently, occupations that experience compositional change are of 
great interest to sex segregation researchers. 

Several theorists have sought to knit together supply and demand to explain 
an occupation's sex composition. Building on Thurow (1975), relative attractive- 
ness theory (Strober and Arnold 1987) and gender queuing theory (Reskin and 
Roos 1990) are complementary explanations for the interaction between supply 
and demand in occupations that experience compositional transition. Generally 
in these perspectives, both workers and employers make comparisons of each 
other in terms of their comparative appeal. On the demand side, employers may 
prefer male employees if they believe that men will be more productive or provide 
other advantages over women (Correll, Benard and Paik 2007), such as meeting 
customers’ expectations of male employees. Employers have also justified male 
incumbents in jobs that require lifting despite the ease with which the skill can be 
tested and even when other predominantly-female jobs in the same firm require 
the same physical abilities (Reskin and Padavic 1988). Net of the preferences that 
are enacted in employee recruitment, promotion and hiring practices, employ- 
ers are constrained by the supply of potential employees (Reskin, McBrier and 
Kmec 1999). Consequently, on the supply side of the process, it is argued that 
prospective employees weight occupations on the basis of their wages, flexibility, 
promotion opportunities and prestige relative to other occupations with similar 
worker human capital characteristics and requirements. 

The increasing workforce participation of women over the past four decades has 
not resulted in their equal distribution across the occupational structure. While women 
have moved into some occupations in great numbers, they remain seriously underrepresented 
in others (Weeden 2004). At the extreme, some occupations that were 
once male enclaves have become female-dominated. Precisely because predominantly-female 
jobs historically have had fewer appealing attributes than male-dominated 
jobs, there ostensibly has been less incentive for men to move into female-dominated 
occupations. Accordingly, when an occupation experiences compositional change, it 
is more likely to feminize than masculinize (Reskin and Hartmann 1986). 

While the term evokes images of a growing female presence in an occupation, 
feminization often entails a change in male presence. In some instances, a scarcity 
of male workers may prompt employers to recruit women, as the automobile and 
electrical manufacturing industries did during World War II (Milkman 1987). In 
other cases, demand prompts occupational expansion, in which case the number 
of male workers is retained or even increased while women are added, as in many 
health professions in the 1970s (Phipps 1990). At the other end of the spectrum 
is male abandonment. The general finding of Reskin and Roos (1990) is that male 
exodus prompted, then accelerated, compositional turnover in some occupations 
that were feminizing in the 1970s. However, support is not ubiquitous across 
occupations for the notion of “male flight” (Wright and Jacobs 1994). 

Though men do not always flee feminizing occupations, they may be deterred 
from entering those occupations. The feminization literature suggests that any 
combination of factors (declines in occupational prestige, employment security, 
promotion prospects and real earnings, as well as deskilling) prompts men to revise 
their career plans. This body of research suggests a series of hypotheses relevant to 
the pattern by which men and women supply themselves to an occupation. 


Occupational Expansion 


Feminization research often cites an increase in demand for a good or service (e.g., 
Roos and Manley 1996). Though demand may be compounded by a shortage of 
qualified workers, an occupation may respond to growing demand by increasing the 
number of workers in the occupation (Milkman 1987; Reskin and Roos 1990). By 
itself, there is no reason to believe that occupational expansion attracts only one sex. 


H1: If the demand for labor in an occupation grows, more 
prospective workers will attempt to enter the occupation. 


Preemptive Flight 


The exodus of entrenched incumbents often occurs when members of a lower- 
status group begin to enter (but see Coventry 1999 and Wright and Jacobs 
1994). Less is known about the extent to which potential entrants are dissuaded 
from entering by the presence of a lower-status group. Men avoid undergradu- 
ate majors and doctoral degree programs as the proportion of women in those 
fields increases (England and Li 2006; England et al. 2007). The devaluation of 
women’s labor may stigmatize occupations with higher proportions of women in 
them (England, Hermsen and Cotter 2000), such that jobs performed largely by 
women pay less than comparable jobs done by men (England 1992), and men’s 
wages within an occupation are lower the more heavily female-dominated it is. 
Men may prefer to work with other men, perhaps to preserve their masculinity 
(Williams 1989), although this may apply more to manual occupations (Reskin 
and Roos 1990). Finally, women may be perceived as less competent than men 
(Goldin 2006). Consequently, the presence of women may signal to prospective 
male entrants that a field has undesirable remunerative, promotional, or prestige 
characteristics (Reskin and Hartmann 1986; Strober 1984). 


H2: If the proportion of women in an occupation grows, 
fewer men will attempt to enter it. 


Conversely, the presence of women in an occupation may prompt other wom- 
en to enter. Most social contacts are between people who are similar (McPherson, 
Popielarz and Drobnic 1992). Thus, information networks are, in part, sex-based, 
and women will tend to distribute information about employment or education 
opportunities to other women (Drentea 1998; Granovetter 1973, 1985). In ad- 
dition, the presence of women can demonstrate to other women that entry is an 
attainable goal. These women may serve as mentors and role models (Ely 1994), 
as well as influence internal processes to allow subsequent women to be accepted 
and promoted (Cohen, Broschak and Haveman 1998; Fernandez and Sosa 2005). 


H3: If the proportion of women in an occupation grows, 
more women will attempt to enter it. 
Stagnant Wages 


An occupation in which earnings decline may fail to draw or retain enough men to 
outpace feminization (Reskin and Roos 1990). Wage stagnation may occur prior to 
women’s entry due to technological developments that reduce or change the requisite 
skills, or the course of feminization itself may depress salaries if women command 
lower wages than men in the occupation (Pfeffer and Davis-Blake 1987). The cross- 
sectional nature of much labor data makes this assertion difficult to test empirically 
(Catanzarite 2003), though some research is beginning to address the problem (see 
England, Allison and Wu 2006). Regardless, any decline in earnings should make 
the field less appealing to men who are looking to maximize their earnings. 


H4: If the wages of an occupation decline over time relative 
to the wages of the male labor force, fewer men will 
attempt to enter it. 


Women value income to the same degree as men (Jencks, Perman and Rainwater 
1988), so women should be motivated to move into a male-dominated occupation 
that is feminizing because it will be more lucrative than the female-dominated 
occupations that have previously been available to them (Reskin and Roos 
1990). For a time, then, the wages in a feminizing occupation should still be an 
improvement for women compared to other occupations. 


H5: If the wages of an occupation decline over time relative 
to the wages of the female labor force, fewer women 
will attempt to enter it. 


Occupational Jostling 


Occupations compete within a labor market for qualified workers. When occupa- 
tions jostle for workers, the socio-demographic compositions of each field are relative 
to each other such that declines in one correspond to increases in another (Rotolo 
and McPherson 2001). This is a more specific version of the stagnant wages thesis. 

Aspirants may compare occupations that have similar skills requirements and 
attempt to enter them according to their relative rankings in their particular 
queue. Some men reserved pharmaceutical careers as a contingency if they were 
not admitted to medical or dental school (Phipps 1990). 

Prestige, a component of an occupation’s queue ranking, is thought to be a factor 
in the jostling process (Reskin and Roos 1990). A substantial literature documents 
historical stability in relative occupational prestige rankings. Therefore, when men 
compare similar occupations, both their wages and prestige are important: 


H6: If the wages of occupations with similar specific human 
capital requirements and prestige diverge over time, 
fewer men will attempt to enter the lower-paid, less- 
prestigious occupation. 


In contrast, although women and men differ little in the extent to which they 
value an occupation’s income (Jencks, Perman and Rainwater 1988), a male- 
dominated occupation that is feminizing initially should be appealing to women 
because the occupations that have historically been available to women have not 
been as lucrative and because women are less likely to expect to be the primary 
breadwinner for their families (Polachek 2006). However, over time: 


H7: If the wages of occupations with similar specific human 
capital requirements and prestige diverge over time, 
fewer women will attempt to enter the lower-paid, less- 
prestigious occupation. 


Hypotheses 6 and 7 apply not only to an occupation in which wages are higher than 
the declining occupation in question, but also to one in which wages are lower. In 
longitudinal studies, the wage trajectories can diverge, converge or remain in parallel. 

Research on occupational sex composition and change generally has relied 
upon descriptive statistics to identify changing sex ratios, historical turning points, 
prestige changes and earnings ratios that influenced the entry of women or result- 
ed in the flight of men. However, the selection bias inherent in labor statistics—of 
employed persons only—prejudices assessments of the gendered sorting processes 
that match workers to employment. To effectively address the effects of supply and 
demand in the job search process, research must also include failed employment 
attempts. Rarely, researchers have been able to examine job applicants. These 
single-employer studies shed much-needed light on the process by which employ- 
ers make hiring decisions and match candidates to jobs, but cannot address the sex 
composition of the applicant pool nor explain gender differences in the decision 
to apply (Fernandez, Castilla and Moore 2000; Fernandez and Weinberg 1997; 
Petersen, Saporta and Seidel 2000). 

Recognizing this problem, England and Li (2006) examined the choices of 
college undergraduate students because degree majors are generally open to any 
aspirant. In support of devaluation theory, they found a curvilinear relationship 
between women’s enrollment in certain fields of study and men’s subsequent 
enrollment in those majors, but found no support for the queuing theory predic- 
tion that salaries positively affect male entry. These studies provide important 
information about the supply side of the feminization process, but are limited 
in the number of processes they examine. Finally, Fernandez and Sosa (2005) 
examine a series of supply and demand-side processes on the applicant pool for 
the female-dominated customer service representative job at a financial service 
institution. This important study is the first, to my knowledge, to empirically 
examine pre-hire mechanisms, many of which never go beyond the theoretical due 
to the extreme difficulty in obtaining data of this nature. They find that gendered 
social networks play a role in the sex composition of applicant pools, that female 
incumbents are more likely to refer an applicant than men, and both sexes are 
more likely to make a same-sex referral. Using U.S. Census data on the local labor 
pool, they also infer that the relatively low advertised starting salary attracted more 
female applicants, who comprise a larger segment of the local low-wage distribu- 
tion. Their study is a deep examination of one firm's applicants and its screening 
practices, providing valuable insight with obvious limitations. In this case, they 
cannot test the concurrent influence of some feminization mechanisms on the 
applicant pool directly, including the expansion of the job (adding employees), 
its wages or male avoidance. 


The Feminization of Veterinary Medicine 


Once the country’s most male-dominated profession (Weeden 2004), veterinary 
medicine has shifted from 98 percent male in 1960 (U.S. Bureau of the Census 
1960), to 50.9 percent female today (American Veterinary Medical Association 
2010). As professions tend to retain incumbents for life due to investments in 
training (Reskin and Roos 1990), veterinary medicine has integrated directly 
from the increase in female practitioners (Verdon 1997), which stems from gender 
changes in the veterinary colleges’ enrollment and graduation (Figure 1). Since 
applicant record-keeping began in 1976, a decline in men’s enrollment followed 
a decline in men’s applications, which occurred despite Congressional initiatives 
to combat predicted shortages of health care professionals, including veterinarians. 
The application structure of veterinary education is particularly favorable for 
an analysis of feminization. Entry into the veterinary programs is unbounded by 
undergraduate prerequisites.³ Consequently, applicants are not prohibited by late 
entry into the science “pipeline” (Xie and Shauman 2003). However, unlike many 
other professional programs, aspirants are restricted by state of residence. As state- 
supported institutions, veterinary colleges have historically been state institutions 
first and professional institutions second. Most colleges give preference to state 
residents, and many do not admit any nonresidents at all (American Association 
of Veterinary Medical Colleges 1975-1995). Some have exclusive agreements to 
consider applicants from specific states that do not have a veterinary college. Thus, 
veterinary applicants historically have been limited to a single institution, either 
in their own state or in the one state with which their state contracts (Kavanaugh 
1975). Ultimately, most students who apply to the veterinary colleges are residents 
of the state in which the college is located (Pritchard 1989). For comparison, in 
2006, there were 125 accredited medical schools in 45 states and more than three- 
quarters of American medical programs enrolled more nonresidents than residents 
in first-year classes (AAMC 2006). 
Figure 1A-B. Sex Composition of First-Year Enrollment and Graduates of All American 
Colleges of Veterinary Medicine, 1960-2000 
Data and Methods 


The primary data source is the Comparative Data Report, the AAVMC’s annual, 
confidential survey of all American veterinary medical colleges. State-level data 
on the profession comes from the Veterinary Demographic Data Reports, an in- 
termittent publication of the AVMA, and the Journal of the American Veterinary 
Medical Association. Other state-level information comes from the decennial 
U.S. Census, the Current Population Survey, the American Medical Association, 
the AAVMC, the National Center for Education Statistics, the 1 and 5 percent 
samples from the Integrated Public Use Microdata Series at the University of 
Minnesota (Ruggles et al. 1997), and the Current Population Survey (King, 
Ruggles and Sobek 2003). 

Table 1 presents descriptive statistics for all variables, and Table 2 reports 
correlations. The unit of analysis is the college year. The data are unbalanced 
panel data, as the same colleges report each year, but there are varying numbers 
of observations over time for each college. Since 1975, the first year observed 
in this study, nine additional American veterinary medical colleges have opened, 
bringing the total to 27. 

Independent analyses are conducted on the two dependent variables: the number 
of qualified male and female applicants for the Doctor of Veterinary Medicine 
degree to each of the American veterinary medical colleges annually from 1975 
through 1995. A relative measure, the proportion of women in the colleges’ ap- 
plicant pools, is inappropriate because it is not independent of male applicants. 

The colleges define qualified applicants as those who meet the minimum stan- 
dards for application, including grades and residency. Unqualified applicants are 
not reported. The 1976-1977 academic year, the first year the veterinary programs 
reported the number of qualified applicants, is the first year of the analysis. The 
study ends in 1995 because the AAVMC implemented substantially different 
application procedures in 1996, including a centralized service that reported all 
applicants regardless of academic qualifications or residency. 

State- and college-level variables control for the annual tuition, number of 
bachelor’s degree recipients, and whether the veterinary college accepts appli- 
cants from another state. State-level data on the sex composition of bachelor’s 
degree recipients has only been collected sporadically and is insufficient for 
analysis (Snyder 2004). To distinguish between change over time and variation 
between the colleges, the analysis controls for year, ensuring that the college and 
state coefficients are year-specific and demonstrate deviations from the annual 
mean. Other variables, including state population and level of urbanization were 
highly correlated with bachelor’s degrees and did not contribute anything to the 
analysis. The independent variables are adjusted by one year to address timing 
differences between date of application and the college attributes and market 
conditions of the previous year.⁴ 
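
As a rough illustration of the one-year adjustment, the sketch below pairs each college-year's applicant count with the same college's covariates from the preceding year. It is a minimal sketch under assumptions: the field names (`college`, `year`, `applicants`, `tuition`) and the example values are hypothetical, and the direction of the shift (covariates lagged rather than applicants led) is one reading of the text, not a confirmed description of the original procedure.

```python
# Illustrative one-year lag for an unbalanced college-year panel.
# Field names and values are hypothetical; this is not the original code.

def lag_one_year(rows, variables):
    """Attach each covariate's prior-year value to the current college-year.

    College-years with no observation for the preceding year are dropped,
    which is how gaps in an unbalanced panel are handled in this sketch.
    """
    by_key = {(r["college"], r["year"]): r for r in rows}
    lagged = []
    for r in rows:
        prev = by_key.get((r["college"], r["year"] - 1))
        if prev is None:
            continue  # no previous-year record for this college
        out = {"college": r["college"], "year": r["year"],
               "applicants": r["applicants"]}
        for v in variables:
            out[v + "_lag1"] = prev[v]
        lagged.append(out)
    return lagged


panel = [
    {"college": "A", "year": 1976, "applicants": 400, "tuition": 1.4},
    {"college": "A", "year": 1977, "applicants": 380, "tuition": 1.6},
    {"college": "B", "year": 1977, "applicants": 150, "tuition": 2.0},
]
lagged_panel = lag_one_year(panel, ["tuition"])
```

In this example only college A's 1977 record survives, carrying the 1976 tuition value; college B has no 1976 observation and so contributes nothing until a second year is observed.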
Table 1: Descriptive Statistics, 1976-1995 

Variable                     Description                             Mean (SD) 

Dependent 
Male applicants              Number of qualified male applicants,    387.26 (148.56); 141.33 (41.49) 
                             adjusted one year 
Female applicants            Number of qualified female applicants,  185.53 (105.75); 321.09 (123.11) 
                             adjusted one year 

Independent and Control 
Tuition                      One year of veterinary tuition (1,000s) 1.42 (.90); 9.20 (40) 
B.A. degrees                 Number of bachelor’s degree recipients  28.70 (20.25); 36.19 (24.71) 
                             in a state year (1,000s) 
Nonresident contract         College has formal contract to accept   Accepts another state’s students = 1 
                             another state’s qualified residents 
Class size                   Number of seats for incoming class      89.37 (25.28) 
Female students              Proportion of women students enrolled   .14 (.06) 
Female faculty               Proportion of women veterinary faculty  .04 (.03) 
Female veterinarians         Proportion of women veterinary          [illegible] (.05) 
                             practitioners 
Male professional-           Ratio of a state’s male labor force     .80 (.12) 
veterinarian earnings ratio  earnings and mean veterinary earnings 
                             in a state 
Female professional-         Ratio of a state’s female labor force   .37 (.05); .44 (.06) 
veterinarian earnings ratio  earnings to mean veterinary earnings 
                             in a state 
Physician-veterinarian       Ratio of a state’s mean physician to    1.66 (.26); [illegible] (.36) 
earnings ratio               veterinarian earnings 
Lawyer-veterinarian          Ratio of a state’s mean lawyer and      1.10 (.21); 1.45 (.22) 
earnings ratio               veterinarian earnings 
Pharmacist-veterinarian      Ratio of a state’s mean pharmacist and  .83 (.14); .90 (.13) 
earnings ratio               veterinarian earnings 
Optometrist-veterinarian     Ratio of a state’s mean optometrist and 1.11 (.16); 1.23 (.25) 
earnings ratio               veterinarian earnings 
Female physicians            Proportion of female physicians in a    [illegible] (.02); .22 (.03) 
                             state 
Female pharmacists           Proportion of female pharmacists in a   .21 (.05); [illegible] (.04) 
                             state 
Female lawyers               Proportion of female lawyers in a state [illegible] (.02); .24 (.03) 
Female optometrists          Proportion of female optometrists in a  .08 (.05); .18 (.08) 
                             state 
Year                         1975 = 1 

Note: Numbers in parentheses are standard deviations. 
Occupational Expansion: The number of positions available in the entering class. 

Preemptive Flight: The proportion of female veterinarians in the state and 
the annual proportion of women enrolled and on the faculty in each college’s 
veterinary program. 

Stagnant Wages: Mean male and female professional/managerial labor force 
earnings in a state relative to mean veterinary earnings in that state. 

Occupational Jostling: Veterinarians historically have compared themselves to 
physicians and chafed at not being accorded the prestige afforded their counter- 
parts in human medicine (e.g., Koltveit 1972). That the basic graduate courses for 
veterinarians and physicians traditionally have been combined at some universi- 
ties may have contributed to this sense of inequity (Redisch 1971). Perhaps in 
response, the AAVMC prints a comparison of application and enrollment rates 
at the medical and veterinary colleges in its annual Veterinary Medical School 
Admission Requirements and emphasizes the fact that veterinary colleges are far 
more selective than medical programs (AAVMC 1986). 

Medicine has historically been more prestigious and lucrative than veterinary 
medicine (Blau and Duncan 1967; Nakao and Treas 1994). How do other occupa- 
tions jostle with veterinary medicine when they differ in earnings and prestige? 
Law, pharmacy, optometry, medicine and veterinary medicine all score similarly 
on scales of socio-economic skill, indicating that they all require a similar type 
and level of human capital, despite considerable variation in prestige and earnings.⁵ 
The annual CPS does not contain enough physicians, pharmacists, lawyers, 
optometrists or veterinarians in each state to compute each profession’s mean 
annual earnings; Stata’s ipolate command was therefore used to calculate values 
for each state for the years between decennial censuses. Interpolation, a single 
imputation procedure appropriate for use with panel data, linearly calculates 
missing values with respect to time based on non-missing values in the dataset 
(StataCorp 2003). Sex differences in wages cannot be distinguished for these 
occupations. Individually, the income variables are highly correlated with each 
other and the year; using wage ratios circumvents this problem. 
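
The interpolation and ratio steps described above can be sketched in a few lines. This is only an illustration: it uses NumPy's `np.interp` as a stand-in for Stata's ipolate, the function name `interpolate_between_censuses` is hypothetical, and the earnings figures are invented for the example rather than taken from the census data.

```python
import numpy as np


def interpolate_between_censuses(census_years, census_values, target_years):
    """Linearly interpolate values for years that fall between census observations."""
    return np.interp(target_years, census_years, census_values)


# Hypothetical decennial mean earnings for one state (illustrative numbers only).
census_years = [1970, 1980, 1990, 2000]
vet_earnings = [20000.0, 30000.0, 50000.0, 70000.0]
physician_earnings = [35000.0, 60000.0, 90000.0, 120000.0]

years = np.arange(1970, 2001)
vet = interpolate_between_censuses(census_years, vet_earnings, years)
phys = interpolate_between_censuses(census_years, physician_earnings, years)

# The earnings ratio, rather than the raw income series, enters the models.
ratio = phys / vet
```

Because both series trend upward with the year, the raw incomes move together; the ratio isolates how one profession's earnings change relative to veterinary earnings.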


Method 


A pooled time series of cross-sections is robust to the issue of unequal periods 
between the time-series data. The data are in the form of $x_{it}$, for which $x_{it}$ is a 
vector of observations for unit $i$ and time $t$. The analyses are conducted with a 
fixed-effects model using generalized least-squares estimation procedures. Cross- 
sectional time-series data are prone to autocorrelation, in which there is serial 
correlation in the errors of prediction for each observation over time. The generalized 
least-squares technique corrects for autocorrelation in time-series data 
and produces unbiased estimates of the standard errors. The fixed-effects model 
is preferred to the random-effects model, though the fixed-effects model discards 
information about the stable over-time variation between the colleges (Green 
1990). A Hausman test verifies that a fixed-effects model is more appropriate 
than a random effects model for both men (chi-square = 102.47, P < .0000) and 
women (chi-square = 79.76, P < .0000). 

The general equation for the fixed-effects model is written as: 


$$y_{it} = \alpha_0 + \sum \beta x_{it} + \alpha_i + \varepsilon_{it}$$


in which $i = 1, \ldots, N$; $t = 1, \ldots, T_i$; $E[\varepsilon_{it}] = 0$; and $\mathrm{Var}[\varepsilon_{it}] = \sigma_\varepsilon^2$. $t$ signifies the year of 
observation and $T_i$ marks the number of observations for a given veterinary medical 
college, $i$. Standardized coefficients permit direct comparisons of the effect size 
between the feminization mechanisms. 
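
A minimal sketch may help show what the college-specific term in a fixed-effects model does. The code below is an assumption-laden illustration, not the estimation used in the article: it applies the within transformation (demeaning each college's observations) followed by ordinary least squares, and it omits the generalized least-squares correction for autocorrelation. The function name `within_estimator` and all data are hypothetical.

```python
import numpy as np


def within_estimator(y, X, groups):
    """Estimate slopes after subtracting each group's mean from y and X."""
    y = np.asarray(y, dtype=float)
    X = np.asarray(X, dtype=float)
    groups = np.asarray(groups)
    y_dm = y.copy()
    X_dm = X.copy()
    for g in np.unique(groups):
        mask = groups == g
        # Demeaning removes anything constant within a college over time.
        y_dm[mask] -= y[mask].mean()
        X_dm[mask] -= X[mask].mean(axis=0)
    beta, *_ = np.linalg.lstsq(X_dm, y_dm, rcond=None)
    return beta


# Synthetic panel: 3 colleges observed for 4 years each (invented numbers).
groups = np.repeat([0, 1, 2], 4)
x1 = np.array([1, 2, 3, 4, 2, 4, 6, 9, 1, 3, 4, 8], dtype=float)
x2 = np.array([5, 3, 6, 2, 1, 7, 2, 8, 4, 4, 9, 1], dtype=float)
college_effect = np.array([10.0, 50.0, 90.0])[groups]
y = college_effect + 2.0 * x1 - 0.5 * x2

beta = within_estimator(y, np.column_stack([x1, x2]), groups)
```

The college effects differ widely in this synthetic panel, yet the slopes are recovered because demeaning discards all stable between-college variation, which is the trade-off the text notes.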


Results 


Both the medical and veterinary medical professions more than doubled in size 
between 1970 and 2000. During this same period, the proportion of women 
practicing professionally as veterinarians increased by a factor of eight to 39.5 
percent, while female physicians increased to 26.8 percent from 7 percent (U.S. 
Census Bureau 2000). This is partly because medical school enrollment has only 
recently reached rough gender parity (AAMC 2007), whereas veterinary medicine 
enrollment sex-integrated two decades earlier. The differences prompt investiga- 
tion into the social and economic changes that caused the earlier transposition in 
the veterinary college applicant pool (Figure 2). 

Separate regression analyses were conducted for the annual number of male 
and female applicants to the colleges (Table 3). Model 1 demonstrates baseline 
state-level and organizational effects. Increasing tuition costs reduce applicants to 
the programs. Reflecting the increasing gender disparity among college graduates, 
the effect of the bachelor’s degree variable on male applicants is nearly three times 
that of the effect for women. Colleges that accept nonresidents do not receive 
significantly more applicants. 

Models 2 through 6 test the hypothesized feminization mechanisms concur- 
rently in several different permutations. 

Occupational Expansion: As anticipated, the increasing class sizes prompted by 
Congress significantly spurred the number of applications to the colleges from 
women and men, except for men in Model 4. 

Preemptive Flight: As women’s enrollment in the veterinary programs grew, 
fewer men applied to the veterinary programs, but the increasing presence of 
women on the faculty and as practitioners was immaterial. There is no effect of 
female enrollment, faculty or practitioners on the number of female applicants. 

Following England and Li (2006), Model 2a for men includes a squared term 
for women’s enrollment, because it was significant in Model 2, to measure whether 
men are linearly discouraged from applying by female students. The squared 
term is positive and significant, indicating that applications from men are not 

Figure 2. Sex Composition of Applicant Pool to All American Colleges of Veterinary 
Medicine and Bachelor’s Degree Recipients, 1975-1999 

Figure 3. Ratio of Four Professions’ Earnings to Veterinary Earnings, 1970-2000 

of all four comparison occupations during the period under study (Figure 3). 
However, only the growing wage gap for medicine and law significantly reduced 
the number of applicants to the colleges. The overarching pattern that emerges 
from the four models suggests, first, that men and women are similarly sensitive to the 
relative differences in wages between occupations and, second, that prestige seems 
to be less important than relative wage trajectories among occupations with similar 
human capital requirements. 

A variation of the male flight thesis argues that within an occupation, men 
monopolize the best opportunities in the higher-paying, more desirable jobs. 
Unfortunately, it is impossible to ascertain from the data sources whether a 
veterinary applicant relocated to a state that has a strong college. Unlike colleges 
of medicine and law, there is no clear prestige hierarchy among the veterinary 
colleges during the period under study according to the author’s reading of their 
journals. One measure, the US News & World Report rankings, did not begin 
ranking the veterinary colleges until 1997, and not all the colleges participated. 
To the extent that a top 5 or top 10 placement in 1997 is at all retrospectively 
accurate, adding these variables to Model 2 finds that rank is not significant for men 
or women.⁷ Absent other empirical evidence, there is no basis to conclude that 
cross-state relocation was an important influence on applicants given that neither 
formal contracts to accept nonresidents nor the program ranking have significant 
effects on applicants to the colleges during this time period. 


Discussion 


Although conclusions about individual-level decisions cannot be drawn from 
organization-level data (Reskin 2003), the effects of the feminization mechanisms 
on the veterinary applicant pool over 21 years demonstrate a great deal of 
consistency with the expectations generated by the feminization literature. In terms 
of demand for trained workers, the Congressionally-mandated expansion of the 
veterinary colleges to avert practitioner shortfall spurred applications, while 
relative wage stagnation depressed applications. But, consistent with the thesis that 
professions are “relatively attractive,” applications to the veterinary colleges waned 
relative only to fields with the largest wage gaps and highest prestige. Finally, 
as women’s veterinary enrollment surged, fewer men applied to the colleges. The 
analysis suggests that men avoided feminizing occupations like pharmacy on the 
basis of both women’s presence in pharmacy and its lower wages. However, the 
slightly higher wages and prestige of optometry, the least feminized of the four 
comparison occupations, were not sufficient to overcome interest in veterinary 
medicine the way that the wages/prestige of medicine and law were. 

Beyond the empirical confirmation of many hypothesized feminization mechanisms, 
the analysis demonstrates that feminization mechanisms can function simultaneously 
and specifies the manner in which they do so relative to each other for men and 
women. In particular, these findings 
as much influence over behavior as economic factors. Of key import, the results 
reveal substantial similarities between the applicant behavior of women and men. 
In fact, of the hypotheses tested, only the effect of women’s enrollment differed 
between men and women. The size, direction and rank order of the standardized 
feminization coefficients indicate that for the most part, men and women respond 
quite similarly to social and economic forces. One social influence, the increased 
class sizes resulting from Congressional action, generated more applications from 
men and women. Another social influence, women’s enrollment in the veterinary 
colleges, was generally the largest negative influence on applications from men. 
Economic factors in terms of relative incomes and tuition costs also played a 
substantial role. Tuition costs consistently had a larger negative effect on men’s 
applications than those from women. For women, the declining economic position of the 
profession had the largest depressive effect, but this was largely offset by the increase 
in class sizes. The size and direction of the coefficients are broadly consistent with 
other research that finds that men and women place similar value on occupational 
attributes (Jencks, Perman and Rainwater 1988) and suggest that this is largely true 
even as occupations experience a significant change in sex composition. 

Does the presence of women stigmatize occupations to men? Certainly, the 
same stratification processes may undergird the seemingly disparate motives and 
mechanisms of inequality for different ascriptive groups (Reskin 2003). Therefore, 
if social contacts are hypothesized to prompt women’s interest in an occupation, 
the possibility that networks factor in men’s candidacy should be considered, 
as well. The networking and homophily arguments applied to predict women's 
application behavior are also descriptive of men’s behavior, inasmuch as the col- 
leges that enrolled a smaller proportion of men received significantly fewer ap- 
plications from men, a finding that is consistent with the patterns of referrals 
that Fernandez and Sosa (2005) found among customer service representative 
applicants. Conversely, the proposition that women’s presence stigmatizes an oc- 
cupation should be equally plausible for women. The negative effect of women’s 
enrollment on male applications supports devaluation theory and suggests that 
men may indeed choose not to enter feminizing occupations. However, since the 
proportion of female practitioners had no effect on male applications, then at first 
glance, the female threat threshold for male aversion would seem to be greater 
than 32 percent (the highest proportion of female practitioners during the study), 
which is consistent with other studies that find 30 to 40 percent female induces 
compositional changes (Nesbitt 1997; Pfeffer and Davis-Blake 1987), or even as 
great as the 54 percent found at the bachelor’s degree level (England and Li 2006). 
However, the representation of women in the student body did have a significant 
effect on male applications from the beginning of the study, when the proportion 
of women students was much lower (24 percent in 1975). Consequently, it appears 
that the role of women as peers may be a greater factor than women’s general 
proportional representation in men’s decisions not to apply. 

The results of the analysis indicate that for every 1 percent increase in women in 
the veterinary college student body, about 1.7 fewer men will apply the subsequent 
year. (By comparison, each $1,000 increase in tuition reduces the number of male 
applicants by about 1.2.) However, the curvilinear relationship between applica- 
tions from men relative to women’s enrollment adds nuance to devaluation theory, 
which expects a linear male aversion to all women’s roles. Finally, contrary to the 
female role model and inroads expectations, women’s increasing presence does not 
measurably encourage women. This finding also is contrary to research that finds 
that women’s presence negatively affects women’s receipt of doctorate degrees after 
a discipline reaches 38 percent female (England et al. 2007). 
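
The curvilinear relationship discussed above can be made concrete with a short sketch. With a linear and a squared term for women's enrollment, the marginal effect on male applications changes with the level of enrollment and reaches zero at a turning point. The function names are hypothetical, and the coefficients are invented purely for illustration; they are not the estimates reported in Table 3.

```python
def marginal_effect(b_linear, b_squared, share):
    """Change in predicted applicants per one-point rise in share, evaluated at `share`."""
    return b_linear + 2.0 * b_squared * share


def turning_point(b_linear, b_squared):
    """Share at which the marginal effect of a quadratic specification equals zero."""
    if b_squared == 0:
        raise ValueError("a turning point requires a nonzero squared term")
    return -b_linear / (2.0 * b_squared)


# Hypothetical coefficients: negative linear term, positive squared term.
b_linear = -2.6
b_squared = 0.02

effect_at_30 = marginal_effect(b_linear, b_squared, 30.0)
share_at_zero_effect = turning_point(b_linear, b_squared)
```

Under these assumed values, each additional point of female enrollment deters fewer men as enrollment rises, which is the sense in which a positive squared term implies that the deterrent effect is not linear.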

In their classic work on occupational feminization, Reskin and Roos (1990) did 
not classify veterinary medicine as a feminizing occupation. Veterinary medicine 
also did not appear in a more recent study of integrated occupations, those that by 
1990 were between 45 and 55 percent female (Gatta and Roos 2005). Why, then, 
is veterinary medicine feminizing? Both the profession and the popular press have 
put forward a variety of explanations for women’s increasing presence, ranging 
from technological change (improvements for animal restraint), more role models, 
a family-friendly flexible work schedule and reduced admission discrimination to 
the socialization of women as nurturers, while decreased applications from men 
have been rationalized by stagnant incomes, the “stigma” of working with women, 
and loss of autonomy and prestige (Schweitzer 2007; Smith 2006). Most of these 
explanations are not easily tested empirically. 

The explanation of reduced barriers to admission for women has merit, although 
it fails to explain why other professions that have also eliminated that bias have not 
feminized, or have not done so at the same rate. Veterinary colleges historically have shaped the 
sex composition of the profession by excluding women entirely or employing annual 
enrollment caps. The veterinary college at Texas A&M, the last program to exclude 
women, required the force of the Texas state legislature to sex-integrate in 1963 
(JAVMA 1963). Nevertheless, women’s opportunities remained sharply limited even 
in programs that had been nominally integrated for decades. Prior to the enactment 
of the 1972 Title IX education amendment that formally prohibited sex discrimination 
by federally-funded education programs, the veterinary college at Cornell 
University employed an annual enrollment cap of two women (Sandler 2007). After 
Title IX, the veterinary programs continued to play an important, albeit different, 
role as a gatekeeper to the profession. Since 1976, the first year of recordkeeping, 
roughly the same percentage of the male and female applicant pool enrolled in the 
veterinary programs. For example, in 1976, 15.4 percent of male candidates and 
16.3 percent of female candidates enrolled. At that time, women comprised 32.6 
percent of the applicant pool. In 1995, when women comprised 67.5 percent of 
candidates, the colleges continued to enroll nearly equivalent proportions of the 
male and female applicant pools, 34.3 percent and 34.7 percent, respectively. I 
found the same pattern for graduate programs of law, osteopathy and medicine 
(Figure 4) as did Weaver et al. (2005) for dental colleges and Cole (1986) for medical 
colleges. Presumably, men and women are similarly qualified, and this explains the 
similarities in enrollment rates. After the artificial restraints on women’s enrollment 
were removed by Title IX, it seems highly likely that veterinary graduate school 
admissions procedures were egalitarian and ensured that compositional changes in 
programs and professions have a supply-side component. 
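
The arithmetic behind this supply-side point can be checked directly from the figures quoted above. The sketch below is illustrative only, and the function name is hypothetical: it computes the female share of an entering class from the female share of the applicant pool and the sex-specific enrollment rates.

```python
def female_share_of_enrollees(female_share_of_applicants, female_enroll_rate, male_enroll_rate):
    """Female share of the entering class implied by pool composition and enrollment rates."""
    female = female_share_of_applicants * female_enroll_rate
    male = (1.0 - female_share_of_applicants) * male_enroll_rate
    return female / (female + male)


# Figures quoted in the text for 1976 and 1995.
share_1976 = female_share_of_enrollees(0.326, 0.163, 0.154)
share_1995 = female_share_of_enrollees(0.675, 0.347, 0.343)
```

Because the enrollment rates for men and women are nearly equal in both years, the implied female share of each entering class stays within about a percentage point of the female share of the applicant pool, so changes in who applies carry through to who enrolls.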

The results of this study demonstrate only one consistent difference between 
male and female application patterns—men’s strong negative response to women's 
increasing enrollment. During the period under study, increasingly more women 
than men graduated from college, placing a larger percentage of women in the 
applicant pool from which the colleges drew sex-proportionate cohorts. Men 
then increasingly avoided the feminizing veterinary programs. Consequently, the 
engine of feminization is not fueled by women who are inordinately attracted to 
veterinary medicine, but rather driven by men’s lower rates of college graduation 
and their aversion to women students. Additionally, the finding that men fled the 
more feminized pharmacy to veterinary medicine, but also fled from veterinary 
medicine as its wages declined relative to medicine and law, suggests that 
feminization is passed along between occupations. 

The points raised herein prompt the question of whether sex integration in 
veterinary education will be associated with the eventual occupational outcomes that 
Reskin and Roos (1990) identified: resegregation, in which occupations become 
female-dominated, or ghettoization, in which men control the more lucrative 
specialties in an occupation. The evidence suggests that veterinary medicine will 
resegregate, as the number of women graduating has held a sharp lead over men 
for the past 20 years, and veterinary applications and enrollment continue to 
be female-dominated. Indeed, national figures show a veterinary applicant pool 
that has hovered at 80 percent female since 2004. Ghettoization seems less likely. 
Women have surpassed men in one of the most lucrative specialties, companion 
animal practice (AVMA 2009), which accounts for two-thirds of private practice 
employment, while men dominate the less lucrative domain of large food animal 
medicine, a point that received national attention recently (Belluck 2007). 


Conclusion 


As overt demand-side gender inequality declines (Blau, Brinton and Grusky 2006), 
scholars increasingly are scrutinizing those forces that structure supply and the 
nexus between supply and demand. Supply-side explanations of occupational sex 
segregation, which focus on the attributes that workers bring to the employment 
process, posit that workers select occupations for a variety of economic and social 
reasons. However, many of the consequential factors implicated in research on 
occupational sex segregation and feminization have resisted hypothesis testing 
due to the limitations of the available data and the difficulty inherent in studying 
changes in the sex composition of an entire occupation. 

This article presents an empirical test of multiple simultaneous effects of proposed 
feminization mechanisms on supply-side behavior. Longitudinal applicant behavior 
explains some of the compositional change while circumventing the demand-side 
selection pitfall of examining only employed workers and the supply-side unreliabil- 
ity of career predictions found in surveys of occupational aspirations. By studying 
compositional changes at the point of entry to professional education, this research 
was able to specify mechanisms that are—and are not—contributing to feminization 
at a key point. The analyses demonstrate that gender differences in behavior are 
minimal and clarify the predictions of queuing and devaluation theories. 

Certainly, findings from the study of professions should necessarily be inter- 
preted conservatively and assume limited generalizability to other occupations, 
notably those that do not have the same structured entry requirements or require 
specialized investment in graduate training or a college degree. Nevertheless, that 
many of the feminization hypotheses have been generated from research on 
occupations other than the professions lends support to the view that the feminization 
processes identified herein apply to a wide variety of occupations. 

Beyond the factors examined in this study, what initially prompts a field’s 
feminization? Initially, women’s ambitions probably expanded beyond traditional careers 
due to the women’s movement (Shu and Marini 1998) coupled with legislation that 
granted women equal access to education. Accordingly, women became freer to 
move into previously male-dominated educational tracks, and the sex composition 
of higher education grew more reflective of the applicant pool. This suggests that 
continued attention should be directed toward factors that influence gender differ- 
ences at all stages of educational trajectories (e.g., Correll 2001, 2004), particularly 
those feminization variables identified as influential in this analysis. 

Twentieth-century sex segregation trends demonstrate that levels of sex in- 
tegration wax and wane in accordance with historical events and are, to some 
extent, occupation-specific (Weeden 2004); hence, the integration trajectory of 
one vocation may not be fully descriptive of another. Weeden found substantial 
variation between occupations in the pace of integration and even reversals at 
times. Nevertheless, the present analyses have implications for understanding the 
relative contributions of supply- and demand-side factors to the sex composition 
of occupations and the employment search process generally. As with pharmacy 
and veterinary medicine two decades prior, more women than men applied to 
American medical colleges for the first time in 2003 (AAMC 2008). Recent medi- 
cal journal discourse emphasizes two decades of wage stagnation in physician 
residency programs and posits that law or business may have drawn more male 
applicants as a result (Paik 2000). While it is likely that the medical profession, 
and indeed any other occupation, may be jostled in favor of more lucrative em- 
ployment alternatives, the present study suggests that occupational prestige is 
intertwined in the competition for applicants, and the prestigious, lucrative alter- 
natives to medicine are few. Indeed, an exclusive emphasis on economic jostling 
ignores men’s declining rates of college graduation and male aversion to women 
students. However, since 1999, medical schools have consistently increased class 
sizes; subsequently, the number of applications from women and men also in- 
creased. Ultimately, as Figure 4 demonstrates, these findings are important in the 
context of the gender-egalitarian graduate program admissions policies that enable 
the supply-side contribution to the sex composition of occupations. 


Notes 


1. Female undergraduate premedical majors of modest abilities are less likely than 
comparable men to apply to medical school (Fiorentine 1987). Fiorentine and Cole 
(1992) concluded that female premed students have more “normative alternatives” to 
their medical career goals than men should they encounter challenges during their 
post-secondary education. 


2. Worker preferences may be further influenced by sex-role socialization and the 
perceptions of what jobs are available (Cole 1986; Reskin and McBrier 2000). 


3. Historically, no specific undergraduate major has been required for enrollment in the 
veterinary colleges or other health professions. In 2005, 41 percent of medical school 
matriculants earned bachelor’s degrees in a major other than biological sciences or 
health sciences; 12 percent were social science majors (AAMC 2006). 


4. Time lags up to 10 years indicate that a one-year correction results in the best-fitting 
model. 


5. The historical stability and relative distance of these scores is well documented. Blau 
and Duncan’s (1967) classic measure of occupational status ranks physicians and 
lawyers in the 90-96 point range, pharmacists in the 80-84 range, and veterinarians 
in the 75-79 range. Optometrists do not appear on the ranking. On the Duncan 
SocioEconomic Index, they score 92, 89, 93, 97 and 90, respectively. On the Nakao 
and Treas (1994) scale, veterinarians score a 62, optometrists 67, pharmacists 68, 
lawyers 75 and physicians 86. 


6. Models 3 through 6 yield similar inflection points ranging from 63.9 percent to 66.8 
percent. 


7. A random effects model was required because a fixed-effects model drops the 
unvarying top-10 rank variable. 


Sex Typing of Jobs in Hiring: Evidence from Japan 
Eunmi Mun, Harvard University 


Using unique data on employers’ pre-hire preferences, this article examines the effect 
of sex typing on the gender gap in offered wages and training. Previous studies using 
post-hire data have not been able to focus directly on the effects of employer behavior, 
distinct from employee preferences. By analyzing gender-designated job requisitions 
for the entry-level labor market in a Japanese city, this study investigates employ- 
ers’ pre-hire decisions about the wage level and on-the-job training that accompany 
the sex typing of jobs. Results show that employers’ sex typing excludes women in 
advance from jobs that provide higher wages and longer training. 


In most countries, job segregation by gender and the gender gap in wages go 
together: there is a greater concentration of women in low-status and low-paying 
jobs (Anker 1998; Chang 2000; Charles and Grusky 2004). This concentration 
results in part from gender differences in employees’ education level and job 
preferences (Becker 1985; Correll 2001; Marini and Brinton 1984). Residual 
gender inequality in wages has been documented, however, even after controlling for 
education and occupational aspirations (Groshen 1991; Marini and Fan 1997), 
suggesting that employers’ preferences also play a role. Theories of sex segregation 
have offered several reasons why employers may show gender preferences, includ- 
ing statistical discrimination (Bielby and Baron 1986; Phelps 1972), customers’ 
tastes (Neumark 1996; Sunstein 1991), employers’ gender biases or stereotypes 
(Reskin and Roos 1990), and insiders’ rent-seeking behavior (Reskin 1988; Tilly 
1998; Tomaskovic-Devey and Skaggs 2002). 

By contrast, the question of how employers act on their gender preferences (as 
opposed to why they have them), and how their policies and procedures in hiring 
lead to gendered employment outcomes, has been little studied. This “how” question 
is of great importance. As Reskin (2003) points out, motivations alone do 
not necessarily lead to discrimination; specific actions must be taken that result in 
gender inequality in wages, training, authority or other outcomes. These actions 
have been difficult to observe because of data limitations. Research using post-hire 
data can only infer employers’ discriminatory actions by relying on the existing 
variation in hiring outcomes or wages between the genders (e.g., England, Farkas, 
Kilbourne and Dou 1988; also see Reskin 2003). But this inference inevitably 
leaves room for the alternative explanation that the variation may have resulted 
from differing job preferences of female and male employees. It is difficult, using 
post-hire data, to directly investigate employers’ actions in hiring and their 
implications for gender-disparate outcomes. There is thus a gap in the literature 
that limits our understanding of how sex segregation and reward disparities occur. 

In this article I attempt to address this gap and shed some light on 
employer-side behavior at the hiring stage and its impact. I take advantage of data from 
Japan in which job announcements explicitly include the gender hiring preference 
of employers. Although not practiced in current American labor markets, 
such explicit gendered hiring was common in the United States until the 1960s, 
with many employers posting sex-specific job advertisements in the newspapers 
(Pedriana 2004). For this study, job requisitions for the entire population of 
full-time entry-level jobs for high school graduates in an area of a large city (Kawasaki) 
were collected at a time (the mid-1990s) when Japanese employers stated in ad- 
vance their preference for the gender of the new employee, together with the wages 
and amount of training they would provide for a specific job.¹ The data provide a 
rare view of employers’ pre-hire thinking about their desired employees through 
the sex typing of jobs. Because an Equal Employment Opportunity Law became 
more tightly enforced in Japan by the late 1990s, the mid-1990s data provide an 
unusual opportunity to assess employers’ sex typing of jobs in an environment 
relatively unconstrained by legal provisions. My analysis of the data shows that 
sex typing of jobs at the point of hire directly restricts women’s access to higher 
paying jobs and those with longer on-the-job training periods. 


Employer-side Exclusion of Women in Hiring 


Many studies sample current employees to document gender segregation by job 
or occupation and the gender gap in salary, benefits and authority (e.g., Bielby 
and Baron 1986; England et al. 1988; Tomaskovic-Devey 1993; Wright, Baxter 
and Birkelund 1995). Based on the existing gender gaps in the workplace, these 
studies attempt to determine who is responsible—sometimes arguing that employ- 
ers exclude women from more lucrative jobs. Although these studies control for 
employee human capital, their conclusions with regard to employers’ exclusion 
of women are limited because they do not show how employers do this and what 
the direct consequence of exclusion is (Reskin 2003). While post-hire survey data 
have been a good way to measure the outcomes of hiring, they cannot reveal the 
process of hiring (Roscigno, Garcia and Donna 2007). 

Debates between supporters of different explanations for the gender wage 
gap—cultural devaluation, specialized human capital and social closure—reflect these 
analytical limitations of using post-hire data to analyze processes in the workplace. 
Although England and her colleagues, finding that female-dominated jobs pay less, 
argue that women earn less because their jobs are devalued and thus underpaid by 
employers (England et al. 1988; Kilbourne, Farkas, Beron and Weir 1994), it could 
also be that women choose lower-paying, female-dominated jobs for various unob- 
served reasons. Tam (1997) raises this self-selection issue by showing that the wage 
gap is associated with a lower level of specialized human capital in women, who enter 
lower-paying jobs because they have chosen not to invest in training. Tam’s (1997) 
argument was criticized by Tomaskovic-Devey and Skaggs (2002), who contend 
that women do not choose not to invest in training but are excluded from jobs that 
provide it. Both Tam (1997) and Tomaskovic-Devey and Skaggs (2002) find that 
a training differential helps explain the wage gap, but their findings do not provide 
conclusive evidence of either self-selection or exclusion; either or both processes 
could be responsible. In order to advance the theoretical debate in the literature 
and determine whose preferences are realized, it is necessary to show how women 
are excluded from high wages or training—to analyze specific employer practices 
distinct from the influence of other actors, particularly female employees themselves. 

Many theories explaining employers’ motives to discriminate implicitly suggest 
that they sex type jobs in the hiring process, reserving better jobs with higher wag- 
es, longer training and higher status for men (Bielby and Baron 1986; Sunstein 
1991; Reskin and Roos 1990; Reskin 1988). A few qualitative studies on particular 
jobs and industries provide evidence of employer sex typing of jobs, although 
this is not usually the focus of the study. In their study of the rise of part-time 
jobs during the 1970s, Beechey and Perkins (1987) interview employers in one 
British city, who report that employers distinguish women’s jobs and men’s jobs 
according to various criteria that sometimes conflict, and that there is a clear 
hierarchy between men’s and women’s jobs as employers describe them. Written 
job definitions, which are studied in Gorman’s research on in-group favoritism in 
law firm hiring (2005, 2006), may reflect sex typing because they contain femi- 
nine or masculine descriptions of jobs. Skuratowicz and Hunter (2004), in their 
participant-observation study on restructuring of jobs in a large American bank, 
observe that employers attach implicitly gendered images to new jobs, which may 
affect individual decisions to enter a job. Cohn’s (1985) historical study on the 
feminization of clerical jobs in Britain in the early 1900s finds that employers 
deliberately recruited women for jobs when hiring men became less profitable. 
If the pre-hire sex typing of jobs found by researchers in diverse contexts ex- 
cludes women from better jobs (instead of relegating the genders to different jobs 
while treating them equally), such behavior in the hiring process contributes to 
a gender gap in valuable outcomes. Wages are a direct indicator of the value of 
jobs. In addition, on-the-job training is an important factor leading to later gen- 
der wage gaps (Padavic and Reskin 2002). On-the-job training helps employees 
develop firm-specific skills that are key to career advancement and wage increases 
because these skills are considered costly and often more valuable to employers 
than transferable skills (see Tam 1997). Empirical evidence in previous studies also 
shows that gender differences in accumulated on-the-job training at workplaces 
explain a good proportion of the gender wage gap (Roos and Gatta 1999). The 
data used in this article allow me to directly examine the linkage between employers’ 
sex typing of jobs and the wages and on-the-job training attached to jobs. 


Research Setting: The Japanese Entry-level Labor Market 


In contrast to previous work, I use data on employers’ pre-hire practices.² The 
representative value of the setting from which these data come, the Japanese 
entry-level labor market, may require a word of explanation, since the common 
image of the Japanese labor market is one of extreme gender inequality and segre- 
gation with women concentrated in lower-status jobs. It is true that Japan shows 
higher gender inequality in wages and job status than the United States and many 
other Western countries; women earn an average of only 50 percent of what men 
earn, female managers constitute less than 5 percent of all managers (Japanese 
Statistical Yearbook 2008; Japanese Company Handbook 2008), and Japanese 
women tend to drop out of the labor market when they marry or have children 
(see Appendix A for a U.S.-Japan comparison). However, Japan exhibits a level of 
sex segregation no higher than that of other industrialized societies (Anker 1998; 
Charles, Chang and Han 2004; Jacobs and Lim 1992; Rosenfeld and Kalleberg 
1991). Although the precise accuracy of the standard measure of sex segregation is 
debatable (Charles and Grusky 2004), Japan’s segregation index is 44.8, compared 
to the United States at 45.1, France at 54.5, and Sweden at 60.1 (Charles and 
Grusky 2004).³ Moreover, Tomaskovic-Devey and Stainback (2007) show that 
in the American labor market as of 2002, despite a trend of desegregation, more 
than 50 percent of employees would need to be transferred to other jobs in order 
to produce gender- and race-integrated workplaces. 
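
The reading of a segregation index as the share of workers who would have to change jobs can be illustrated with a short sketch. It assumes that the standard measure referred to above is the index of dissimilarity; the function name and the occupational counts are hypothetical and are not the data behind the cited figures.

```python
def dissimilarity_index(men, women):
    """Index of dissimilarity (0-100) across a list of occupations.

    men[i] and women[i] are the numbers of men and women in occupation i.
    The result is the percentage of either group that would have to change
    occupations for the two distributions to be identical.
    """
    total_men = sum(men)
    total_women = sum(women)
    return 50.0 * sum(
        abs(m / total_men - w / total_women) for m, w in zip(men, women)
    )


# Hypothetical counts for three occupations (illustration only).
men = [60, 30, 10]
women = [10, 30, 60]
print(round(dissimilarity_index(men, women), 1))
```

With these made-up counts the index is 50, meaning half of the women (or half of the men) would need to move to a different occupation to equalize the two distributions.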

At the same time, the Japanese labor market has particular advantages for inves- 
tigating employer behavior at the point of hire because the visibility of this behavior 
is a distinguishing feature of this market. Especially before 1997, when the Japanese 
government began enforcing the Equal Employment Opportunity Law banning 
practices that could limit women’s employment opportunities, employers explicitly 
searched for applicants of a specific gender to fill empty positions, as is the case with 
the Kawasaki job requisitions studied here. Although these requisitions are from 
a time before the Equal Employment Opportunity Law was enforced, in fact the 
gender wage gap and segregation index have not changed much since then; women 
earned about 50 percent of what men earned both in 1995 and 2005 (calculated 
according to “full-time employees’ monthly cash earnings including overtime pay” 
taken from the Japanese Statistical Yearbook 2008), and the segregation index was 
about 40 both in 1995 and 2000 (Japanese Population Census 2000). The M-curve 
pattern of female labor force participation in Japan, while it has slightly flattened 
out, has persisted between 1995 and 2005 (see Appendix A). 

Japanese employers have a strong motivation to be particularly clear about 
their preferences at the point of hire. Because the Japanese workplace has been 
characterized by long-term employment and strong internal labor markets (ILMs), 
hiring decisions are of utmost importance. In ILMs, positions at higher levels in 
job ladders are filled by internal candidates who have acquired firm-specific skills 
(Brinton 1991; Kalleberg and Lincoln 1988). Employers are thus deciding as early 
as the point of hire how to distribute their resources and to whom. 

Understanding the hiring process in the youth labor market is important to 
projecting the trajectory of earnings over the life cycle. Previous studies investigat- 
ing gender inequality in the Japanese labor market have shown that the increase 
in women’s earnings with age is considerably smaller than for men (Aiba and 
Wharton 2001; Kalleberg and Lincoln 1988; Kumlin 2007). The increasing wage 
gap with age reflects the path-dependent process of cumulative advantage; it is 
hard for women to overcome the initial disadvantage (see DiPrete and Eirich 2006 
for a review). Employers’ decisions at the point of hire, therefore, are critical to 
the overall pattern of workplace sex segregation and the gender wage gap. 

The Japanese entry-level labor market for graduating high school seniors 
provides special analytical leverage. In the case of new high school graduates, 
employers are legally required to submit job requisitions (forms listing details 
of their job openings) to the local Public Employment Security Office. In this 
nationally standardized system, employers provide workplace information such 
as industry, unionization and number of total employees. They specify the job 
titles of the vacant positions and describe the tasks that employees will perform; 
give information on the level of wages, training and fringe benefits (e.g., health 
insurance, pension benefits); and finally list the names of the high schools from 
which they want to recruit. According to Japanese labor law, employers must 
have these forms approved before sending them to the schools (Brinton 2000). 
High school seniors must receive their school’s recommendation in order to be 
interviewed by the employer. Until the late 1990s the form required employers 
to specify the gender they preferred: male, female or either. Employers could 
tailor their requests by making separate requisitions if they wanted to hire for 
multiple positions, and they could fill out two requisitions for the same job if 
they wanted to hire a certain number of female and male workers. 

While the labor market for new high school graduates contains proportionately 
more low-skill jobs than the labor market for new higher-education graduates, it 
constitutes a substantial portion of the Japanese entry-level positions. About 38 
percent of high school students entered universities or junior colleges immediately 
after graduation in the mid-1990s, and another 15 percent went to two-year 
training schools (senmon gakko). This means that about 47 percent of high school 
seniors were potentially job-seekers.⁴ In addition, the spectrum of jobs in the high 
school graduate labor market is wide. The Kawasaki job requisitions for 1995 span 
about 40 percent of the job titles listed in the Japanese standard occupational 
classification, including retail sales clerk, tour bus guide, carpenter, computer 
programmer, electrical technician and dozens of others. 


Hypotheses 

The analyses in this article test whether employers’ sex typing excludes women 
from better jobs, as theoretical explanations suggest. Although the definition of 
“better” jobs can vary, the criteria most often mentioned are higher wages, longer 
on-the-job training, and more authority or status. Since the requisitions analyzed 
are for entry-level jobs, neither authority nor status is relevant here. But the level 
of wages attached to male and female jobs can reveal the effect of sex typing in 
excluding women from better jobs. This leads to the first hypothesis: Male-typed 
jobs offer higher wages than female-typed jobs. 

Given the importance of training for an employee's future wages (e.g., Tam 
1997; Tomaskovic-Devey and Skaggs 2002), training differentials between male 
and female jobs can also reveal the effect of sex typing in excluding women from 
better jobs associated with higher wages. The Japanese context is a particularly 
good one in which to interpret the gender training gap because human capital 
development is mainly accomplished by firm training or firm-sponsored training, 
with little opportunity for alternative education or training (Koike 1988). This 
leads to a second hypothesis: Male-typed jobs provide longer on-the-job training 
than female-typed jobs. 

Although these hypotheses test pre-application employer behavior, applicants’ 
choice in human capital investment before the hiring process, such as high school 
education, can also influence employers’ offers of wages and training. In order to 
take account of employer decisions made about the human capital of potential 
applicants other than gender, the different types of high schools attended by ap- 
plicants are controlled for in the analyses. In this way, testing hypotheses 1 and 2 
can reveal the contribution of employers’ sex typing of jobs to wages and training, 
independent of employee job and education choices. 


Data and Methods 


The data set is constructed from all job requisitions submitted by employers 
to the Public Employment Security Office in 1995 in Kawasaki, a city near 
Tokyo and one of the 13 largest cities in Japan, with a population of more 
than 1 million. These requisitions constitute all full-time entry-level jobs for 
graduating seniors from high schools in the district.⁵ Both the period in which 
the data were collected and the geographical location are very fortuitous. In the 
mid-1990s it was still legal for Japanese employers to explicitly indicate their 
gender preferences for specific job openings; this became formally illegal by the 
late 1990s. Kawasaki, with a large industrial base, traditionally had many entry- 
level jobs for high school graduates. Moreover, about 60 percent of the firms 
represented in the data set are Tokyo-based firms, meaning that the Kawasaki 
labor market is part of the Tokyo metropolitan area, and thus is indicative of a 
larger urban entry-level labor market. 


Table 1: Descriptive Statistics 

                                      Requisitions Designated for
                                       Men       Women  T-test      Either       Total
Job Characteristics
  Monthly wage (yen)            168,401.10  156,505.70    sig.  160,368.20  164,574.00
  Logged wage                        12.03       11.96    sig.       11.98       12.00
  Training period (days)             30.52        9.55    sig.       21.43       24.88
  Logged training period              -.42        -.62                -.52        -.48
  Any training                      33.338       35.29               34.19       37.10
  Training (> 1 month) %             21.81       13.73    sig.       18.38       19.67
  Training (> 3 months) %            15.64        4.90    sig.       15.07       13.69
  Skill requirement %                30.95        7.84    sig.       19.85       24.52
  Job size (# of openings)           11.22       12.64               10.61       11.32
Occupational Groups
  Professional/Technical %           26.34        5.39    sig.       20.96       21.58
  Clerical %                         12.62       53.43    sig.       16.91       20.50
  Sales/Services %                   12.07       31.86    sig.       36.03       20.83
  Production %                       30.59        8.82    sig.       16.18       23.65
  Operators %                        18.38         .49    sig.        9.93       13.44
Firm Characteristics
  Firm size (# of employees)      1,371.64    1,923.34            1,001.97    1,381.60
  Percent female                     19.70       40.40    sig.       29.03       25.38
  Unionization %                     20.39       33.30    sig.       19.49       22.38
Industries
  Construction %                     30.86        5.39    sig.       19.12       23.90
  Manufacture %                      24.55       31.86    sig.       16.91       24.07
  Sales %                            12.89       30.39    sig.       25.74       18.76
  Professional services %            20.44       22.55               20.59       20.83
  Other services %                   11.25        9.80               17.65       12.45
School Types
  Industrial only %                  53.64        0.49    sig.       30.88       39.50
  Commercial only %                   2.33       22.06    sig.        5.15        6.31
  General only %                      6.86       19.61    sig.       19.49       11.87
  Industrial and commercial %          .27         .00                1.47         .50
  Industrial and general %            9.19        1.96    sig.        5.51        7.14
  Commercial and general %            3.57       30.39    sig.       12.13       10.04
  All three schools %                 6.31        1.47    sig.        4.41        5.06
  None of them %                     17.83       24.02    sig.       20.96       19.59
Number of requisitions                 729         204                 272       1,205

Note: Mean differences (T-tests) are tested only between men and women; “sig.” marks a 
difference flagged as statistically significant (two-tailed tests). 


Models 


The unit of analysis in this study is the job requisition.⁶ I first run OLS analysis using 
a robust-cluster estimator of standard errors, which adjusts the standard errors for 
clustering among requisitions issued by the same employer. Although OLS analysis is 
widely used, it suffers from unobserved heterogeneity problems even when many 
controls are included. In order to overcome the limitations of OLS analysis, I then 
run fixed-effects analysis using multiple requisitions from the same employers. 
This analytical approach differs from previous studies, which compared individuals 
already working in different jobs and different companies (England et al. 1988; 
Kilbourne et al. 1994; Petersen and Saporta 2004; Tam 1997; Tomaskovic-Devey 
and Skaggs 2002). That approach is problematic for several reasons. If individuals 
across companies are compared (generally OLS models), there may be omitted 
variables for firms and individuals. Company culture and individual preference 
for a specific company, for example, are hard to control for but can confound the 
results. Especially when post-hire data are used, the risk of omitted variables is high 
(see Fernandez and Sosa 2005 for a critical review). In addition, most cross-company, 
individual-based surveys collect occupational-level data, which introduce possible 
errors in the measurement of job-level attributes such as training time. As a result, 
results based on such data are likely to be unstable, depending on the ways the oc- 
cupational proxies are manipulated in the regression models. This is why Petersen 
and Morgan (1995) focused on within-company and within-job wage gap between 
men and women. 

In this research, the pre-hire job-level data and fixed-effects models 
help overcome the common limitations of OLS models, caused specifically by 
unobserved heterogeneity across firms and jobs. Using the multiple requisitions 
submitted by the same employers, I can compare jobs within the same company, 
which eliminates errors caused by firm-level unobserved heterogeneity. Although 
these multiple requisitions might seem to be duplicate information because of the 
shared-firm characteristics, they vary in sex typing. With no variation in firm infor- 
mation, these requisitions actually provide great analytical leverage to measure the 
pure sex-typing effect, free of any influence related to firm characteristics. Moreover, 
two job requisitions for the same job in the same company, where only the specified 
gender is different across the requisitions, allow controls for firm-related variations 
across companies as well as unobserved job-related heterogeneity. That is, differences 
in job task or relative importance of the job do not confound the effect of sex typing. 
I thus use fixed-effects models to compare (1) job requisitions submitted by the same 
employers (firm fixed-effects models) and (2) job requisitions aiming to hire men and 
women for the same position in the same company (firm and job fixed-effects models). 
Some 227 companies submitted two or more requisitions (560 total), and 54 of 
these companies submitted two or more requisitions advertising exactly the same job 
to men, women and either (113 requisitions).⁷ By using these cases in fixed-effects 
models, I compare within-firm and within-firm-and-job gender differences in wages 
and on-the-job training. The advantage of this approach is that the “pure” effect on 
wages and training of the employer’s action (i.e., sex typing) can be measured net of 
the influence of firm and job characteristics. 
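
The logic of the within-firm comparison can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions: a single female-typed indicator with no controls, and hypothetical requisitions from two made-up firms. It does not reproduce the models estimated in this article. The firm that pays more posts mostly male-typed jobs, so the pooled estimate mixes between-firm pay differences with the sex-typing gap, whereas demeaning within each firm recovers the gap built into the example.

```python
from collections import defaultdict


def ols_slope(pairs):
    """Bivariate OLS slope of y on x for a list of (x, y) pairs."""
    n = len(pairs)
    x_bar = sum(x for x, _ in pairs) / n
    y_bar = sum(y for _, y in pairs) / n
    num = sum((x - x_bar) * (y - y_bar) for x, y in pairs)
    den = sum((x - x_bar) ** 2 for x, _ in pairs)
    return num / den


def within_firm_slope(rows):
    """Fixed-effects (within) estimate: demean x and y inside each firm,
    then run OLS on the demeaned values."""
    by_firm = defaultdict(list)
    for firm, female, log_wage in rows:
        by_firm[firm].append((female, log_wage))
    demeaned = []
    for pairs in by_firm.values():
        x_bar = sum(x for x, _ in pairs) / len(pairs)
        y_bar = sum(y for _, y in pairs) / len(pairs)
        demeaned.extend((x - x_bar, y - y_bar) for x, y in pairs)
    return ols_slope(demeaned)


# Hypothetical requisitions: (firm, female-typed indicator, logged wage).
# Within each firm the female-typed job is offered 0.03 log points less.
rows = (
    [("A", 0, 12.10)] * 3 + [("A", 1, 12.07)]
    + [("B", 0, 11.90)] + [("B", 1, 11.87)] * 3
)

pooled = ols_slope([(female, wage) for _, female, wage in rows])
within = within_firm_slope(rows)
print(round(pooled, 3), round(within, 3))
```

In this made-up example the pooled slope is about -0.13 while the within-firm slope is about -0.03, which is why the firm fixed-effects estimate is the more stringent measure of the sex-typing effect.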


Variables 


Dependent Variables 


Offered wages are stated as monthly salary (unit: yen). Offered training is measured 
as the length of the training period (company orientation, training or apprenticeship) 
in days. Both wages and training are log-transformed in the analyses to 
correct their highly skewed distributions. 
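
The transformation of the training variable can be sketched as follows. The constant of .1 is the one reported in the note to Figure 1; the use of the natural logarithm is an assumption, although it is consistent with the magnitudes of the logged values reported in Table 1. The function name is hypothetical.

```python
import math


def log_training(days, constant=0.1):
    """Natural log of training length in days, after adding a small constant
    so that jobs with zero training remain defined."""
    return math.log(days + constant)


print(round(log_training(0), 2))   # a job that offers no training
print(round(log_training(30), 2))  # a job that offers 30 days of training
```

Because a zero-training job maps to a negative value (the log of .1), a group in which most jobs offer no training can have a negative mean on the logged scale even when its average raw training length is several weeks.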


Independent Variable 
The sex type of a job is the gender the employer specifies in the requisition: male, 
female or either. Male is the baseline category. 
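
As a minimal sketch of this coding, assuming a conventional dummy-variable setup (the function names are hypothetical), the three-category sex type can be expressed as two indicators with male omitted, or as a single indicator that pools female and either against male:

```python
SEX_TYPES = ("male", "female", "either")


def sex_type_dummies(sex_type):
    """Two indicators, with male as the omitted baseline category."""
    if sex_type not in SEX_TYPES:
        raise ValueError(f"unknown sex type: {sex_type!r}")
    return {
        "female": int(sex_type == "female"),
        "either": int(sex_type == "either"),
    }


def non_male(sex_type):
    """Single indicator that pools female and either against male."""
    dummies = sex_type_dummies(sex_type)  # also validates the input
    return int(dummies["female"] or dummies["either"])


print(sex_type_dummies("female"), non_male("either"), non_male("male"))
```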


Control Variables⁸ 

Three types of variables are controlled. The first is job characteristics such as the size 
of the job and skill requirements. The size of the job is the number of job openings, 
measured by the number of new employees to be hired for that job. This variable 
controls for the proportion that each job assumes in the local labor market (also 
see Appendix B). A higher labor demand may increase the level of wages but may 
also reflect a non-specialized job that pays less. The skill requirement is coded (1) 
if there is a specific skill requested. A driver's license is a common requirement, and 
employers also sometimes want applicants who have taken courses in particular 
techniques, such as electrical or basic mechanical skills. These skill requirements 
may be related to the level of wages and the allocation to career-track jobs. 

Second, firm characteristics that may be associated with the level of wages and training 
are controlled (see Jacobs, Lukens and Useem 1996 for a review). The percentage 
of female employees is measured for the company as a whole. The firm size is measured by 
the number of employees. The presence of a labor union in a company is coded (1). 
The industry of the firm is categorized into one of five groups: construction, manu- 
facturing, sales, professional services and other services. (In the fixed-effects models 
these firm characteristics are not included because the models are designed to control 
for between-firm differences by including dummies for all firms.) 

Finally, the high schools contacted by employers are controlled. Job applicants 
have attended different types of high schools: industrial vocational schools, com- 
mercial vocational schools and general high schools that do not specialize in vo- 
cationally related education. Each type of school represents a different type of 
education which is correlated with a student's future career. Thus including school 
type as a control partially controls for the supply-side self-selection of young 
people into specific types of education. Because employers can identify as many 
schools as they want, eight combinations of schools can appear: industrial schools 
only, commercial schools only, general schools only, industrial and commercial 
schools, industrial and general schools, commercial and general schools, all three 
types of schools, and none of them.⁹ 
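
The count of eight follows from the fact that each of the three school types is either listed or not listed on a requisition. A brief sketch (the identifiers are hypothetical labels, not variable names from the data set) enumerates the combinations:

```python
from itertools import combinations

SCHOOL_TYPES = ("industrial", "commercial", "general")


def school_combinations(types=SCHOOL_TYPES):
    """All subsets of the school types, from none of them to all three."""
    combos = []
    for size in range(len(types) + 1):
        combos.extend(combinations(types, size))
    return combos


combos = school_combinations()
print(len(combos))
for combo in combos:
    print(combo if combo else "none of them")
```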


Analyses 
Descriptive Results 


The descriptive statistics in Table 1 show the characteristics of “ports of entry” 
(Doeringer and Piore 1971) in the labor market for new high school graduates, by 
the sex type of each job requisition. As shown at the bottom of Table 1, the difference 
in the numbers of job requisitions for men and women is striking. In the year 
of the study, 872 firms submitted 1,205 job requisitions, with 729 (60.5 percent) 
of the requisitions seeking male candidates and 204 (16.9 percent) seeking female 
candidates. The remaining 272 (22.6 percent) requisitions were for “either.” 

Figure 1. Wage and Training Distribution 
Notes: I added a constant (.1) to training before taking the log because many jobs do not 
provide training and are coded as “zero.” 

The requisitions designated for men are for more desirable jobs in terms of 
wages and training. The average monthly wage across all requisitions is 164,574 yen, 
which is approximately 1,760 U.S. dollars using the 1995 average yen/dollar exchange 
rate (and 1,734 U.S. dollars using the 2009 exchange rate).¹⁰ Job requisitions for men 
offer wages 11,895 yen (nearly 130 dollars) higher than those for women. This 
is a 7.6 percent difference and is significant at p < .001. Moreover, men tend to be 
preferred for jobs with longer training: the average length of training among the 
male-typed jobs is slightly longer than four weeks (30.5 days), while the length 
of training for female-typed jobs is only 10 days. In terms of long-term training, 
only 5 percent of women’s jobs provide more than three months, while more than 
15 percent of men’s jobs do.¹¹ 
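
The arithmetic behind these figures can be checked from the Table 1 means. In the sketch below, the wage figures are taken from the article, while the exchange rate is not an official figure: it is backed out from the article's own conversion of 164,574 yen to roughly 1,760 dollars, and the variable names are hypothetical.

```python
# Mean offered monthly wages in yen, from Table 1.
men_wage = 168_401.10
women_wage = 156_505.70
overall_wage = 164_574.00

# Gap in yen, and as a percentage of the mean wage in female-typed jobs.
gap_yen = men_wage - women_wage
gap_pct = 100 * gap_yen / women_wage

# Yen per dollar implied by the article's conversion (an assumption).
implied_rate = overall_wage / 1_760
gap_usd = gap_yen / implied_rate

print(round(gap_yen), round(gap_pct, 1), round(gap_usd))
```

The 7.6 percent figure is obtained when the gap is expressed relative to the mean wage of female-typed jobs; relative to the mean wage of male-typed jobs the same gap would be closer to 7.1 percent.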

The distributions of wages and training by gender, presented in Figure 1, show 
that the training distribution is highly skewed while wages are relatively normally 
distributed. The mean of the wages female jobs offer is slightly lower than that 
of male jobs, but there are not many outliers for either the male or female jobs 
that provide much higher wages than other jobs. The training distribution is more 
skewed, suggesting that many jobs do not provide training but that when they do, it 
can be quite long. Male-typed jobs provide training of up to three years, much longer 
than female jobs, which provide a maximum of one year of training. When training is log 
transformed in order to correct the skewing (which may result in a biased regression 
estimate influenced by the small number of extreme cases), male jobs show a logged 
training distribution similar to that for female jobs. This result suggests that the 
gender difference in training in Table 1 is driven by the length of training that male 
and female jobs offer, not by the proportion of jobs that offer any training at all. 

The differences in wages and training in Table 1 may reflect the association 
between sex typing and skill requirements. While more than 30 percent of men's 
jobs require specific skills (according to employers’ requisitions), only 8 percent 
of women’s jobs do. Because having specific skills leads to getting jobs that utilize 
these skills, jobs for men and women may be unevenly distributed by the skills 
each gender has acquired and brings to the job. Whether a job requires any skill, 
therefore, will be controlled for in the regression models which test the sex-typing 
effect on wages and training. 

There is a sharp distinction in the way men’s jobs and women’s jobs are distributed 
across occupational groups. For the descriptive analyses, I classified 
job titles in the requisitions into five broad occupations: technical/ professional, 
clerical, sales/services, production and operator occupations (these categories are 
not included in the regression analyses, which use job-level data).¹² The column 
percentages in Table 1 show that more than 80 percent of female-typed jobs are 
concentrated in clerical and sales/services occupations. In contrast, men are re- 
quested for professional and production occupations more than other occupations, 
although none of the occupational categories overwhelms other categories to the 
extent that clerical and sales occupations do for women. Even in these simple 
descriptive statistics, it is clear that employers classify jobs according to gender. 
Sex typing restricts women to a limited range of predominantly clerical/services 
jobs that pay lower wages and provide less training. 


2010 • Social Forces 88(5) 


The results also suggest that sex typing varies by firm characteristics, which can 
be associated with the level of offered wages and training. Among the jobs for 
men, requisitions issued by construction firms represent the largest proportion, 
while requisitions submitted by manufacturing and sales firms are most common 
among the requisitions targeting women. In terms of the proportion of women in 
the firm, the average is smaller among jobs for men than among those for women. 

The distribution of the contacted schools in the requisitions reflects the gender 
distribution among schools. More than half (53.6 percent) of the requisitions seek- 
ing men were sent to industrial high schools, which are dominated by male students. 
In contrast, almost half of the requisitions seeking women were sent either to the 
female-dominated commercial high school (30.4 percent) or to both general high 
schools and the commercial school (19.6 percent).¹³ Because type of high school 
education is reflective of a young person’s anticipated career choice, the larger number 
of female-typed requisitions sent to commercial schools may imply that employers’ sex typing 
simply reflects applicants’ educational self-selection, which may stem from parental 
influence or gender norms. Either way, it is clear that employers sex type jobs and 
the schools that supply the candidates. And to some extent, employers’ sex typing 
of jobs seems to be distinct from how they choose schools. For example, about 23 
percent of job requisitions filed by employers specified “either sex,” and more than 
30 percent of these requisitions were sent to male-dominated industrial schools. 

In the following regression analyses, I test the sex-typing effect on wages and 
training (hypotheses 1 and 2) after controlling for the job, firm and school char- 
acteristics that may be correlated with both employers’ sex-typing patterns and 
the amount of wages and training. By controlling for the types of schools and skill 
requirements that reflect a student's career-related choices before the hiring process 
starts, I am also able to assess the nature of male jobs and female jobs, limiting the 
influence of direct supply-side factors as much as possible. 


Wage Analyses 


Using logged wages as the dependent variable, Table 2 shows the gap in offered 
wages between the female- and male-typed jobs. In OLS models of cross-company 
comparisons using robust standard errors (models 1 and 2), there is a significant 
gender gap in wages. Without controlling for job, firm or school characteristics 
in Model 1, female-typed jobs offer wages about 7 percent lower than male-typed 
jobs.¹⁴ This gap, however, shrinks to about 3 percent when the controls are 
included in Model 2. This suggests that women’s jobs provide lower wages, but 
that when men’s and women’s jobs have similar characteristics the wage difference 
becomes smaller. Although the smaller gap in Model 2 is to be expected (because 
the gender wage gap is likely to be wider when men and women have different 
jobs), it is important to note that the gap is still significant. 
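
Because the dependent variable is the logged wage, a gap expressed in log points has to be converted before it can be read as a percentage. The sketch below uses the standard conversion, 100 × (exp(b) − 1); it is applied to the raw difference in mean logged wages from Table 1 and to a gap of 0.03 log points, used here only as an illustrative value, not a reported coefficient. The function name is hypothetical.

```python
import math


def log_gap_to_percent(log_gap):
    """Convert a difference in logged wages into a percentage difference."""
    return 100 * (math.exp(log_gap) - 1)


# Raw difference in mean logged wages from Table 1 (female minus male).
raw_gap = round(11.96 - 12.03, 2)

print(round(log_gap_to_percent(raw_gap), 1))
print(round(log_gap_to_percent(-0.03), 1))
```

For gaps this small the percentage is close to the log-point difference itself, which is why a difference of about 0.07 in logged wages corresponds to a gap of roughly 7 percent.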

This pattern of wage difference remains in the firm fixed-effects models. In 
these I compare differences in the job requisitions submitted by the same employer. 
In Model 3, the difference in offered wages is about 2 percent and is statistically 
significant. The coefficient of the female category is still negative when job charac- 
teristics and school types are controlled in Model 4, but the difference is reduced 
to about 1 percent and is not statistically significant. The small and insignificant 
coefficient of the gender variable (the female category) in Model 4 may seem to 
imply that when applicants’ educational choices are controlled, there is not much 
sex-typing effect on wages at the firm level. When I tested the difference between 
male jobs and non-male jobs (i.e., female jobs + either jobs), I found a significant 
wage difference of 2 percent, which implies that male typing increases offered 
wages. This result, again, needs to be tested by firm and job fixed-effects analysis, 
a more stringent test because it eliminates idiosyncratic variations in wages caused 
by unmeasured job characteristics (in addition to firm characteristics), leaving 
only the variation caused by sex types. 

Model 5 compares the wages offered to men and women when the same jobs 
are simultaneously advertised to both men and women in the same company. If an 
employer does not provide the same wages to women even when they are recruited 
for exactly the same job, it is strong evidence that “the gender of the job” creates 
a difference in offered wages. The significant within-job wage difference, about 3 
percent, shows that jobs are in fact gendered—there are “female jobs” and “male 
jobs” even when the jobs are identical. This can be interpreted as direct wage 
discrimination by employers. Employers’ sex-typing practice serves to restrict 
women’s access to higher wages at hire. Because only 113 cases were used in the 
firm and job fixed-effects model, the female and either categories (i.e., non-male) 
are combined and compared with the male, in order to check the consistency of 
the sex-typing effect. Model 6 confirms the effect by showing a significant wage 
difference of about 3 percent between male and non-male jobs. 

The very high R-squared scores of fixed-effects models might seem surprising. But 
it is understandable because the models compare wages of entry-level jobs recruiting 
applicants with highly similar human capital and offered by the same company. The 
firm fixed-effects (models 3 and 4) have R-squared scores of .93, which reflects that 
wages among entry-level jobs in the same company are very homogeneous, and after 
including job and school variables in the analysis, only 7 percent of the variation is 
left to be explained. Comparing the same jobs offered by the same company, models 
5 and 6 have even higher R-squared scores; this implies that when the same jobs are 
offered by the same company, almost no variation among wages remains. 

The 3 percent gap in Model 5 is the same as that found when within-company 
and within-job wage differences are measured in quite a different context by 
Petersen and Morgan (1995), using data from blue-collar and clerical employees 
in the U.S. labor market. The implication of their finding is ambiguous, how- 
ever, because the post-hire data used in their study cannot reveal who creates the 
difference and whether it is statistically significant. Using pre-hire data on jobs 
requisitioned by firms, Model 5 in Table 2 shows the effect of the employer’s action 
on wages. Female jobs are paid about 3 percent less than male jobs, and the 
difference is initiated by employers at the time of hire. 

Figure 2. Wage Difference between Male Jobs and Female Jobs 

Figure 2 summarizes the wage difference in male and female jobs across compa- 
nies (OLS), within the same company (firm fixed-effects), and within the same job 
in the same company (firm and job fixed-effects). The results reveal that employers 
exclude women from higher wages by sex typing jobs, as predicted in Hypothesis 
1. Although modest in magnitude, the significant wage gap between men and 
women within the same entry-level job in the same company shows the clear effect 
of sex typing on the amount of employee wages at the point of entrance to the 
labor market. This indicates direct wage discrimination against women, who begin 
their work life with the disadvantage of slightly lower wages than men. 


Training Analyses 


Training is specialized human capital that leads to higher wages; jobs that provide 
training are often promising career-track jobs (see Tam 1997; Tomaskovic-Devey 
and Skaggs 2002). The models in Table 3 use the logged length of training in days 
as the dependent variable. Models 1 and 2 present the results of OLS regression
using robust-cluster standard errors. The coefficients of female variables in these
models are negative, but given the large standard errors, it is hard to tell whether
women are offered less training than men. This implies that the gender disparity
in the raw amount of offered training shown in Table 1 results largely from
male-typed jobs that involve much longer training than other jobs (also see Figure 1).
The results in models 1 and 2 might also be complicated by heterogeneity in the
types of activities performed across firms. In order to control for the between-firm
heterogeneity, models 3 and 4 adopt the firm fixed-effects approach. In Model 3,
the significantly negative coefficient of the gender variable (female) shows that
within the same company, female-typed jobs provide less training than male-typed
jobs. Employers seek to hire women into jobs that provide less training than
the jobs reserved for men. This pattern remains in Model 4, which controls for
skill requirements and the high schools contacted by employers. This suggests
that women may be excluded from training even when no gender difference
exists in the type of human capital. Also notable in firm fixed-effects models
is that the coefficients in models 3 and 4 become very large (much larger than
in the OLS models), corresponding to differences in the length of the training
period of 53 percent (Model 3) and 55 percent (Model 4). Figure 3 presents the
increase of the gap between cross-company and within-company comparisons.
Within the same company, women are recruited for jobs that give less than half
of the training that male jobs provide.

Table 2: Regression Analysis of Logged Wages
OLS Firm and Job
a In this model, the male category (the baseline category) is compared with non-male, i.e.,
female + either.
b “Groups” are firms in models 3 and 4, and 217 firm-dummy variables are included in
these models. They are company-job combinations in models 5 and 6, and 54 firm-and-job
dummy variables are included in these models.
c I used the “areg” command in Stata in order to estimate fixed-effects regression
coefficients. The procedure includes dummy variables for each firm (models 3 and 4)
and for each firm and job (models 5 and 6). Accordingly, the reported R² is the proportion
of variation in the dependent variable explained by the model, including all the dummy
variables. The very high R² in this table implies that firms have consistent descriptions of
their requirements, and hence not much variation is left to explain once dummy variables for
each firm and job are included. An alternative procedure for estimating fixed-effects models
is the “xtreg, fe” command in Stata. This procedure fits a regression model using
mean-deviated variables (both dependent and independent variables). The resulting estimates of
coefficients and standard errors are the same as those from the “areg” procedure, but the fit
statistics are different because “xtreg, fe” uses only the variation in the dependent variable

Figure 3. Training Difference between Male Jobs and Female Jobs
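Because the dependent variable is logged, a coefficient on the gender dummy is converted to a percentage difference with 100(exp(b) − 1), the rule given in Note 14. The short sketch below applies that rule to the two firm fixed-effects training coefficients quoted in Appendix B (-.752 un-weighted, -.608 weighted); the function name `pct_difference` is introduced here for illustration only.

```python
import math


def pct_difference(coef):
    """Percentage difference implied by a dummy coefficient when the outcome is logged."""
    return 100.0 * (math.exp(coef) - 1.0)


# Coefficients quoted in Appendix B for the firm fixed-effects training model.
for label, coef in [("un-weighted", -0.752), ("weighted", -0.608)]:
    print(f"{label}: coefficient {coef} -> {pct_difference(coef):.1f} percent")
```

The un-weighted coefficient corresponds to a gap of roughly 53 percent, which is why a coefficient of this size means female-typed jobs offer less than half the training of male-typed jobs.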

The different results in OLS models and firm fixed-effects models may partly 
explain the lack of empirical consensus in previous research on the gender dif- 
ference in training. While qualitative studies on a limited number of jobs and 
companies suggest that women are excluded from training (e.g., Detman 1990; 
Milkman 1987), findings from quantitative cross-company comparisons have 
been unclear: one set of studies finds gender differences in training (Lynch 1992; 
Veum 1993) but other studies fail to find a difference (Jacobs et al. 1996; Knoke 
and Kalleberg 1994). In the cross-company comparisons, the variation across 
firms is difficult to control completely without fixed-effects models, and therefore,
the gender difference becomes clear only when the contrast between male jobs and
female jobs becomes more precise, as in the within-company comparison.

Table 3: Regression Analysis of Logged Training
OLS Firm and Job
a In this model, the male category (the baseline category) is compared with non-male,
i.e., female + either.
b “Groups” are companies in models 3 and 4, and 217 firm dummies are included in these
models. They are company-job combinations in models 5 and 6, and 54 firm-and-job
dummies are included in these models.
c See Note c in Table 2. Using the “xtreg, fe” procedure, I obtained the within-group R² of

In Model 5, when both firms and jobs are fixed, the estimated difference in 
training between job requisitions for men and women is nearly as large (55 per- 
cent) as it is in models 3 and 4 but no longer statistically significant—possibly 
because the effective sample size falls dramatically (from 536 cases in models 3 and 
4 to 113 cases in Model 5). The sex-typing effect is more evident in Model 6, when 
the male category is compared with non-male (female + either). Model 6 shows
that job requisitions that specifically demand men offer significantly more training
than those that do not. This result is similar to that of the wage models in Table 2.
The negative but non-significant coefficient for the gender variable in Model 5 can
be cautiously interpreted to show employers’ direct training exclusion of women.

Figure 4. Simulated Wage Trajectory for Japanese Men and Women
[Line plot of simulated wages by age, 20 to 55; legend: dashed line = Male, other line = Female]

Notes: My data show starting wages for men and women with high school education, but
the data do not include information about wage increases over their careers. In order to
simulate wage trajectories, I adopted coefficients of age and tenure from Kumlin's (2007)
analysis using the Japanese General Social Survey 2001. Kumlin (2007), however,
uses hourly wages as a dependent variable, while my data analyses are conducted on
monthly wages. In order to match the wage unit with Kumlin (2007), I re-ran Model 5 of
Table 2 (firm and job fixed-effects model of wages) using a proxy for hourly wages, i.e.,
monthly wages divided by 160, assuming that employees work 40 hours a week for 4
weeks in a month. From this analysis, I calculated initial hourly wages, which are 6.942
and 6.91 logged yen for men and women respectively; these can be converted to 1,035
and 1,002 yen (11 and 10.7 dollars using the 1995 yen/dollar exchange rate). Adopting
the age and tenure coefficients from Kumlin (2007:210), I simulated the wage trajectories
by using the equation $\ln(\text{wages}) = (\text{initial wages}) + \beta_1 \times \text{age} + \beta_2 \times \text{age}^2 + \beta_3 \times \text{tenure}$.
In Kumlin’s (2007) analysis, $\beta_1$ is .069, $\beta_2$ is -.064 and $\beta_3$ is .008 for men, and $\beta_1$ is .034,
$\beta_2$ is -.04 and $\beta_3$ is .012 for women. In simulating the wage trajectories, I set the initial
wages at age 20 since my data show wages of high school graduates, and increased
age and tenure by one unit thereafter. The simulated results are presented in real
Japanese yen.


Discussion and Conclusions 


Hypotheses 1 and 2, that male-typed jobs provide higher wages and longer train- 
ing than female-typed jobs, were supported by the results of fixed-effects models. 
That employers prefer women for some jobs and men for others but compensate 
and train them equally was a possible finding, but the data do not support it. 
Rather, they show that sex typing at the point of advertisement excludes women 
from desirable jobs that provide higher wages and longer training. Although the 
initial wage differential between female- and male-typed jobs is slight, the large 
training differential means that the female-male wage gap will increase over the 
course of workers’ lives. To predict future wage trajectories for Japanese men and 
women based on the starting wages in the data set, I used the coefficients for age 
and tenure reported in a recent study of wage determination in Japan (Kumlin 
2007). Figure 4 shows a difference in earnings across the careers of Japanese men 
and women of over 100 million yen (1 million U.S. dollars). If a woman worked
in the same company from age 20 to 55 (the standard retirement age in Japan), 
she would have earned around 138,516,590 yen while a man of the same age and 
tenure would have earned around 245,276,173 yen. 
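The simulation behind Figure 4 can be sketched from the figures given in its notes. The code below is an illustration, not the original computation, and it rests on assumptions that the notes do not state: age and tenure are both counted as years since age 20, the age-squared term is divided by 100, a working year is 160 hours times 12 months, and earnings are summed over ages 20 through 55 inclusive. The coefficient names in `PARAMS` map to the note's symbols as `b_age` = β1, `b_age2` = β2, `b_tenure` = β3.

```python
import math

HOURS_PER_YEAR = 160 * 12  # 160 hours a month (Figure 4 notes) times 12 months (assumed)

# Logged initial hourly wages and Kumlin (2007) coefficients as quoted in the Figure 4 notes.
PARAMS = {
    "men":   {"init": 6.942, "b_age": 0.069, "b_age2": -0.064, "b_tenure": 0.008},
    "women": {"init": 6.91,  "b_age": 0.034, "b_age2": -0.04,  "b_tenure": 0.012},
}


def hourly_wage(p, years):
    """Simulated hourly wage after `years` years.

    ASSUMPTION: age and tenure are both measured as years since age 20,
    and the age-squared coefficient applies to age squared divided by 100.
    """
    log_wage = (p["init"]
                + p["b_age"] * years
                + p["b_age2"] * years ** 2 / 100.0
                + p["b_tenure"] * years)
    return math.exp(log_wage)


def career_earnings(p, start_age=20, end_age=55):
    """Sum of annual earnings from start_age through end_age inclusive."""
    return sum(hourly_wage(p, age - start_age) * HOURS_PER_YEAR
               for age in range(start_age, end_age + 1))


totals = {group: career_earnings(p) for group, p in PARAMS.items()}
for group, total in totals.items():
    print(f"{group}: about {total / 1e6:.0f} million yen")
print(f"gap: about {(totals['men'] - totals['women']) / 1e6:.0f} million yen")
```

Under these assumptions the totals land close to the career earnings reported in the text for both men and women, which suggests the scaling is at least plausible; a different scaling of age or working hours would change the totals.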

This study holds a number of theoretical and empirical implications for re- 
search on sex segregation and the gender wage gap. First, existing studies of em- 
ployers’ hiring behaviors rely on audit methods or natural experiments limited to 
particular jobs (Goldin and Rouse 2000; Neumark 1996; Skuratowicz and Hunter 
2004). This article directly examines specific employer practices across the entire 
range of entry-level jobs in a labor market. Second, most studies have not been 
able to separate out the roles of different actors involved in the creation of job 
sex segregation and the gender wage gap. The data used here allow measurement 
of the influence of employer decisions before other factors enter in—to observe a 
more-or-less pure employer-side process—and to show how employer motives for 
preferring men over women for certain jobs lead to disparities in outcome. Third, 
the analyses in this research bear on the debate between England (1988), Tam 
(1997), and Tomaskovic-Devey and Skaggs (2002) concerning which actor is most 
responsible for the training gap between men and women. My data show that 
Japanese employers explicitly exclude women from applying to jobs with longer 
training. While this does not deny the possibility that women might voluntarily 
seek jobs with shorter training periods, employers’ exclusion of women at the start 
from the pool of applicants to certain jobs is a powerful demand-side mechanism. 

It can be argued that jobs for high school graduates may be more segregated 
than higher-skilled entry-level jobs for higher education graduates. Moreover,
recent greater enforcement of the Equal Employment Opportunity Law in Japan
should lessen the gender wage gap. But it is striking that the results of this research 
are quite consistent with studies of the U.S. labor market such as Petersen and 
Morgan's (1995). Moreover, studies suggest that the overall effect of Japan’s Equal 
Employment Opportunity Law has been ambiguous (Hanami 1998; Kumamoto- 
Healey 2005) and register suspicion that employers continue to sex type jobs, 
both intentionally and unintentionally. Also, the effect of the Equal Employment 
Opportunity Law on sex segregation in the United States has not been clearly 
shown (Kalev and Dobbin 2006). Although the data used here cannot be argued
to represent the entire labor market across time and space, they show clearly what 
employers in an entry-level market would like to do if not prevented by Equal 
Employment Opportunity Law. 

This study demonstrates a phenomenon that has long been assumed in many 
theories: employers’ sex typing of jobs. It offers direct evidence that male- and 
female-typed jobs have different characteristics, and that this is related to gender 
inequality in wages and training. The findings set the stage for further analysis of 
workplace processes that affect gender inequality beyond the point of hire. 


Notes 


1. In 1997 the Japanese government decided to enforce the Equal Employment 
Opportunity Law, which had been initially enacted in 1986. In the reinforced version, 
the government explicitly banned employers from requesting applicants based upon 
their gender (Yomiuri newspapers 1999). 


2. “Employers” in this study refers to the actors who are in charge of hiring new 
employees. For example, hiring agents could be the actors who act as the agents 
of owners. In larger organizations, it is likely a personnel office that creates job 
requisitions in consultation with the hiring departments. In this case, it could be 
that lower-level managers or co-workers are part of the hiring process. This shows that 
hiring is an organizational process in which the roles of diverse actors are conflated, 
and it warrants further research (Fernandez and Mors 2008). In the setting of this 
study, however, co-workers do not have much say at the point of hiring. Therefore,
“employers” refers to owners or hiring agents/managers of personnel offices.


3. Japan shows some distinctive characteristics such as a more gender-integrated 
manufacturing industry than other countries (Shirahase and Ishida 1994). 


4. This includes rōnin, or individuals who study informally after graduation for later
admission to their first-choice university (Ministry of Education and Sciences 1995). 


5. The data were collected as a part of a larger project on Japan’s youth labor market.
They represent the entire entry-level labor market for new high school graduates in the
jurisdiction of one of the two Public Employment Security Offices in Kawasaki, from
which Professor Mary C. Brinton of Harvard University obtained the requisitions.
The author is grateful to her for making these data available.


6. An alternative approach is to use the job opening (each slot of positions) as the unit of
analysis (see Appendix B). The results, however, do not change when this alternative
approach is taken.


7. When an employer submits multiple requisitions to recruit men and women for
the same position, it is possible that the employer is using the same job title but has 
slightly different jobs in mind. Given the large range of job titles available to Japanese 
employers, such discrepancies are likely to be minimal. 


8. In order to save space I do not show the results for control variables in the regression 
tables. It is worth noting that the models that include the control variables show 
the same pattern of the sex-typing effect as the models without controls. The point 
estimates of these control variables are available in Appendix C. 


9. Although there is no information about the schools to which employers send these
non-specified requisitions, listing no specific school can be an important signal by itself:
specialization in high school education does not make a difference for these jobs.


10. This amount excludes bonuses or other types of earnings. Many companies give a 
reduced bonus to new employees in their first year; further, information on bonuses 
was missing for a number of cases. 


11. The amount of offered wages and training, along with other benefits employers 
provide, are not negotiable in most cases, because employers want to find employees 
who want to take the job with the specifications the employers have outlined. 


12. Based on the descriptions about actual work on the requisitions, all advertised jobs 
were classified into three-digit job categories. The Japanese three-digit job classification 
has 364 job titles; job requisitions were spread across 146 of these (Statistics Bureau
of Japan, Japanese Standard Occupational Classification, revised in 1997).


13. In the district under study in Kawasaki there are 12 high schools, of which two 
are industrial vocational schools and one is a commercial vocational school. The 
remaining nine schools are general high schools. 


14. The wage differences between male jobs and female jobs are interpreted as the
percentage difference in the predicted wages when gender is female vs. when gender
is male according to the equation $100 \times (\exp(\beta) - 1)$ (Wooldridge 2006). The training
differences are also calculated in this fashion.


15. It is worth noting that skills attained in a short period of time do not have as strong 
an exclusive characteristic as skills that require a longer time to attain. In order to 
examine the difference, I conducted analyses using two binary dependent variables 
(more than 6 months of training = 1, and more than 1 year of training = 1). The logit 
analysis results show a significant gender disparity; men are more likely to be offered 
training longer than 6 months and longer than 1 year. However, fixed-effects models 
could not converge because the number of cases used in the analyses dropped too 
much; employers who submitted multiple requisitions, all of which provide less than 
6 months or 1 year of training, are not used in the fixed-effects models. In order to 
show a more representative result, I therefore used a continuous dependent variable.


Appendix B. Job Requisitions and Job Openings


In the requisitions, the number of new employees whom employers plan to hire for
specific jobs is listed (the size of jobs). This number indicates the number of job
openings (vacant positions) for the jobs described in the requisitions, and it varies
from 1 to 560 based on the labor demand of each employer who submitted job
requisitions in 1995. Since the size of jobs reflects the importance of the jobs in
the labor market, I conducted a set of analyses using frequency weights separately
from the analyses presented in this paper; I weighted each requisition by its number
of openings. The results of the weighted regression analyses (OLS and fixed-effects
models) confirm the patterns reported in this paper: in both the wage and training
analyses, the signs of sex-typing variables are in the same (negative) direction
(except for the OLS model in the training analysis, which is significant in neither
the weighted nor the un-weighted analyses). The difference, however, is that the
size of the standard errors in the weighted analyses is much smaller, ranging from
one tenth to one fortieth of the standard errors in the un-weighted analyses. Such
small standard errors result from the artificially inflated number of cases. Another
difference is the size of coefficients in the weighted analyses. The coefficients are
smaller in the weighted analyses, particularly in the fixed-effects models. (In the
OLS analyses, the coefficients are slightly larger.) For example, in the firm
fixed-effects model of training analysis, the coefficient for female-typing is -.752 when
un-weighted, but it is -.608 when weighted, and both of them are significant.
The smaller size of coefficients may result from the smaller variation of sex-typing
variables when weighted, but overall the findings are substantively consistent. I decided
to use the requisition as the unit of analysis because it is theoretically better
suited and empirically conservative. Theoretically, employers sex-type jobs rather
than each position for a given job; employers, for example, would female-type a
textile production job regardless of the number of job opening slots. Empirically,
the weighted analyses gain an artificially inflated statistical power, while the pattern
of coefficients remains the same as the pattern in the un-weighted analyses.
Therefore, un-weighted analyses are a statistically more conservative test. The
results of the weighted analyses are available on request.
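The point about artificially inflated statistical power can be illustrated with a toy regression. The sketch below uses made-up data (six hypothetical requisitions with ten openings each) and a hypothetical helper, `weighted_ols`; it is not the weighted analysis described above. Equal weights are chosen deliberately so that the coefficient stays fixed and only the standard error moves, which isolates the effect of counting each opening as a separate case.

```python
import math


def weighted_ols(x, y, w):
    """Simple regression of y on x with frequency weights w.

    Each observation counts as w[i] identical cases, so the case count
    used for the standard error is sum(w) rather than len(x).
    """
    n = sum(w)
    x_bar = sum(wi * xi for wi, xi in zip(w, x)) / n
    y_bar = sum(wi * yi for wi, yi in zip(w, y)) / n
    sxx = sum(wi * (xi - x_bar) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - x_bar) * (yi - y_bar) for wi, xi, yi in zip(w, x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    ssr = sum(wi * (yi - intercept - slope * xi) ** 2
              for wi, xi, yi in zip(w, x, y))
    se = math.sqrt(ssr / (n - 2) / sxx)
    return slope, se


# Hypothetical requisitions: female-typed dummy, logged training days, openings.
female = [0, 0, 0, 1, 1, 1]
log_training = [4.1, 3.6, 4.4, 3.2, 3.9, 3.0]
openings = [10, 10, 10, 10, 10, 10]

slope_u, se_u = weighted_ols(female, log_training, [1] * len(female))
slope_w, se_w = weighted_ols(female, log_training, openings)
print(f"un-weighted: slope={slope_u:.3f}, se={se_u:.3f}")
print(f"weighted:    slope={slope_w:.3f}, se={se_w:.3f}")
```

With unequal numbers of openings the coefficient would generally shift as well, as it does in the weighted results described in this appendix.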
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This study revisits the Wisconsin model of status attainment from a life course de- 
velopmental perspective. Fixed-effects regression analyses lend strong support to the 
Wisconsin framework’s core proposition that academic performance and significant 
others’ influence shape educational expectations. However, investigating the process 
of expectation formation back to the elementary grades yields insights not evident 
when analyses are limited to the high school years: (1) many youth consistently expect
to attend college from as early as fourth grade; (2) the expectations of middle- and
low-SES youth are less stable, and across years the preponderance of their exposure to
socialization influences militates against sustained college ambitions; (3) long-term
stable expectations are more efficacious in forecasting college enrollment than are 
changing, volatile expectations. As anticipated in the Wisconsin framework, fam- 
ily- and school-based socialization processes indeed contribute to social reproduction 
through children’s educational expectations, but the process starts much earlier and 
includes dynamics outside the scope of the original status attainment studies. 


In a series of influential articles published in the late 1960s and 1970s, William 
Sewell and his colleagues at the University of Wisconsin introduced an empirical 
model depicting the processes by which family background (dis)advantages are 
perpetuated across generations. The “Wisconsin model of status attainment,” as it 
came to be known, was heavily social-psychological in orientation, invoking fam- 
ily- and school-based socialization processes as the principal mechanisms linking 
social origins with status positions in adulthood. Using the tools of path analysis,
Sewell and his colleagues showed that through relationships with teachers, family 
members and friends, and self-reflexively in reaction to their own record of school 
performance, young people form expectations regarding their future schooling 
and career prospects. These expectations, in turn, propel youth toward stratified 
educational and occupational destinations.

This new approach to the study of intergenerational mobility dominated empirical
stratification research for nearly 15 years, but, as inevitably happens, attention
eventually shifted to other concerns and other approaches. This research
returns to the Wisconsin framework’s core concerns, but the empirical approach
is different: where the Wisconsin studies evaluated socialization processes
cross-sectionally, we do so dynamically; where the Wisconsin studies assumed
homogeneity of experience across social lines, we explore sources of heterogeneity. Our
results offer strong support for the propositions that young people respond to 
parental influence and academic feedback in forming educational expectations, 
and that these expectations, in turn, help determine educational attainment. The 
Wisconsin approach in those particulars is sustained. However, we also find that 
the developmental processes at play are both longer-term and more nuanced than 
recognized in the Wisconsin studies. 


The Wisconsin Status-Attainment Framework 


The fundamental question driving the Wisconsin model was: Why do higher-status 
youth attain higher levels of education, and consequently, higher status jobs, than 
their less privileged peers? For Sewell and his colleagues, the answer resides in young 
people’s plans for the future, as captured in expressed educational expectations and 
occupational aspirations. Building on ideas from reference group and self-identity 
theories, the Wisconsin model centers on family- and school-based socialization 
processes as the principal mechanisms linking social origins with status destinations. 
Through the accumulation of feedback at home and at school involving relationships
with teachers, family members and friends (rendered as sources of “significant others’
influence”) and their own record of school performance, youth gradually acquire
a sense of their future educational prospects. In turn, these expectations regarding
future schooling propel youth toward different postsecondary destinations: expectations
as “motors to behavior” (Woelfel and Haller 1971).

Although the socialization processes depicted in the Wisconsin framework 
themselves are universal, owing to differences in life circumstance they result in 
sharply differentiated attainment orientations across social lines. The home and
school environments experienced by affluent youth encourage an orientation to 
college—they do well in school and receive ample support from within their social 
networks. For low SES and disadvantaged minority youth, the corresponding 
experiences less consistently point toward college, and so hold down expectations. 
These are tendencies, not certainties, but the tendencies align, and the cumula- 
tive effect is that expectations for the future tend to follow social lines. Indeed, 
educational expectations emerge in this literature as the primary vehicle driving 
intergenerational status transmission, surpassing even measures of cognitive ability 
in that regard (Jencks et al. 1972). 

This was a watershed moment in empirical stratification research, as until the 
advent of status attainment modeling the mechanisms by which social status 
was transmitted across generations had been simply a speculative proposition in 
sociology (Haller and Portes 1973). But as Morgan explains (2005), progress un- 
bundling the role of expectations in the status attainment framework stalled when 
attention shifted toward structural constraints originating in school organization
and labor market rigidities and away from “elucidating the micro-mechanisms
with which prior status socialization models were appropriately concerned.” 

It has now been almost 40 years since Sewell and his colleagues launched their 
agenda, and though the highly stylized path models they introduced are no longer 
much in evidence, the issues their work addressed have currency still; many studies, 
for example, explore interpersonal influence on students’ expectations (Hao and 
Bonstead-Bruns 1998; Goyette and Xie 1999) or the pattern of college attendance 
in relation to students’ expectations (Morgan 2004; Reynolds et al. 2006). This more 
recent work, however, rarely situates itself in the Wisconsin tradition of modeling 
relationships among status origins, status expectations and status destinations as 
an integrated whole. The present article returns to the original Wisconsin agenda, 
but recasts the framework along life course developmental lines in order to further 
elucidate the micro-mechanisms that transmit social status across generations. 


Expectation Formation: Bringing in the Developmental Dynamic 


The Wisconsin longitudinal study interviewed a sample of high school seniors 
in 1957 and re-interviewed them seven years later, securing reports of their edu- 
cational attainment and early job experiences. Panel survey data were rare at the 
time, and the project’s identification of mediating mechanisms linking family 
background to educational attainment was groundbreaking: “... the place of 
status attainment research in the study of social stratification lies in the effort to 
specify the causal sequence through which individuals reach their positions in status 
hierarchies.” (Haller and Portes 1973:55, emphasis added) 

In hindsight, it is clear that the data and research design of the original 
Wisconsin studies were not well suited to the task. At the core of the status attain- 
ment framework is the idea that young people adjust their thinking in relation 
to status-differentiated experiences at home and at school: “attitudes— including 
levels of aspiration—are formed and altered through two basic mechanisms: inter- 
personal influence, including reflexive adjustment of others’ expectations, and 
including self-reflexion.” (Haller and Portes 1973:55, emphasis added) 

Such propositions are inherently developmental, but the Wisconsin project 
lacked appropriate developmental data. Responses to the senior year high school 
survey were used to predict levels of schooling and occupational placements seven 
years later, so the time-ordering and underlying imagery are well-aligned. However, 
the model’s interior socialization mechanisms were measured cross-sectionally, 
with “expectations” and “significant others influence student” reported in the 
senior year survey, and academic performance operationalized concurrently as 
class rank. This is limiting, as a cross-sectional assessment shows how expectations 
stand in relation to other factors, not how they emerge, effectively hiding from 
view the evolution of young people’s thinking in relation to the processes thought 
to shape their thinking (i.e., feedback received over time on the quality of their 
school performance and the support received from others in their social network).

The fact that these measures correlate highly in 12th grade is no assurance
that 12th grade is the relevant arena. It is a classic endogeneity problem. If, for
example, expectations for college crystallize earlier, during the elementary or 
middle school years, then parents’ expectations from the senior year of high 
school cannot possibly be the determining force. Influence could flow in the 
opposite direction, an uncertainty acknowledged at the time: “Such ‘circles of 
causality’ are particularly difficult to portray on the basis of survey data.... It 
remains for future research, based on different data collection designs, to clarify 
these relationships.” (Haller and Portes 1973:67) 

Unfortunately, the “future research” anticipated by Haller and Portes suffers 
from much the same limitation: coverage of socialization mechanisms typically is 
short term, and so gives a truncated view of the expectation formation process by 
focusing on near-term influences. But a life course perspective instructs us that 
human development is a lifelong process (Elder, Johnson and Crosnoe 2003). 
Experiences in childhood set the stage for later patterns of behavior, attitudes 
and the like, but individuals also continue to respond to changing contexts and 
circumstances as they age, such that the timeline of the life course is characterized by
both stability and change across different stages of life. The implications of this
for our present interests are captured well by Emirbayer and Mische (1998:962):


“Actors are always living simultaneously in the past, future
and present, and adjusting the various temporalities of their
empirical existence to one another.... They continuously
engage patterns and repertoires from the past, project
hypothetical pathways forward in time, and adjust their
actions to the exigencies of emerging situations.”


In the tacit self-reflection that informs their intentions, young people certainly 
look around at their present circumstances, but also back to where they have been. 
Research on how status expectations are formed should do the same. 


Research Objectives 
Revisit the Wisconsin Framework’s Causal Propositions 


We first assess how well the Wisconsin model's core propositions regarding the devel- 
opment of educational expectations fare when evaluated longitudinally. Using survey 
and school record data starting in fourth grade and continuing through 11" grade, 
we test whether changes in youth's thinking about college respond to co-occurring 
changes in their school performance and the support received from parents. The 
substantive embrace is much like that of the Wisconsin framework, but influence 
processes are evaluated dynamically rather than cross-sectionally. We predict, as did
the Wisconsin researchers, that youth’s expectations regarding college are shaped 
over time by support from their parents and their performance in school. 

Inspect for Heterogeneity in the Process of Educational Expectation Formation


In her depiction of the transition to adulthood, Buchmann (1989) points out 
that modal life course patterns are clearer for some youth than for others. 
When the experiences of others in one’s immediate social group and broader 
realms of contact are homogeneous, success criteria and the understood routes 
to success are clear. Heterogeneous experiences and role models, on the other 
hand, yield less clearly defined paths, with outcomes, consequently, less cer-
tain. The long-term patterning of role models and contextual cues is not a topic
generally found in the status attainment literature, but it is integral to
young people’s emergent sense of their place in the world. 

Differences in life experience are shaped by one’s location in the social 
structure and implicated in intergenerational stratifying processes. Consider 
two scenarios that span the socio-economic extremes. At the privileged end are 
youth who have been primed for college all their lives: they attend the best schools 
and are placed in the most advanced courses; their peer networks mainly consist of 
youth like themselves who also want to attend college, as well as those who have 
gone before them and succeeded; and their parents’ support, both financial and 
socio-emotional, is never in question. 

The situation of disadvantaged youth is more complicated. For many who
grow up in an economically distressed neighborhood, attend poorly resourced and
undemanding schools, are tracked away from academically challenging courses,
and see few or no older friends and siblings rise to the top professionally through
academic accomplishment, it’s hard to see a basis for believing they are college material. Yet
often there are countervailing forces at play: a single mother struggling financially 
may nevertheless encourage her son to try college; a caring teacher may see a 
“diamond in the rough;” and many children burdened by a “high risk” demo- 
graphic profile nevertheless find themselves welcomed into academic programs 
for the gifted and talented, while at a more distal level, there is the widespread 
U.S. cultural ethos that encourages college attendance (Rosenbaum 2001) and 
an optimism, shared by many poor and minority youth (MacLeod 1995), about 
prospects for upward mobility through education. Thus, many disadvantaged 
youth receive conflicting signals about their educational prospects. 

Informed by Buchmann’s ideas about modal life pathways, we predict that 
social origins and pro-college contextual influences will be more tightly aligned 
at the upper end of the socio-economic spectrum than at the lower end and 
middle. At the upper end, significant others and relevant academic markers will 
more consistently send the message “you are college material,” and the expecta- 
tion to attend college we anticipate will stabilize at a younger age. For others, 
the signals and resources will be less consistent; plans for college, accordingly, 
will be less stable. 
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Revisit the Relationship Between Educational Expectations and Destinations 


Our third objective is to assess whether the efficacy of expectations as a “motor
to behavior” is attenuated under some circumstances. The Wisconsin framework
assumes that expectations propel youth toward different destinations in adulthood, 
but plans for college might not have the same degree of motivation if they are 
arrived at via different routes (McClelland 1990), if some plans are less securely 
held than others (Alexander and Cook 1979), or if some are less firmly grounded 
in relevant externalities and accurate information (Morgan 2005). 

In some research (e.g., Alexander and Cook 1979), long-held college expec- 
tations have greater predictive value than plans of shorter duration, suggesting 
that expectations more authentically embedded in young people's life experi- 
ences are more useful as guides to action than those which merely mimic the
“college for all” ethos (Rosenbaum 2001). To test this idea, we assess whether
long-held plans are a sturdier platform for their actualization than are plans of
short duration.


Methods 
Data and Project Design 


The data we use are from the Beginning School Study, which has been monitoring
the life progress of a cohort of Baltimore school children since fall 1982, when a
stratified random sample of 790 rising 1st graders was selected for participation.
First, 20 Baltimore city public schools were selected at random within strata 
based on school racial composition and socio-economic status. Next, children 
were sampled at random from first grade classrooms. Parents of all but 3 percent 
of the selected students consented to have their children participate in the study 
(see Entwisle, Alexander and Olson 1997, for details). Study participants were 
followed throughout their elementary, middle and secondary school careers and 
into young adulthood. Baseline characteristics about the students and the schools 
they attended are shown in Table 1. 

The original sample was about evenly divided by race (45% white, 55%
black) and representative of all socio-economic status levels in the Baltimore
City public school system (BCPS). At baseline, 51 percent were female, and parents’
education averaged just under 12 years. Reflecting the disadvantaged makeup
of the BCPS enrollment, two-thirds of the panel received free or reduced price
meals during the primary grades. In 1982 when the panel was in 1st grade, the
BCPS school enrollment was 76 percent black and 66 percent low income. The
Baltimore context is not unlike that of other deindustrializing, high poverty cities,
bounded by the particulars of its time and place. Yet the project’s representative
random sampling is uncommon and captures the experience of typical urban
youth in a typical urban setting, for whom ambitious expectations seem least
likely to be fulfilled.

Table 1: Baseline Characteristics of the Beginning School Study
Sample Members and their Schools

Sample Member Characteristics
  Sex
    Female                                                      .51
    Male                                                        .49
  Race
    Black                                                       .55
    White                                                       .45
  Receiving Free or Reduced Price Lunch
    Yes                                                         .67
    No                                                          .33
  Average Years of Parent's Education                          11.9

School Characteristics
  Average School Neighborhood Family Income (2009 dollars)   36,075
  School Neighborhood Poverty
    Below Poverty Line                                          .16
    Above Poverty Line                                          .84
N                                                               790
Note. All characteristics are measured in Year 1 of the study except for Meal Subsidy
receipt, which is measured in Year 3.


Measures 


Educational Expectations regarding college attendance are used from BSS years 4, 
6, 8, 9, 10 and 11.4 

In the spring of each of these school years, sample members were asked the 
following question: “Considering your situation, how far do you think you actu- 
ally will go in school?”* Response options most years were: Less than High School, 
Finish High School, Finish College, Greater than College. Options for the GED 
and Business/Vocational Training were included some years, and in years 9, 10 
and 11 two “Some College” options were added: Less than Two Years of College,
Two or More Years of College. Through year 9, students were interviewed in the 
fall and spring. In years 10 and 11, interviews were conducted in the spring only. 
Data from years 12 and 13 are not used. 

The analyses give precedence to spring responses when available.° Responses 
were dichotomized to distinguish high educational expectations coded 1 (some 
college or greater) and low educational expectations coded 0 (less than col- 
lege).’ In fixed-effects regressions these six measures are used to construct a 
time-varying measure of college-orientation from years 4 through 11. In logistic 
regressions predicting college attendance, they are used to construct the personal 
record of expectation stability and change over this period (e.g., Stable High, 
Rising, Volatile High, etc.). These pattern constructions are described when
they are introduced.

Social Origins. Family socio-economic status is our primary focus, with race (black
= 1; white = 0), sex (female = 1; male = 0), and family structure included as controls.
Family SES is constructed from five variables measured during the elementary years:
from school records, meal subsidy status, a proxy for low family income; and from parent
surveys, mother’s and father’s education and mother’s and father’s occupational
status in the SEI metric (Featherman and Stevens 1982). The five were standardized,
and the standardized measures averaged. The composite is the average of however
many of the five measures were available: five for 49 percent of the panel; one for
just 5.3 percent (the average was 2.5). For some analyses, the continuous measure 
is collapsed into three categories: low SES, middle SES and high SES. Category 
boundaries were drawn so the mean education level of both mothers and fathers was 
around 10 years for the low SES group, 12 years for the middle SES group, and 15 
for the high SES group. Approximately half the sample is in the low category and 
one fourth in each of the other two. Although SES can vary over time, the available 
measures require that we treat family SES as time-invariant in this analysis. 
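
The composite construction described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the procedure used with the BSS files: the measure names and toy values are hypothetical, the meal subsidy indicator is assumed to be reverse-coded so that higher values mean higher SES, and the population standard deviation is an arbitrary choice for the z-scores.

```python
from statistics import mean, pstdev

def standardize(values):
    """Z-score a list that may contain None; None entries stay None."""
    observed = [v for v in values if v is not None]
    mu, sd = mean(observed), pstdev(observed)
    return [None if v is None else (v - mu) / sd for v in values]

def ses_composite(measures):
    """Average however many standardized measures each case has.

    `measures` maps a (hypothetical) measure name to one value per case.
    """
    z = {name: standardize(vals) for name, vals in measures.items()}
    n_cases = len(next(iter(z.values())))
    composite = []
    for i in range(n_cases):
        available = [z[name][i] for name in z if z[name][i] is not None]
        composite.append(mean(available) if available else None)
    return composite

# Toy data: four cases, three measures, some values missing (None).
toy = {
    "mother_educ": [10, 12, 16, None],
    "father_educ": [9, 12, None, 14],
    "no_meal_subsidy": [0, 1, 1, 1],  # assumed reverse-coded
}
scores = ses_composite(toy)
```

A case with only two observed measures still receives a score, which is the sense in which the composite averages "however many" measures were available.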

Family Structure is a time-varying binary variable obtained from student inter-
views. For each year, the variable is coded 1 if at the time of the survey the student
was living in a two-parent household (including stepparents) with their mother
and father, and 0 if they were not.

Academic Feedback is measured through three indicators: academic performance, 
high track placement and grade retention. Each is measured cross-sectionally on 
six occasions, then pooled to form three time-varying measures, with students 
contributing six observations. The cross-sectional versions are used in descriptive 
analyses; the time-varying versions in fixed-effects analyses. 

Academic performance is constructed from achievement test scores and report 
card marks. As part of the BCPS regular testing program, in years 4 and 6 stu- 
dents were administered the California Achievement Test in the fall and spring; 
in year 8 testing was done only in the spring. The BCPS discontinued use of the 
CAT after year 8, but in year 9 the BSS administered its own Math Concepts and 
Applications and Reading Comprehension subtests, achieving 75 percent sample 
coverage. CAT scale scores on those two subtests were standardized each year to 
have a mean of 0 and a standard deviation of 1. 

Grades in English, math, reading and science were taken from BCPS tran- 
scripts or, for 271 students who left the BCPS at some point, from transcripts 
obtained from the schools to which they transferred. All marking systems were 
scaled to a common zero through 13 metric and then standardized within years 
to a mean of zero and a standard deviation of 1. Yearly measures of academic 
performance were constructed by averaging test scores and grades. In years when 
only one source was available, it was used alone. In years 10 and 11, academic 
performance is based solely on grades. For some parts of the analysis, academic 
performance is dichotomized at the median so as to distinguish low scores (below
the median) from high (at the median and above).
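
The yearly averaging and the median split can be illustrated as follows. The sketch assumes the test scores and grades have already been standardized within year; the function names and toy values are hypothetical.

```python
from statistics import mean, median

def yearly_performance(test_z, grade_z):
    """Average the two standardized sources; use one alone if the other is missing."""
    available = [v for v in (test_z, grade_z) if v is not None]
    return mean(available) if available else None

def median_split(scores):
    """Code 1 for scores at or above the median, 0 for scores below it."""
    cut = median([s for s in scores if s is not None])
    return [None if s is None else int(s >= cut) for s in scores]

# Toy (test score, grade) pairs for four students in one year.
pairs = [(0.5, 0.1), (None, -0.4), (-1.0, None), (1.2, 0.8)]
performance = [yearly_performance(t, g) for t, g in pairs]
flags = median_split(performance)
```

Students with only one source keep that source as their score, matching the rule that a single available measure is used alone.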

High track placement is a dummy variable based on course information taken
from student transcripts and coded 1 if the student was enrolled in any high level
academic course during the corresponding school year or else 0. As students were
not tracked by course level in elementary school, all students were coded 0 in
year 4. In middle school and high school, the BCPS distinguishes “Honors” and
“Advanced Academic” classes from regular and remedial classes. For years 6-11, a
code of 1 was used if the transcript showed any advanced math, science, English or
foreign language course. For the years 9-11, Latin and calculus also were coded 1.

Grade retention dummy variables for each year were constructed from student 
transcripts, coded 1 if the student had been held back a year in school prior to 
the reference year or else 0. When school records were missing or incomplete, 
parent and child sources from several high school years were used, selectively, after 
checking for inconsistencies.* 

Parental Support for college is measured with dummy variables coded 1 if the 
child was expected to attend college, 0 if not. Parents’ expectations were elicited 
in the fall of the year, always before the end of the first marking period. Students’ 
expectations, as mentioned, usually are from spring interviews, so in most cases 
parental expectations predate students’ by approximately six months. Parental 
support is measured cross-sectionally each year; a time-varying measure pools 
the six observations. 

Usually the parent respondent was the mother or maternal caretaker (roughly 
62% across survey years), but sometimes fathers or paternal caretakers were in- 
terviewed and occasionally other relatives. To account for how this might influ- 
ence parental expectations, dummy variables indicating the interviewed parent's 
relationship to the child were constructed for each wave and are controlled in 
the multivariate analysis: mother (used as the reference category), father, other 
guardian, parent not interviewed. 

College Attendance is a categorical indicator of highest level college enroll- 
ment through December 1999. We distinguish three attainment statuses: none 
(47%); enrollment in a two-year school (30.3%); enrollment in a four-year
school (22.7%).


Missing Data 


Although panel coverage is nearly 80 percent, the cumulative effect of missing data 
is substantial—e.g., excluding cases missing any of the six expectation measures 
loses exactly half the panel (n = 395). As the degree of missing information across
measures and years is nontrivial, we use the ice (imputation by chained equations)
multiple imputation scheme available in Stata to impute missing values. Using
dummy variable indicators, multivariate analyses control for whether the data 
point for a given case was observed or imputed. Appendix A reports the means 
and the proportion imputed for each year. 

Table 2: Percent Expecting College by Socioeconomic Status, Academic Feedback
and Parental Support

                                        Year in School
                           Year 4  Year 6  Year 8  Year 9  Year 10  Year 11
Full Sample                 80.5    80.7    71.2    65.9    62.3     65.4

Social Origins
  Socio-economic Status
    Low                     79.0    76.7    58.0    52      46.5     52.1
    Middle                  78.3    80.1    78.8    73.9    65.7     70.2
    High                    86.3    90.0    91.2    86.8    89.2     88.5

Academic Feedback
  Academic Performance
    Low                     77.0    74.3    58.3    54.6    50.7     50.8
    High                    84.2    89.2    83.8    [?]     [?]      79.9
  High Track Placement
    No                      80.5    82.2    62.6    59.3    [?]      59.4
    Yes                     --      78.6    82.4    90.0    86.1     89.6
  Cumulative Retention
    Not Retained            82.8    84.4    78.9    76.0    77.3     81.5
    Retained                76.2    75.0    61.9    54.6    49.1     [?]

Parental Support
  Parent Expects College
    No                      12.4    12.3    49.9    48.0    38.3     38.5
    Yes                     87.5    88.8    89.0    87.5    81.2     83.0
N                           790

Results


Determinants of Educational Expectations in the Wisconsin Tradition 


Table 2 shows the percentage college-oriented across categories of family SES, 
academic feedback and parental support. The comparisons are approximately 
cross-sectional within years because students’ expectations are mainly from the
spring of the school year, while the other measures either overlap the entire 
academic year (Academic Performance, High Track Placements) or are from the 
fall of the year (Parent Expectation data). With that orientation, the layout of 
Table 2 is straightforward. 

When asked about their college plans late in year 11, 65.4 percent of the sample 
anticipated attending college. This is far below recent national samples—86.6 
percent of high school seniors in 2004 expected to attain at least some college 
(Ingels, Planty and Bozick 2005)—but with the BSS panel comprised mainly of 
low income and minority youth attending public schools in a high poverty urban 
area, it is hardly surprising that their expectations lag behind the national average. 

In other respects, the patterning of expectations in Table 2 mirrors a host of 
studies targeted at the upper grades. Despite the panel’s low SES skew, there is 
a sharp socio-economic divide, with 88.5 percent of high SES youth college- 
oriented in year 11 as against barely half the low SES group (who constitute
approximately half the sample). Academic contrasts also are large, with differences of
20-30 percentage points across the extremes. But the largest differences in Table 2 
revolve around parental support for college, with more than 80 percent of students 
in year 11 expecting to attend college when their parents also expect that vs. just 
38.5 percent in the absence of parental support. 

From Table 2 we see that differences in expectation are considerably smaller in 
years 4 and 6, revealing that college expectations become more stratified over time. 
High SES youth’s expectations hover in the vicinity of 90 percent, while those of 
lower SES youth fall, and the expectation decline is especially precipitous among
the lowest: the share of college-oriented youth drops from 79 percent in year 4 to 52.1
percent in year 11, such that an initial difference of about 7 percentage points
between high and low SES youth in year 4 explodes to more than 35 points.
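
The arithmetic behind these gaps can be checked directly from the Table 2 entries for years 4 and 11. The snippet below is only a worked illustration; the dictionary layout and function name are hypothetical conveniences.

```python
# Percent expecting college, taken from Table 2 (years 4 and 11 only).
pct_expecting = {
    "low": {4: 79.0, 11: 52.1},
    "high": {4: 86.3, 11: 88.5},
}

def ses_gap(year):
    """High-minus-low SES gap, in percentage points, for a given year."""
    return round(pct_expecting["high"][year] - pct_expecting["low"][year], 1)

gap_year4 = ses_gap(4)    # gap at the start of the window
gap_year11 = ses_gap(11)  # gap at the end of the window

# Decline among low SES youth between year 4 and year 11.
low_decline = round(pct_expecting["low"][4] - pct_expecting["low"][11], 1)
```

The gap widens almost entirely because low SES expectations fall, while the high SES figure moves only slightly.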

Family SES level, as measured, is fixed, so its time-trend unambiguously signals 
changes in low SES children’s thinking about their educational futures, something 
that in Table 2 most often happens toward the end of middle school in anticipa- 
tion of high school.'! The academic and parental support measures, on the other 
hand, are time-referenced, and so potentially also changing. We find that parental 
and student expectations are more closely aligned in the upper grades than in the 
lower, but in these aggregated cross-sectional comparisons it cannot be said wheth- 
er it is the students whose thinking has changed or the parents. The Wisconsin 
framework accords parents precedence in influencing their children, but parents 
who were optimistic about their children’s academic prospects could become less 
so in light of mounting academic difficulties, in which case the closer alignment 
evident in the upper grades would reflect changes in their thinking rather than 
their children’s. Likewise, while it is plausible that older students would be more 
attentive than younger to their academic standing when formulating a plan for 
the future, it is equally plausible that those with little interest in college let their 
grades slip, in which case measures of academic performance would shift toward 
expectations, rather than the reverse—such “circles of causality,” it will be recalled, 
are hard to disentangle.’ 

Table 2 thus gives the timeline, but to achieve clarity regarding the underlying 
dynamic requires a different approach: are changes in plans in fact responsive to 
changes in academic feedback and in parental supports, as the Wisconsin model 
presumes? This is addressed in Table 3, which reports results from a set of fixed- 
effects logit models predicting educational expectations starting in year 4 and 
continuing through year 11 as a function of time-varying academic feedback and 
parental support. Fixed-effects models use a within-student estimator whereby the 
data are transformed into deviations from individuals’ means, yielding unbiased 
and consistent estimates. In deviating from individual means, fixed effects models 
remove potential confounds involving time invariant student-level characteristics 
(race, sex, genetic make-up, etc.) and time invariant period-level and contextual 
characteristics (economic climate, structure of the local school system, etc.). To 

Table 3: Odds Ratios from a Fixed-Effects Analysis of Changing
Educational Expectations

                                          Model 1    Model 2    Model 3
Academic Feedback
  Academic Performance                    1.65[?]    1.62[?]    1.28
    Missing                               1.07       1.08       1.20
    Not Enrolled                          [?]        1.36       1.30
  High Track Placement                    1.28       1.26       [?]
    Missing                               [?]        [?]        [?]
  Cumulative Retention                    .91        .97        1.28
    Missing                               [?]        .89        .90
Parental Support
  Parent Expects College                  1.56[?]    1.66*      .99
  Parent Reporting Expectation
    Mother (reference)                    1.00       1.00       1.00
    Father                                1.08       1.14       1.13
    Other Family Member                   [?]        [?]        [?]
    Missing Expectation                   .84        .87        .93
Family Structure
  Lives with Mother and Father            .99        .98        .91
    Missing                               .64        .69        .68
Year                                      .80**      .84**      .80**
Interactions
  SES * Academic Performance              --         .97        --
  SES * High Track Placement              --         1.32       --
  SES * Retention                         --         .93        --
  SES * Parent Expects College            --         1.11       --
  SES * Lives with Mother and Father      --         [?]        --
  SES * Year                              --         1.23**     --
  Year * Academic Performance             --         --         1.07
  Year * High Track Placement             --         --         1.20
  Year * Retention                        --         --         .93
  Year * Lives with Mother and Father     --         --         1.01
  Year * Parent Expects College           --         --         1.11*
N                                         478        478        478

*p < .05  **p < .01

capture the time trend apart from other considerations, the regressions control for
“year in school” using a continuous term specifying (year - 4).
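
The deviation-from-means idea can be illustrated with a small sketch. It uses a linear setting for clarity and is not the logit estimation reported in Table 3; the function name, column names and toy rows are all hypothetical. The point is only that any characteristic that does not vary within a student becomes zero after the transformation and so cannot confound the within-student comparison.

```python
from collections import defaultdict
from statistics import mean

def within_transform(rows, key, columns):
    """Subtract each student's own mean from every listed column."""
    by_student = defaultdict(list)
    for row in rows:
        by_student[row[key]].append(row)
    demeaned = []
    for student, obs in by_student.items():
        means = {c: mean([r[c] for r in obs]) for c in columns}
        for r in obs:
            demeaned.append({key: student, **{c: r[c] - means[c] for c in columns}})
    return demeaned

# Toy panel: two students observed twice; race is time-invariant.
rows = [
    {"student": "A", "performance": 0.0, "race": 1},
    {"student": "A", "performance": 1.0, "race": 1},
    {"student": "B", "performance": 2.0, "race": 0},
    {"student": "B", "performance": 2.0, "race": 0},
]
result = within_transform(rows, "student", ["performance", "race"])
```

Student B, whose performance never changes, contributes only zeros, which parallels the way students with invariant records carry no information in a within-student analysis.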

Results for Model 1 align with the descriptive trends evident in Table 2, 
as well as much previous research. First, net of other considerations, positive 
academic feedback in the form of good grades and high test scores encourages 
an orientation toward college. Specifically, a standard deviation rise in academic 
performance increases the odds of holding a college expectation by 65 percent. 
Like results are obtained for the measure of parental support. Net of other 
considerations, including possible changes in the parent completing the survey, 
a change in parents’ expectation from non-college to college is associated with a 
56 percent increase in their children’s odds of expecting to attend college. These 
are familiar findings, but here assessed over the period of time when youths’ 
ideas about their future are under construction and with time-invariant poten-
tial confounds controlled. Hence, the proposition that academic feedback and
parental support induce changes in youths’ thinking, a core tenet of the status
attainment framework, receives strong support in these results.
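
For readers less used to odds ratios, the conversion used in this paragraph is shown below. The odds ratios 1.65 and 1.56 come from the text; the baseline probability of .50 is a purely hypothetical value chosen to show what a given odds ratio implies on the probability scale.

```python
def pct_change_in_odds(odds_ratio):
    """Percent change in the odds implied by an odds ratio."""
    return (odds_ratio - 1.0) * 100.0

def shift_probability(p, odds_ratio):
    """Probability after multiplying the odds p / (1 - p) by odds_ratio."""
    odds = p / (1.0 - p) * odds_ratio
    return odds / (1.0 + odds)

performance_change = pct_change_in_odds(1.65)  # academic performance, Model 1
parent_change = pct_change_in_odds(1.56)       # parent expects college, Model 1

# Hypothetical baseline: a student with even odds of expecting college.
new_probability = shift_probability(0.50, 1.65)
```

Under that hypothetical baseline, a 65 percent increase in the odds moves the probability from .50 to roughly .62, a reminder that percent changes in odds are not percentage-point changes in probability.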

The “year” effect in Model 1 mirrors the over-time decline in expectations 
evident in Table 2, with the odds of expecting to attend college estimated to drop 
20 percent per period. As youth approach the end of high school, their prospects 
for the future become clearer, and for many in places like Baltimore that future 
entails challenges: in 1990, the city’s childhood poverty rate stood at 32 percent, 
almost a fourth of youth age 16 through 19 were unemployed, and a third of the 
city’s children lived in so-called “distressed neighborhoods,” all figures above the 
national averages for large cities (Annie E. Casey Foundation 1996). 
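
Because the year effect is multiplicative in the odds, it compounds across periods. The sketch below assumes that one "period" corresponds to one unit of the (year - 4) term, which is an interpretive assumption rather than something the text states; the function name is hypothetical.

```python
def cumulative_odds_multiplier(per_period_ratio, periods):
    """Odds multiplier after compounding a per-period odds ratio."""
    return per_period_ratio ** periods

# The year term is (year - 4), so year 11 lies 7 units past the year 4 baseline.
multiplier = cumulative_odds_multiplier(0.80, 11 - 4)
```

Under this reading, and net of the other predictors, the odds of expecting college in year 11 would be about one fifth of the year 4 odds.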

In light of these externalities, to see the panel’s optimism fade as they age seems 
reasonable, but the analyses in Table 3 additionally show that the fade is highly 
selective. In testing for interactions, just two emerge as significant, and the time 
trend is implicated in both: the SES * Year coefficient in Model 2 (see Table 3; p <
.01) and the Year * Parent Expects College coefficient in Model 3 (odds ratio = 1.11; p <
.05). In the first case, it is high SES youth who hold firmer to their intentions over
the years; in the second it is those who receive strong parental support.'* High 
resources in these areas buffer the tendency for expectations to drop with time, and 
for those with initially high expectations, that means greater stability. 

Table 3 is focused on patterns of change, and these are important, but this last 
finding makes clear that stability is important too. Almost 40 percent of the panel 
is excluded from the fixed-effects analysis because their expectations were invari- 
ant over this entire span of years. Because their thinking remains stable, changes 
in the right-side predictors can have no bearing on their thinking about college. 
Exactly who these youth are, and what their expectations look like, is taken up in 
Table 4, which reports SES background and resource levels in support of college 
across time for two categories of youth: Stable Highs (college oriented on all six 
occasions) and Stable Lows (never expected college). 

Table 4: Social Origins, Academic Feedback and Parental Support for Stable
Expectations, by Year

                              Stable High     Stable Low
                              Expectations    Expectations
Social Origins
  Socio-economic Status
    Low                          28.1            84.9
    Middle                       29.3            15.1
    High                         42.6             0

Academic Feedback
  Academic Performance
    Year 4                       .475           -.650
    Year 6                       .465           -.552
    Year 8                       .479           -.580
    Year 9                       .454           -.269
    Year 10                      .405           -.244
    Year 11                      .438           -.468
  High Track Placement
    Year 4                       .000            .000
    Year 6                       .569            .294
    Year 8                       .561            .252
    Year 9                       .408            .008
    Year 10                      .385            .000
    Year 11                      .318            .008
  Cumulative Retention
    Year 4                       .194            .496
    Year 6                       .231            .563
    Year 8                       .266            .664
    Year 9                       .268            .664
    Year 10                      .304            [?]
    Year 11                      .339            .815

Parental Support
  Parent Expects College
    Year 4                       .759            .076
    Year 6                       .734            .084
    Year 8                       .832            .092
    Year 9                       [?]             .008
    Year 10                      .860            .054
    Year 11                      .907            .059
N                                288             24

This coding makes it easy to distinguish the resource rich from the resource poor,
and the contrasts are stark. High SES youth, who constitute only 25 percent of the 
total sample, account for more than 40 percent of the Stable Highs, while low SES 
youth, half the panel, make up less than 30 percent. At the other extreme, almost 90 
percent of Stable Lows are low SES youth and there’s not a single high SES youngster 
among the 24, vividly illustrating the imprint of socio-economic origins. 

The resource underpinnings of college plans also evidence stark differences in 
Table 4, with Stable High youth receiving mainly affirming feedback and Stable 
Low youth receiving mainly disconfirming feedback at all points of comparison. 
Specifically, in the early years Stable High youth average just under a half standard 
deviation above the sample-wide test score/mark distribution mean, most take some 
high level courses, their retention rate is in the vicinity of 25 percent (lower than 
the panel-wide cumulative rate of 45.8% by year 8), and around three-fourths of 
their parents expect college of them. Although there are small downward changes in 
some of these measures during the high school years, the relative academic standing 
of the Stable Highs remains strong overall, including a decided uptick in parental
expectations, with 90.7 percent expecting college of their children in 11th grade. The
Stable Lows, though few in number, likewise receive signals that mainly align with, 
and presumably reinforce, their ideas about the future: their academic performance 
is low, hardly any take high level courses, they experience extremely high rates of 
grade retention (more than 80% have been held back by year 11), and their levels 
of parental support for college are extremely low. 

Limiting the study of expectation formation to the latter years of high school 
misses a great deal: (1) a sizeable minority of youth in the BSS panel hold sta-
ble college expectations back to fourth grade; and (2) signals that reflect on
academic competence and prospects are strongly aligned across years. These
observed patterns of stability (Table 4) and change (Table 3) accord with life 
course developmental ideas, and their socio-economic overlay accords with an 
educational stratification perspective, but one that plays out over the broad sweep 
of children’s schooling. 

Another feature of the socialization context that bears on the development of 
educational goals is the consistency of resources and socialization influences across 
domains of life experience. Table 5 reports, by family SES level, the average 
number of college-affirming signals year by year and then cumulatively across 
years. In the cross-section, scores can range from zero (no affirming signals) to 
four (all affirming signals);'> across years the range is zero to 24. This examina- 
tion of signals is loaded in favor of academics to the neglect of other influences 
(peers, teachers, labor market opportunities, etc.), so the view here is both 
partial and skewed. We nevertheless think it revealing that Table 5 shows large
differences across socio-economic levels in the cumulative weight of socialization 
inputs, favoring an orientation toward college among the advantaged and an 
orientation away from college among the disadvantaged. 
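
The signal count can be sketched as follows. The exact definition of the four signals is not spelled out in this passage, so the sketch assumes they are the four resources tracked throughout the analysis: high academic performance, high track placement, no history of retention, and a parent who expects college. The field and function names are hypothetical.

```python
def affirming_signals(obs):
    """Number of college-affirming signals (0 to 4) on one occasion.

    Assumes each field is coded 0/1; retention counts as affirming when absent.
    """
    return (obs["high_performance"] + obs["high_track"]
            + (1 - obs["retained"]) + obs["parent_expects_college"])

def cumulative_signals(history):
    """Total affirming signals across all occasions (0 to 24 for six occasions)."""
    return sum(affirming_signals(o) for o in history)

all_affirming = {"high_performance": 1, "high_track": 1,
                 "retained": 0, "parent_expects_college": 1}
none_affirming = {"high_performance": 0, "high_track": 0,
                  "retained": 1, "parent_expects_college": 0}

best_case = cumulative_signals([all_affirming] * 6)
worst_case = cumulative_signals([none_affirming] * 6)
mixed_case = cumulative_signals([all_affirming] * 3 + [none_affirming] * 3)
```

The two extreme cases reproduce the stated ranges of zero to four per occasion and zero to 24 across the six occasions.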

Table 5: Average Number of Supportive Signals by Year and Socio-economic Status

                   Socio-economic Status
               Low      Middle     High
Year 4         1.18      1.93      2.52
Year 6         1.39      [?]       3.07
Year 8         1.37      2.28      [?]
Year 9         1.11      1.96      2.67
Year 10        1.09      1.92      2.18
Year 11        1.05      2.02      2.82
All Years      7.20     12.36     17.02


Low SES youth each year receive in the vicinity of just one signal that they 
are college material and high SES youth nearly three; the middle SES figures fall 
squarely between. So year after year, environmental cues reinforce young people's 
given place in the social hierarchy, and cumulatively the differences seem quite large:
7.2 affirming signals at the low end over 11 years vs. 17 affirming signals at the high.
Assuming this is a reasonable picture, then extrapolating across the great wealth
of relevant externalities young people experience growing up helps explain why 
educational expectations elicited toward the end of high school mirror status origins. 

However, contrary to our hypothesis, it is middle SES youth, not low SES 
youth, who receive the most mixed signals—in every year about two affirming 
signals and two disconfirming signals. The consistency of cumulative feedback 
among low SES youth rivals that of high SES youth, but at the negative ex- 
treme—three negative signals and just one positive one per occasion, on average. 
This would seem a weak developmental foundation for expecting to attend college, 
but even so toward the end of high school many middle and low SES youngsters 
do so: 70.2 percent and 52.1 percent, respectively, in year 11 (see Table 2). A
“college for all” ethos encourages even poorly prepared youth to aspire to college
(Goyette 2008; Rosenbaum 2001). The final part of our analysis asks whether a 
history of limited resources and inconsistent/disconfirming feedback makes it less 
likely that these ambitious youth will make it to college. 


College Destinations: Attendance in Relation to Expectation History 
Table 6 shows the highest level of schooling undertaken by the BSS panel as of
December 1999, five years after their on-time high school graduation, in relation to their
expectation histories. A more typical study would measure expectations just once
or twice in the upper grades. Our expectation constructions in Table 6 include
that information, but embed it in the youth’s entire expectation history back to
4th grade. Table 6 includes the Stable Highs and Stable Lows and four additional
categories: those whose year 4 plans change but eventually settle where they started
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Table 6: Education Expectation Histories by Highest Postsecondary Enrollment as 
of December 1999 


Expected to Attend Did Not Expect to Attend 
___CollegeinYear11 —» ~__—CollegeinYear11_— 
Stable Volatile Stable Volatile 
High High Riser Low Low _ Faller Total 
No Enrollment 19.5 49.9 34.8 89.3 71.8 78.1 47.0 
Enroll in 2-Year School 31.4 37.0 47.6 10.7 26.2 20.2 30.3 
Enroll in 4-Year School 49.1 13:4 Wan 0 2.0 iam Zonk 
N 288 151 78 24 50 199 790 


(Volatile High and Volatile Low), and those whose year 4 plans change and settle 
opposite the respondent's initial thinking (Risers and Fallers). If intentions near 
the end of high school were all that mattered, we would expect to see similar rates 
of postsecondary attendance among the three groups that are college oriented in 
year 11—the Stable Highs, Volatile Highs and Risers—and the three groups that 
are non-college oriented in year 11—the Stable Lows, Volatile Lows and Fallers. 
The actual figures are quite different. 

The Stable Highs have the highest rate of college attendance by far—80.5 percent 
with almost half attending four-year schools (49.1%). These are youth whose intent 
to attend college has been unwavering. They are mostly high SES youth and have 
received encouragement from parents all along. Their expectations are well anchored 
in confirming academic experiences and appear to have the biggest pay-off. 

The Volatile Highs and Risers also said in year 11 they intended to attend 
college, but the Volatiles wavered over the years, while the Risers at some point 
warmed up to the idea of college. At 50.1 percent and 65.2 percent college at- 
tendance, respectively, both groups’ attendance lags far behind the Stable Highs. 
Further, when Volatile Highs and Risers do attend college, it is more likely a two-year
school. Recall that middle- and low-SES youth have the most changeable
expectations, and compared to high SES youth they receive less encouraging 
feedback from school and less consistent parental support. Table 5 is a simple 
descriptive accounting, but these attendance patterns accord with the idea that 
unsettled expectations, a history of conflicting signals, and limited socio-economic 
resources don’t just compromise postsecondary prospects, but also channel youth 
into stratified postsecondary destinations. 
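
The attendance rates quoted in this discussion follow directly from the "No Enrollment" row of Table 6: the share attending any college is 100 minus the share never enrolled. The short Python sketch below reproduces that arithmetic; the variable and function names are illustrative only, and the percentages are those printed in the table.

```python
# Percent of each expectation-history group with no postsecondary enrollment,
# copied from the "No Enrollment" row of Table 6.
no_enrollment = {
    "Stable High": 19.5,
    "Volatile High": 49.9,
    "Riser": 34.8,
    "Stable Low": 89.3,
    "Volatile Low": 71.8,
    "Faller": 78.1,
}


def attendance_rate(pct_not_enrolled):
    """Share attending any college: 100 minus the share never enrolled."""
    return round(100.0 - pct_not_enrolled, 1)


# Attendance rate (two-year or four-year) for every group.
attendance = {group: attendance_rate(p) for group, p in no_enrollment.items()}

for group, rate in attendance.items():
    print(f"{group:<14} {rate:5.1f}")
```

Read this way, the three college-oriented groups span a 30-point range in attendance even though all of them reported the same plan in year 11.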

Turning now to the other half of Table 6, it might seem odd to examine the
postsecondary destinations of youth who disavow an interest in college, but
in fact a “snapshot” of expectations is far from binding.¹⁶ Of the three groups
who did not expect college attendance in year 11, the Volatile Lows most often
attend (28.2%). This seems sensible considering that at some point they had
been focused on college. Also sensible, the Stable Lows are least likely to attend
(10.7%). It appears that the degree of commitment matters here too, even for
those who are largely not college oriented, yet when these youth do attend college,
it is almost always a two-year school.

We saw previously comparing across groups that expectations are heterogeneous
in their construction; here we see indications that they also are heterogeneous in
their consequences. Youth with a firm commitment to postsecondary attainment
have the highest odds of enrolling in schools focused on baccalaureate preparation,
while those with a less secure vision of the future are less likely to attend altogether
and when they do attend it is more likely at a two-year institution. All of this is
missed when expectations are measured just once toward the end of high school.

Table 7 completes these analyses, asking whether the developmental histories 
captured in these constructions are more discriminating with respect to college 
attendance than are expectations measured cross-sectionally. Two multinomial lo- 
gistic regression analyses are reported, the first using educational expectations mea- 
sured in year 11 to predict postsecondary attendance (the traditional approach) 
and the second using the expectation 
history constructions (the developmen- 
tal approach). The reference category is 
“no college attendance.” 

According to Model 1 in Table 7, edu- 
cational expectations from high school 
are highly predictive of later college at- 
tendance. Youth who are college oriented 
in year 11 have almost three times the 
odds of attending a two-year college 
and nearly 14 times the odds of attend- 
ing four-year college compared to youth 
who in year 11 did not intend to go to 
college. So, expectations matter, but are 
these results reason to believe that some 
expectations matter more than others?
Though Stable Highs and Volatile 
Highs (the reference group) both say 
in year 11 they expect to attend college, 
their postsecondary prospects and desti- 
nations are different. Model 2 in Table 7 
shows that Stable Highs are more than 
twice as likely as Volatile Highs to at- 
tend a four-year school (as opposed to 
not attending college). So youth with 
longstanding plans to attend college 
have a decided advantage, and not just 
in terms of college attendance, but also 
in terms of four-year college attendance.
The Volatile High vs. Riser difference,
on the other hand, is not significant.
Like Volatile Highs, Volatile Lows also shift their thinking over time, but
in the opposite direction. Although students in the two groups report different
plans in year 11, their odds of enrolling in either a two- or four-year school are
not significantly different net of other characteristics.¹⁷ Thus while college
expectations in year 11 discriminate college attendance in the aggregate, the
history within which these expectations are embedded has additional implications
for college destinations. These trajectory distinctions and students’ personal
histories add useful perspective to what can be gleaned about college attendance
prospects from circumstances extant toward the end of high school.

[Table 7. Multinomial logistic regression of postsecondary attendance on year 11
expectations (Model 1) and expectation histories (Model 2); predictors include
parental support (parent expects college), high track placement and cumulative
retention.]

2046 • Social Forces 88(5)

As planful actors, young people are expected to be forward looking, and around 
times of impending life course transitions to plot a course of action (Shanahan 2000).
But the evidence here suggests that not all plans are equally well-grounded or 
well-informed, and for those reasons not equally effective. Those differences aren't 
obvious when expectations for college are detached from their developmental 
foundations, but they are real and consequential. Young people's ideas about col- 
lege and what awaits them are not altogether ephemeral or capricious (Morgan 
1998), but neither are they all equally grounded in relevant externalities. That 
being the case, to use them as a guide to where young people are heading, it helps
to have a sense of where they have been.


Conclusion 


Our analysis, based on the experiences of a cohort of urban youth spanning 
the majority of their schooling careers, documents the processes posited in the 
Wisconsin framework, but does so dynamically. It also illuminates differences 
in the process of expectation formation across social lines not evident in 
earlier studies focused on the latter years of high school. These involve: (1)
differences in the patterning of expectations over time, and (2) heterogeneity
over time and across social lines in the socialization signals youth receive 
about their educational prospects. 

The original Wisconsin studies measured expectations toward the end of 
high school, tacitly, if unintentionally, reifying the idea that the high school 
years are when the socialization processes at the heart of their framework 
operate most forcefully. But the present analysis observes like processes as 
early as 4th grade. More than 60 percent of the BSS sample changed their ideas
between 4th grade and 11th grade, with most turning away from college. The
largest decline in expectations occurred in the late middle school years, on
the eve of high school. By the time students reach the 11th grade, when previous
studies start evaluating the determinants of expectations, postsecondary
orientations for many already were set. Youth respond to relevant signals at
home and at school and adjust their expectations accordingly. Missing from
the original Wisconsin path model, then, is an “arrow” denoting time.

The long view taken in this research also reveals a completely different dy- 
namic, one in which ongoing academic and interpersonal experiences do not 
modulate expectations. Almost 40 percent of the BSS panel held expectations
that were invariant from 4th through 11th grade. Seeing expectations stabilize
for many youth as early as the 4th grade suggests a dynamic at play other than
self-reflection of the sort stressed in the original Wisconsin theorizing (Haller 
and Portes 1973) or the tacit cost-benefit accounting advanced in more re- 
cent interpretations (Morgan 2005). At age 10 it is dubious many youth are 
attuned to the importance of educational credentialing at the postsecondary 
level and the occupational pay-off to different skill sets. 

Rather, one’s social-structural location, according to Bourdieu (1973, 1977),
leaves a deep imprint on the psyche, determining habits, tastes and preferences, 
and extending to one’s understood place in the social order. Experiences at home, 
at school and in the wider world, including those embedded in one’s academic 
performance, parents’ wishes and the daily life conditions that we crudely measure
as family SES, are structurally constrained and often well aligned. Children whose 
parents hold well-paying professional jobs, who have consistently done well 
in school, and whose social support networks give consistent encouragement 
have no need to adjust their ideas about college as they progress through 
school because their early thinking is reinforced all along the way. The same 
holds for many of those growing up in conditions of disadvantage, except their 
experiences point away from college. 

As a model of social reproduction, the Wisconsin framework understates 
the forcefulness by which family social-structural dynamics express themselves 
over the life course. Our study was able to document aspects of how this 
happens through the consistency of socialization signals received over time, 
but no doubt even deeper, more enduring forces are at work. For example, 
Lareau’s (2002, 2003) qualitative research on a sample of 10-year-olds reveals
how the organization of family and leisure activities, the use of language and 
parental interactions with social institutions cultivate a sense of entitlement 
among middle-class youth that helps them navigate the educational system 
from elementary school to college. 

However, it is equally important that the imprint of social origins seems 
weaker for some than for others. Many youth, particularly those with limited 
resources, receive mixed signals about their educational prospects. Buchmann 
(1989) argues that heterogeneous models of life course patterns provide less 
clearly defined paths, leading to more indeterminate expectations, a situation 
evident in the expectation formation process of middle-SES youth. These are 
youth whose parents on average hold terminal high school degrees, and on a 
larger national scale many probably would be considered lower SES. Sometimes 
these youth do well in school; sometimes they stumble. Over time the signals 
they receive neither consistently support nor temper an expectation to attend col- 
lege, and when they report late in high school that they expect to attend college, 
the extent of their commitment and their ability to follow through are less certain. 
This heterogeneity, hidden in past studies, suggests that not all expectations
are created equal; some are cemented at an early age, some are volatile
over time. Owing to such developmental differences, they vary in their util- 
ity as guides or roadmaps during the transition out of high school. Sturdier 
long-term expectations are more efficacious than volatile ones in the sense of 
forecasting college attendance, but more than that, they also are associated 
with stratified postsecondary destinations. 

The realization that not all college expectations emerge in the same way 
or have the same utility raises a host of interesting questions about the social 
and developmental contexts of their formulation, questions integral to under- 
standing the role of expectations in educational inequality: why do educational 
orientations crystallize at an early age for some; how do youth make sense of the 
range of (potentially conflicting) signals they receive; in formulating postsecond- 
ary plans, how is recent information balanced against the weight of cumulative 
experience; and when high expectations persist against the grain of externalities, 
including many not examined in this study (e.g., learning disabilities, limited 
labor force opportunities, discrimination), are the resulting expectations less useful
as attainment resources? The boundaries of the traditional status attainment
framework afford little space for such questions; a life course developmental fram- 
ing makes them hard to avoid. 


Notes 


1. Representative publications include: Haller and Portes 1973; Otto and Haller 1979; 
Sewell, Haller and Ohlendorf 1970; Sewell, Haller and Portes 1969; Sewell and Hauser 
1975. We refer to this genre of research, interchangeably, as “status attainment models” 
and research in the “Wisconsin” tradition. The citations are not repeated each time. 


2. Following Morgan (1998:134) we use the terms expectation, intention, plan and future 
orientation interchangeably as referring to respondent reported educational plans as 
measuring “... intent... based largely on reasonable appraisals of future events.” 


3. Whites, who made up 24% of the city’s public school enrollment in 1985, were 
oversampled to enable comparisons by race. 


4. All members of the BSS panel were in first grade in fall 1982, “year 1” in the BSS 
accounting scheme. After 1982, owing to grade retention and other setbacks, the
alignment between “year” and “grade” is only approximate. For ease of communication,
we use the two terms interchangeably.


5. No student surveys were conducted in year 5, and in year 7 only students making 
the transition to middle school a year late were interviewed. Expectations also were 
elicited in the spring of years 12 and, for late graduates, 13. Although a large literature 
uses senior year expectation to forecast college attendance, here we stop at year 11. 
The motivational import of expectations measured late in the senior year is suspect
owing to contamination by information on college acceptances and other commitments
for the following year.


6. In years 6, 8 and 9, the percent of expectations taken from the fall interview are 2%, 
13% and 1%, respectively. 
7. In year 9, the “some college” option was used in the spring survey, but not the previous
fall. Almost 70% of those who responded “some college” in the spring replied
college or greater the previous fall when “some college” was not available. This agrees
with our judgment that “some college” should be counted as college oriented.


8. Parent-child agreement with school records was 75% on the timing of a first retention, 
100% for second retentions. In years 6, 9, 10 and 11, these sources were used for 1%, 
.1%, 3.5% and 3.4% of the cases, respectively.


9. Postsecondary vocational and proprietary licensure school attendance are coded “none.” 


10. To further enhance accuracy, the equations used to predict academic achievement include
all variables used in the present analysis as well as student self-reported grades
from years 11 and 12. 


11. Recall that “some college” was not included as a response option until year 9, so the 
“optimism” evident in the early years is at an even higher threshold than later. 


12. The ability to model bidirectional and synergistic processes in the areas under con- 
sideration is severely limited with survey data. 


13. With this recentering, year = 0 corresponds to “4,” the first year expectations were 
measured. 


14. We interacted family SES, race, sex (not reported) and year in school with all other 
main effect predictors, but just those reported in Table 3 achieved significance. 


15. A score of four indicates being at or above the median in the grade and test score 
distribution, some advanced course enrollment, no retention to that point and a 
parental expectation to attend college. 


16. Using the HS&B data, Rosenbaum (2001:72) reports that almost 12% of youth 
attain higher levels of schooling than expected when they were high school seniors. 


17. When using Volatile Lows as the reference group, their plans are not significantly 
different from the Fallers or Stable Lows, suggesting that heterogeneity in the process 
leading to an expectation not to attend college is less consequential than heterogene- 
ity in the process leading to an expectation to attend college. 
Appendix A. Sample Means and Proportion Imputed for Academic Feedback,
Parental Support and Family Structure

                 Mean Without      Mean With
                 Missing Values    Missing Values    Proportion
                 Imputed           Imputed           Imputed

Academic Feedback 

Academic Performance 
Year 4 0 0 02 
Year 6 0 0 20 
Year 8 0 0 ves 
Year 9 0 0 Bb 
Year 10 0 0 aD 
Year 11 0 0 0 

High Track Placement 
Year 4 0 0 .00 
Year 6 41.0 40.0 .00 
Year 8 44.3 43.4 01 
Year 9 19.9 21 rae 
Year 10 22.0 20.2 35 
Year 11 17.4 19.0 Hoo 

Cumulative Retention 

Year 4 34.8 34.7 .08 

Year 6 37.4 39.6 12 
Year 8 42.8 45.8 16 
Year 9 44.9 47.5 14 
Year 10 48.0 53.5 19 
Year 11 50.9 51.3 | 

Parental Support 

Parent Expects College 
Year 4 54.0 54.0 01 
Year 6 46.2 50.0 43 
Year 8 55.7 55.1 29 
Year 9 44.0 46.4 42 
Year 10 56.2 53.7 22 
Year 11 63.3 59.7 0 


Family Structure 
Lives With Mother and Father 


Year 4 44.6 43.6 44 
Year 6 41.7 43.8 5 
Year 8 40.3 40.9 29 
Year 9 41.1 40.2 22 
Year 10 38.2 Ole .20 


Year 11 34.7 33.2 22 


Do Changes in Job Mobility Explain the Growth of Wage 
Inequality among Men in the United States, 1977-2005? 


Ted Mouw, University of North Carolina at Chapel Hill 
Arne L. Kalleberg, University of North Carolina at Chapel Hill 


To what extent did the increase in wage inequality among men in the United States 
over the past three decades result from job loss and/or employment instability? We 
propose a simple method for decomposing the change in wage inequality into compo- 
nents due to upward and downward between-employer mobility and within-employer 
wage changes using data on men’s wages and job mobility from the 1977-2005 waves 
of the Panel Study of Income Dynamics. We find that downward employer mobil- 
ity—a proxy for job displacement based on movement to a lower paid job with a new 
employer—has the largest effect on inequality over a two-year period. However, the 
effect of job displacement declines with time. We find that the effect of job loss ac- 
counts for 39 percent of the increase in wage inequality during the average eight-year 
period from 1977 through 2005, compared to 52 percent that is attributable to wage 
changes for workers who stay with the same employer. 





To what extent did the increase in wage inequality among men in the United 
States over the past three decades result from changes in job mobility? This ques- 
tion joins two central topics in recent research on labor markets. First, the well 
documented increase in men’s earnings inequality in the United States since the 
1970s has been recognized as one of the most dramatic features of the American 
stratification system: inequality in men’s wages increased sharply in the 1980s, 
leveled off in the early 1990s, and grew modestly since (Morris and Western 
1999; Neckerman and Torche 2007; Gottschalk and Danziger 2005; Autor, Katz
and Kearney 2006). Second, a shift from a relatively stable employment model 
toward a “new deal” in which workers’ attachment to employers is more tenuous 
has created widespread insecurity within the labor force (Cappelli 1999; Osterman 
1999; Uchitelle 2006; Kalleberg 2009). 

The idea that changes in employment relations may have contributed to the rise
in wage inequality is prominent in the literature. Osterman (1999), for example, 
argues that both job mobility and wage inequality have increased in recent years, 
creating a “good news/bad news” scenario in the U.S. labor market: those persons 
with marketable skills do well, while workers whose skills are in less demand suffer
adverse consequences when they lose their jobs. Krugman (2006) illustrates these 
transformations by comparing the two largest civilian employers in the past three 
decades: General Motors in the 1970s, which provided stable employment, benefits
and middle-class wages for its production employees; and Wal-Mart, which
currently employs 1.2 million workers and pays low wages, provides few benefits, 
and is characterized by high turnover rates and little reward for employer loyalty. 

The hypothesis that the divergence in economic payoffs to job mobility has led 
to greater wage inequality has been advanced perhaps most forcefully by Bernhardt 
et al. (1999, 2001:16), who maintain that the relationship between job mobility 
and inequality is “... where the link between labor market structure and indi- 
vidual life history is made, where we gain insight into the dynamic processes that 
actually generate inequality...” In particular, they argue that changes in corporate 
restructuring and employment relations have diminished opportunities for up- 
ward mobility within firms and created greater wage inequality among workers 
who must compete for skilled jobs in the external labor market. 

Despite the widespread interest in how changes in job mobility may have 
generated greater economic inequality, systematic quantitative evidence on how 
these are related is conspicuously scarce. While the increase in wage inequality over 
the past 30 years has generated considerable debate regarding its causes (Autor,
Katz and Kearney 2006; Card and Dinardo 2002; Neckerman and Torche 2007),
there is no definitive evidence as to the impact of job mobility and instability. Yet
the issue of how the economic pay-offs to job mobility may have changed is of
considerable importance for public policy: this question connects human capital 
formation to the structures within which people move and where they obtain 
economic returns to their work experiences and skills. 

The few studies that have sought to relate changes in mobility and wage inequality
are limited. Several authors have shown that the effect of employer mobility
on the variance of changes in workers’ wages has increased (Bernhardt et al.
2001; Stevens 2001; Polsky 1999), but this does not necessarily demonstrate
that wage inequality has increased as the result of changing employers. Moreover, 
Bernhardt et al.’s (2001) methodological framework—which relies on a “variance 
component” approach that divides the increase in wage inequality into permanent 
and transitory inequality parts—does not enable them to assess whether changes in 
instability were responsible for the observed growth in wage inequality over time. 

In this article, we seek to evaluate the extent to which the rise of wage inequality 
among men in the United States during the past 30 years is due to changes in job 
mobility. In contrast to variance component models (Haider 2001; Stevens 2001; 
Bernhardt et al. 2001), our model provides a simple, non-parametric decomposi- 
tion of inequality into components due to within-employer wage changes and 
upward or downward between-employer mobility. 


Job Mobility and Wage Inequality: Previous Research 


The internal labor market is the key concept that links job mobility to career 
trajectories and changes in wage inequality. We focus here on firm internal labor
markets (FILMs), although internal labor markets are also found within some occupations
(Althauser and Kalleberg 1981). FILMs represent job ladders that are associated 
with the progressive development of skill and knowledge. Workers enter the firm 
at the bottom of job ladders, and then receive progressively higher earnings as they 
move up the rungs of these ladders. 

FILMs have generally been regarded as a key feature of labor markets in the 
United States during much of the post-war period (Hall 1982). Beginning in the 
mid-1970s, corporate restructuring and other institutional and technological 
changes led to pressures on employers to reorganize the employment relationship 
by making it more market-mediated. Employers were less likely to be able to pro- 
vide their employees with opportunities for long-term employment and upward 
mobility (Cappelli 1999). They thus encouraged their employees to invest in 
their own training, so as to make them more “employable” elsewhere in addition 
to being more valuable to the current employer. This strategy is consistent with 
a model of risk shifting (in this case, investments in training) from employers to 
employees and a decline in the use of FILMs. 

This scenario suggests that employers increasingly substituted market solutions 
for administrative rules as the employment relationship became more marketized. 
Accordingly, a number of writers have speculated that workers’ attachments to 
their employers have become more tenuous. Quantitative evidence as to the extent 
of this discontinuity is mixed. In part, this is due to researchers operational- 
izing job instability in various ways, such as in terms of employer tenure, job 
displacement rates, and job mobility. There is also no consensus on the extent to 
which there have been changes in employer mobility (i.e., movement from one 
employer to another), the indicator of job instability on which we focus. Using 
data from the Panel Study of Income Dynamics, Boisjoly, Duncan and Smeeding 
(1998), for example, found that the rate of changing employers increased since 
the 1980s (see also Rose 1995; Marcotte 1995). On the other hand, Gottschalk 
and Moffitt (1999), Jaeger and Stevens (1999), and Polsky (1999), also using
PSID data, found little evidence of change in employer mobility.¹ Gottschalk and
Moffitt’s (1999) analysis was perhaps the most comprehensive, as they examined
changes in short-term employer mobility using data from the Survey of Income 
and Program Participation as well as the PSID. 

Nonetheless, the key question for this research is not whether job mobility has 
increased, but whether job mobility and job displacement have increased inequal- 
ity. To the extent that firm internal labor markets provided long-term employment 
that rewarded firm loyalty with gradual pay increases and dampened inequality 
among workers in the same firm, then a decline in FILMs could increase inequal- 
ity as firms rely increasingly on external labor markets to fill positions without 
regard for firm tenure, paying a premium for highly-skilled workers but investing 
little in worker training (DiPrete, Goux and Maurin 2001). In a post-FILM 
environment, then, more wage growth and inequality should result from across-employer
movement, regardless of whether such employer mobility actually increased.
Workers are likely to find themselves at the mercy of supply and demand
in external markets. For highly skilled workers, this means a better opportunity to 
receive higher wages. For low-skilled workers, who lose the protections previously 
provided by their seniority and firm-specific knowledge, the opposite. 

A number of recent studies have examined the links between employer mobility and
earnings growth or decline using longitudinal data such as the National Longitudinal 
Survey of Youth (e.g., Bernhardt et al. 1999, 2001; Perticara 2002; Light 2005). 

Using longitudinal data from the PSID and the SIPP, Gottschalk and Moffitt
(1999) found no evidence of a time trend in the effects of employer mobility on 
wages from the early 1980s through the mid-1990s. Bernhardt et al. (1999, 2001) 
examined two cohorts of young white men (National Study of Young Men 1966- 
81) and (NLSY 1979-94) and, in contrast to Gottschalk and Moffitt (1999), they 
found that the pay-offs to employer mobility had declined. 

Similarly, DiPrete et al. (2002) used data from the job tenure supplements of 
the Current Population Survey to estimate changes in the effects of job tenure 
and general experience on wages. They found evidence of a decline in the returns
to job tenure over time and an increase in the returns to general labor market
experience among college-educated workers (but not among workers without a 
college degree). These results are consistent with the argument that a decline in 
internal labor markets has affected earnings trajectories. 


Effects of Job Mobility on Wage Inequality 


To test whether the changes in job mobility have increased wage inequality, we 
need to demonstrate evidence of a differential impact on wages. In other words, 
job mobility increases inequality if it increases the wages of some workers and 
decreases the wages of others. The question is: how big of an impact does it have? 
Using PSID data from 1976 to 1991, Polsky (1999) found evidence that em- 
ployer mobility may help generate greater wage inequality. Among workers who 
switch employers voluntarily, the variance of their change in wages between em- 
ployers increased from 34.6 percent in 1976-1981 to 108.4 percent in 1986-1991. 
Evidence of an increase in the one-year variance of change in wages was also noted 
by Bernhardt et al. (2001) and Stevens (2001). An increase in the variance of wage 
change is the kind of evidence that relates directly to inequality as it indicates a 
differential impact of job mobility on wage growth. Nevertheless, the effect of 
wage changes on overall inequality depends on the worker's position in the wage 
distribution; hence an increase in the variance of wage changes does not necessarily 
demonstrate that inequality has changed as the result of job mobility. 
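
The caveat in this paragraph can be made concrete with the identity Var(w0 + d) = Var(w0) + Var(d) + 2 Cov(w0, d), where w0 is the starting log wage and d is the wage change. The Python sketch below uses made-up log wages for five workers (all numbers are hypothetical, not PSID estimates) to show two sets of wage changes with the same variance but opposite effects on inequality, depending on where in the distribution the gains fall.

```python
from statistics import mean, pvariance


def pcov(x, y):
    """Population covariance of two equal-length lists."""
    mx, my = mean(x), mean(y)
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)


def end_variance(start, change):
    """Variance of end-of-period log wages, i.e. start plus change."""
    return pvariance([s + c for s, c in zip(start, change)])


# Hypothetical starting log wages for five workers.
start = [2.0, 2.5, 3.0, 3.5, 4.0]

# Two sets of wage changes with identical variance:
# in the first, gains go to high earners; in the second, to low earners.
gains_to_top = [-0.4, -0.2, 0.0, 0.2, 0.4]
gains_to_bottom = [0.4, 0.2, 0.0, -0.2, -0.4]

for label, change in [("gains to top", gains_to_top),
                      ("gains to bottom", gains_to_bottom)]:
    # Right-hand side of the variance identity, term by term.
    identity = pvariance(start) + pvariance(change) + 2 * pcov(start, change)
    print(label,
          "| var(change):", round(pvariance(change), 3),
          "| end variance:", round(end_variance(start, change), 3),
          "| identity:", round(identity, 3))
```

In both cases the variance of wage changes is the same, yet end-of-period inequality rises when gains go to high earners and falls when they go to low earners. The covariance term is what a variance-of-changes comparison leaves out.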


Variance Component Models 


The recent literature on job instability and inequality uses “variance component” 
models (Bernhardt et al. 2001; Stevens 2001; Leonardi 2003; Amaral 2003) to 
divide inequality into permanent and transitory components. Using this method, 
Stevens (2001) and Leonardi (2003) found that earnings instability for job- 
changers has increased over time. This conclusion would be consistent with the 
argument that a decline in FILMs has altered the impact of job mobility on wages 
and contributed to an increase in inequality. At the same time, however, this con- 
clusion is misleading: the variance component method is incapable of providing 
an overall decomposition of inequality into components due to between-employer 
mobility and within-employer wage growth. 

In a simple model of permanent and transitory inequality, Gottschalk and 
Moffitt (1994) use PSID data to decompose inequality between the 1970-1978 
and 1979-1987 periods. They define “permanent income” as the average income 
for each individual in each nine-year period and “transitory income” as fluctua- 
tions around each individual’s mean income for the period. They found that a 
third to half of the increase in the variance of earnings of white men from the 
1970s to 1980s reflected short-term (transitory) increases (earnings instability) 
rather than long-term (permanent) increases in the variance of average earnings. 
However, while this approach has the advantage of being methodologically trans- 
parent and easy to calculate, it doesn’t take into account life-cycle earnings patterns 
that should be incorporated into the concept of permanent income. 
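
The simple decomposition described above can be sketched in a few lines. This is
an illustration on simulated data, not Gottschalk and Moffitt's own code: the
function name `permanent_transitory`, the simulated panel and its parameters are
all assumptions made for the example. Permanent income is taken as each worker's
mean log earnings over the window, and transitory income as the deviation from
that mean.

```python
import random
from statistics import fmean, pvariance

def permanent_transitory(panel):
    """Split a balanced panel (one list of log earnings per worker) into the
    variance of worker means and the pooled variance of deviations."""
    means = [fmean(row) for row in panel]  # "permanent income"
    deviations = [x - m for row, m in zip(panel, means) for x in row]  # "transitory income"
    permanent_var = pvariance(means)
    # Deviations average to zero within each worker, so the mean square is a variance.
    transitory_var = fmean(d * d for d in deviations)
    return permanent_var, transitory_var

random.seed(0)
# Hypothetical panel: 2,000 workers observed for nine years,
# each with a fixed effect plus an independent yearly shock.
panel = []
for _ in range(2000):
    mu = random.gauss(0.0, 0.5)
    panel.append([mu + random.gauss(0.0, 0.3) for _ in range(9)])

perm, trans = permanent_transitory(panel)
total = pvariance([x for row in panel for x in row])
print(round(perm, 3), round(trans, 3), round(total, 3))
```

In a balanced panel the two pieces add up exactly to the pooled variance, which
is what makes the method transparent. As the text notes, it ignores life-cycle
earnings patterns.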

Bernhardt et al. (2001), Stevens (2001), Leonardi (2003) and Amaral (2003) 
estimate more complicated models based on Gottschalk and Moffitt (1994) and 
Haider (2001). Although the specification used in each article is slightly differ- 
ent, the basic strategy is the same. First, a baseline model is estimated in order to 
calculate the wage residuals for each worker, 


$$\ln w_{it} = \alpha_t + \beta_t X_{it} + \varepsilon_{it} \qquad (1)$$


where $\ln w_{it}$ is log wages or log earnings for worker i at time t, $\alpha_t$ is a
period-specific intercept, $X_{it}$ is a set of demographic variables (including a
quadratic in experience), and $\beta_t$ is allowed to vary over time. The wage
residual, $\varepsilon_{it}$, is then divided into two components:


$$\varepsilon_{it} = p_t \mu_i + v_{it} \qquad (2)$$


where $v_{it}$ is an error term representing transitory fluctuations in wages, $\mu_i$
is a fixed individual effect, and $p_t$ is a time-specific multiplier of $\mu_i$. The
combination of $\mu_i$ and $p_t$ represents permanent income because they are based
on fixed deviations of the individual’s wages from the earnings trajectory defined
by a quadratic in experience in Equation 1. In contrast, transitory income
represents deviations from each individual’s wage trajectory.
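
A short worked step shows how this specification separates the two kinds of
inequality. It reads Equation 2 as $\varepsilon_{it} = p_t \mu_i + v_{it}$ and adds the
assumption, not stated in the text, that $\mu_i$ and $v_{it}$ are uncorrelated. The
symbols $\sigma_\mu^2$ and $\sigma_{v,t}^2$ are notation introduced only for this sketch.

```latex
\begin{aligned}
\operatorname{Var}(\varepsilon_{it})
  &= \operatorname{Var}(p_t \mu_i + v_{it}) \\
  &= p_t^2 \sigma_\mu^2 + \sigma_{v,t}^2 + 2 p_t \operatorname{Cov}(\mu_i, v_{it}) \\
  &= \underbrace{p_t^2 \sigma_\mu^2}_{\text{permanent}}
   + \underbrace{\sigma_{v,t}^2}_{\text{transitory}}
   \qquad \text{if } \operatorname{Cov}(\mu_i, v_{it}) = 0 .
\end{aligned}
```

Under these assumptions, a rise in $p_t$ shows up as growth in permanent inequality,
while a rise in $\sigma_{v,t}^2$ shows up as growth in transitory inequality (earnings
instability).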

Stevens (2001) and Leonardi (2003) use this model to estimate the effect of 
employer mobility on transitory inequality in the PSID by dividing their sample 
into employer changers and stayers in order to calculate the transitory component
of income separately for each group. Stevens (2001) looks only at displaced work- 
ers and finds that wage instability is substantially higher among displaced job 
changers than workers who stayed in the same job. Given the extensive literature 
documenting average wage losses to displaced workers (e.g., Kletzer 2003), it is 
not surprising that transitory wage variance would be higher among displaced 
workers compared to those who kept their jobs. 

Overall, however, the problem with the variance component models is that they
are incapable of incorporating the effect of job mobility on permanent income
because they parameterize “permanent income” as a fixed individual effect (the
$\mu_i$ term in Equation 2 above). Any deviation from the wage trajectory defined by
$\mu_i$ and the demographic controls ($X_{it}$) is considered transitory. In reality,
however, this is a simplification: job mobility may affect permanent income if it
means getting a job at a relatively high (or low) wage organization that bumps the
worker onto a different trajectory of wage growth. By splitting their sample into
separate groups of job-changers and stayers, Stevens (2001) and Leonardi (2003)
force any effect of job mobility on inequality to be expressed as a transitory wage
change over a specific period of time. This will underestimate the long-term
impact of job mobility on the worker's permanent wage trajectory.

Bernhardt et al. (2001) adopt a different approach. In a variance component 
analysis using two cohorts of NLS data, they use the number of employers the 
worker has had to model the individual fixed-effect $\mu_i$ (see Bernhardt et al. 2001
for a discussion). The problem with this approach is that it models differences in 
permanent income prior to an employer change as a function of the cumulative 
number of employer changes the worker experiences in the data. This parameter- 
izes employer change as if it were a stable characteristic (i.e., the “propensity” to 
change jobs), while the real effect of employer mobility on inequality is that it 
results in wage changes at a specific point in time. 

In their research on earnings volatility, Shin and Solon (2008) argue that the 
drawback of these variance component models is that they rely on complex mod- 
els of earnings growth and that the results are sensitive to untested assumptions 
about model specification. In the case of job instability and inequality this prob- 
lem is compounded by the assumption that job loss results in purely transitory 
departures from workers’ permanent earnings trajectories. In contrast, Shin and 
Solon adopt a simpler approach to studying earnings volatility by calculating the 
variance in the change in wages over successive two-year periods in the PSID. 

We propose a model of the effect of job instability on wage inequality that is 
similar in spirit to Shin and Solon’s approach by calculating changes in inequality 
based on a simple variance decomposition approach. This model avoids the prob- 
lems associated with the variance components approach and allows a direct test 
of the hypothesis that a change in the outcomes of job mobility due to a decline 
in internal labor markets has increased inequality. 


PSID Data


Assessing the implications of economic restructuring for career outcomes requires 
longitudinal data on individuals; such data permit us to examine directly wage 
changes over the course of a person's career, and to link these changes to various 
kinds of job mobility. Our analyses use the PSID, which contains information on 
the career experiences of a national sample of individuals. 

The PSID also allows us to follow individuals through unemployment spells
(which can't be done using cross-sectional data sets such as the CPS). On the other 
hand, employers are not uniquely identified in the PSID (unlike the NLSY, for 
example), so a change of employers must be inferred using questions about length 
of employer tenure. The advantage of the PSID over the NLSY is that we capture 
a representative cross-section of the U.S. labor force, while the NLSY follows a 
single cohort. We restrict our sample of data from the PSID to 1977-2005 because 
tenure with current employer is recorded using interval codes prior to 1976 (see 
Brown and Light 1992). After 1997 the PSID began to collect biennial, rather
than annual, data. In order to make our analysis of the effect of job instability 
on inequality consistent with the pre- and post-1997 data, we restrict our final 
analysis to the odd years of PSID data beginning in 1977. 

In 1979 employer tenure was not asked, so we use a variable on the length of 
time in the current position. Because “current position” may include job changes 
with the same employer, we check the data by using the employer tenure data 
from 1981 and 1982. Workers who report a position change in 1979 or 1980 
but report continuous employment with the same employer during that period 
on tenure data in 1981 or 1982 are recoded, but some within-employer job 
changes are still included.

In order to be comparable to most similar analyses, we restrict our analysis 
to male heads of household in the core sample. A parallel analysis could also be 
conducted on trends in inequality and job mobility for women. When weighted 
(as in our analysis), responses of males in the PSID are representative of male 
household heads since 1968. We exclude cases with wage outliers (hourly wages 
greater than $400 or less than $5 in 2005 dollars), missing tenure data, or
fewer than 200 hours worked during the calendar year. We calculate wages as labor
earnings during the calendar year divided by total hours of work. 
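
The exclusion rules in this paragraph can be expressed as a small filter. This is
a sketch only: the function names `hourly_wage` and `keep_case` and their argument
names are hypothetical and are not PSID variable names, and earnings are assumed
to be already deflated to 2005 dollars.

```python
def hourly_wage(labor_earnings, hours):
    """Hourly wage as annual labor earnings divided by annual hours of work."""
    return labor_earnings / hours

def keep_case(labor_earnings, hours, tenure,
              min_hours=200, min_wage=5.0, max_wage=400.0):
    """Return True if a person-year passes the exclusions described in the text.

    tenure is None when employer tenure is missing (an assumed encoding).
    """
    if tenure is None:        # missing tenure data
        return False
    if hours < min_hours:     # fewer than 200 hours worked in the year
        return False
    wage = hourly_wage(labor_earnings, hours)
    return min_wage <= wage <= max_wage  # drop wage outliers

print(keep_case(40000.0, 2000.0, 3.0))
```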

Consistent with much of the existing literature on transitory inequality in the 
PSID, we first estimate a preliminary regression of log wages on years of potential 
work experience (including squared and cubed terms), and dummy variables for 
year. We then calculate the residuals for each case, and proceed to work with these
rather than directly with log wages. This transforms each worker's wage series into 
a set of deviations from the average life-course trajectory of wages. This permits a 
clearer interpretation of the effects of various job transitions because the positive 
or negative effect of a particular job transition is defined as wage growth relative 
to the average wage growth at a particular age. For example, even if a worker 

Table 1: Trends in Inequality and Job Mobility

        Inequality                  Job Mobility
        Variance of    Number of    Same
Year    Log Wages      Cases        Employer       Down (1)       Up (1)
1975    .2068          1,411        n/a            n/a            n/a
1977    .2303          1,411        63.4           17.9           18.7
1979    .2164          1,453        63.2           17.1           19.7
1981    .2098          1,356        77.0           10.6           12.4
1983    .2144          1,271        78.2           [illegible]    9.6
1985    .2239          1,236        [illegible]    9.7            13.0
1987    .2631          1,236        76.8           10.9           12.4
1989    .2761          1,229        [illegible]    11.3           13.6
1991    .2697          1,191        75.6           [illegible]    [illegible]
1993    .3030          1,142        79.8           10.8           9.4
1995    .2879          1,234        76.8           12.6           10.7
1997    .3103          1,238        79.8           8.3            11.9
1999    .3184          1,292        75.7           10.9           [illegible]
2001    .3464          1,299        74.4           11.6           14.0
2003    .3432          1,259        [illegible]    11.9           10.7
2005    .3393          1,362        78.3           10.4           11.4
Total                               75.0           12.0           [illegible]

Notes:
(1) “Downward job mobility” refers to a new employer with lower wages than the
worker's previous job; “upward job mobility” is a new employer with higher wages
than the previous job.
(2) In 1977 and 1979 the tenure question refers to the job, not the employer.


who loses his job gets back to his original wage level, the impact of the job loss 
is negative if he falls behind relative to the wage growth of other workers of the 
same age. A change in the shape of this average trajectory over time (i.e., if older 
workers earn more now than they did in the past) will be expressed in terms of a 
larger wage residual for these workers. 
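
The residualization step can be sketched as an ordinary least squares regression
of log wages on a cubic in potential experience plus year dummies, keeping the
residuals. The sketch below uses simulated data and a hand-written solver so that
it runs with the standard library alone. The function names, the rescaling of
experience and the simulated wage profile are assumptions for illustration, not
the authors' specification.

```python
import random

def ols_residuals(y, X):
    """Residuals from an OLS regression of y on the columns of X, solving the
    normal equations X'X b = X'y by Gauss-Jordan elimination."""
    k = len(X[0])
    # Augmented matrix [X'X | X'y]
    A = [[sum(r[a] * r[b] for r in X) for b in range(k)]
         + [sum(r[a] * yi for r, yi in zip(X, y))]
         for a in range(k)]
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(k):
            if r != col:
                f = A[r][col] / A[col][col]
                A[r] = [v - f * p for v, p in zip(A[r], A[col])]
    beta = [A[i][k] / A[i][i] for i in range(k)]
    return [yi - sum(b * x for b, x in zip(beta, r)) for r, yi in zip(X, y)]

def design_row(experience, year, years):
    """Cubic in experience plus one dummy per year (the dummies absorb the intercept)."""
    e = experience / 10.0  # rescaled to keep the powers well conditioned
    return [e, e ** 2, e ** 3] + [1.0 if year == y else 0.0 for y in years]

random.seed(1)
years = [1977, 1979, 1981]
# Hypothetical sample of (potential experience, survey year) pairs.
sample = [(random.uniform(0, 40), random.choice(years)) for _ in range(300)]
log_wage = [2.0 + 0.05 * e - 0.0008 * e ** 2 + random.gauss(0, 0.4)
            for e, _ in sample]
X = [design_row(e, yr, years) for e, yr in sample]
residual = ols_residuals(log_wage, X)
```

By construction the residuals average to zero within each year and are
uncorrelated with experience, so a worker's residual measures his position
relative to the average trajectory at his level of experience.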

Table 1 provides basic descriptive information on trends in inequality and job 
mobility in the PSID from 1975 to 2005. There is a substantial increase in wage
inequality over this time period, as the variance increases from .207 in 1975 to
.339 in 2005. Columns 4-6 of Table 1 show time trends in employer mobility.
Based on the organization of our data into two-year periods, we classify workers as 
having moved to a new employer between waves if they have been with the current 
employer less than two years. We then divide employer mobility into “upward” 
and “downward” mobility based on whether a worker's current job pays more or 
less than his previous job. Downward job mobility corresponds closely to invol- 
untary job mobility or job loss because a worker is unlikely to move voluntarily to 
a job with lower wages. Table 1 indicates that there has been no significant trend 
in the level of upward or downward job mobility over time. 
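
The classification rule in this paragraph can be written as a short function. The
names `classify_transition` and `split_change` are hypothetical. Wages are in logs
and tenure in years. Treating a new job that pays exactly the same wage as
“downward” is an assumption, since the text does not say how ties are handled.

```python
def classify_transition(tenure_years, wage_prev, wage_now):
    """Return 'same', 'up' or 'down' for one worker between waves t-2 and t."""
    if tenure_years >= 2:
        return "same"  # with the current employer for at least two years
    # New employer: split by whether the current job pays more than the previous one.
    return "up" if wage_now > wage_prev else "down"  # ties counted as "down" (assumption)

def split_change(tenure_years, wage_prev, wage_now):
    """Zero-filled wage changes by transition type, one entry per type."""
    kind = classify_transition(tenure_years, wage_prev, wage_now)
    change = wage_now - wage_prev
    return {g: (change if g == kind else 0.0) for g in ("same", "down", "up")}

print(classify_transition(0.5, 3.0, 2.6))
print(split_change(5.0, 2.9, 3.0))
```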


A Variance Decomposition Model


In contrast to the parametric models of changes in permanent and transitory 
inequality, the method we use is a direct calculation of the sequential change in 
the variance in log wages over two time periods. We use this method to decompose 
changes in inequality over 2-, 4-, 6- and 10-year time periods into components 
due to upward and downward between-employer mobility and differential rates 
of wage growth with the same employer. First, Equation 3 depicts the change in 
wages for individual workers over a two-year period: 


$$\ln wage_{i,t} = \ln wage_{i,t-2} + \Delta same_i + \Delta up_i + \Delta down_i \qquad (3)$$


where $\ln wage_{i,t-2}$ is the (residual) log wage of worker i at time t-2, and
$\Delta same_i$, $\Delta down_i$ and $\Delta up_i$ indicate the change in log wages for worker i
based on three types of mutually exclusive job transitions between time t-2 and t:
staying with the same employer, moving to a new employer with lower wages
(downward mobility), and moving to a new employer with higher wages (upward
mobility). For example, if worker i had a wage increase of .1 log points and stayed
with the same employer, then $\Delta same_i = .1$ and $\Delta down_i = \Delta up_i = 0$. We can
rewrite Equation 3 more succinctly as:


$$\ln wage_{i,t} = \ln wage_{i,t-2} + \sum_g \Delta w_{g,i} \qquad (4)$$


where g references the type of job mobility and $\Delta w_g$ depicts the change in log
wages for each specific type of job mobility.

If we take the variance of both sides of Equation 4, and subtract the variance
of $\ln wage_{t-2}$ from both sides, we have an equation for the change in inequality
between time t and t-2:


$$\sigma^2(\ln wage_t) - \sigma^2(\ln wage_{t-2}) = \sum_g \left[ \sigma^2(\Delta w_g) + 2\,\mathrm{cov}(\Delta w_g, \ln wage_{t-2}) \right] + \text{joint-effect} \qquad (5)$$


where “g” references the type of job mobility and the term “joint-effect” on the
right-hand side of Equation 5 indicates a small effect of the nonzero covariance
between $\Delta same_i$, $\Delta down_i$ and $\Delta up_i$ due to a correlation among the means,
despite the fact that they are exclusive categories.
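
The identity in Equation 5 can be checked numerically. The sketch below simulates
workers with hypothetical wage changes, builds the three zero-filled change
variables, and confirms that the per-transition effects plus the joint effect add
up to the change in the variance of log wages. The function names, transition
shares and wage-change distributions are illustrative assumptions. The joint
effect is computed as twice the sum of the pairwise covariances among the three
change variables.

```python
import random
from statistics import fmean

def cov(x, y):
    """Population covariance; cov(x, x) is the population variance."""
    mx, my = fmean(x), fmean(y)
    return fmean((a - mx) * (b - my) for a, b in zip(x, y))

def decompose(w_prev, deltas):
    """Per-transition effects (variance plus twice the covariance with initial
    wages) and the joint effect from covariances among the change variables."""
    effects = {g: cov(d, d) + 2 * cov(d, w_prev) for g, d in deltas.items()}
    names = list(deltas)
    joint = 2 * sum(cov(deltas[a], deltas[b])
                    for i, a in enumerate(names) for b in names[i + 1:])
    return effects, joint

random.seed(2)
w_prev = [random.gauss(0.0, 0.5) for _ in range(5000)]  # residual log wages at t-2
deltas = {"same": [], "down": [], "up": []}
for w0 in w_prev:
    kind = random.choices(["same", "down", "up"], weights=[75, 12, 13])[0]
    if kind == "same":
        change = random.gauss(0.03, 0.25) - 0.1 * w0
    elif kind == "down":
        change = -abs(random.gauss(0.35, 0.3))
    else:
        change = abs(random.gauss(0.35, 0.3))
    for g in deltas:  # zero for the transitions the worker did not experience
        deltas[g].append(change if g == kind else 0.0)

w_now = [w0 + sum(deltas[g][i] for g in deltas) for i, w0 in enumerate(w_prev)]
effects, joint = decompose(w_prev, deltas)
lhs = cov(w_now, w_now) - cov(w_prev, w_prev)
rhs = sum(effects.values()) + joint
```

Because the categories are mutually exclusive, the product of any two change
variables is zero for every worker, so each pairwise covariance equals minus the
product of the two means. This is why the joint effect is nonzero but small.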

The effect of job mobility depends not only on the variance of the change in
wages for workers who experience a particular type of transition, but also on where
in the original wage distribution those workers were, which is indicated by the
covariance term $\mathrm{cov}(\Delta w_g, \ln wage_{t-2})$. Note that the covariance is with
$\ln wage_{t-2}$, the entire wage distribution at time t-2, not just the wage
distribution of workers of a particular job mobility category. For example, the
effect of upward employer mobility on inequality is the variance of $\Delta up_i$
(which is 0 for all workers who didn't
experience upward mobility over this period) plus a measure of the covariance
between $\Delta up_i$ and the original wages at time t-2. This makes sense because the
effect of an increase in the wages of upwardly mobile workers on inequality is
ambiguous: if the upwardly mobile workers are at the low end of the income
distribution then an increase in wages may decrease inequality, because
$\mathrm{cov}(\Delta w_g, \ln wage_{t-2}) < 0$, while an increase in wages to high-wage workers
always increases inequality.


Table 2: Definition of the Samples

             Job Transition    Subsequent Wage Growth
Analysis                                                   Job           Wage
Length       t-2      t        t+2    t+4    t+6    t+8    Transition    Change
2 years      Start    Stop                                 t-2 -> t      t-2 -> t
4 years      Start             Stop                        t-2 -> t      t-2 -> t+2
6 years      Start                    Stop                 t-2 -> t      t-2 -> t+4
8 years      Start                           Stop          t-2 -> t      t-2 -> t+6
10 years     Start                                  Stop   t-2 -> t      t-2 -> t+8

Key: Grey shaded area (columns t-2 and t) = period used to define the type of job transition.
“Start” = starting point for definition of wage changes, “Stop” = end point for definition of wage changes.

When we extend the time frame of the analysis to look at 4-, 6- and 10-year
intervals, we keep the original definition of the job transition based on what
happened between time t-2 and t (Table 2). The motivation behind this definition of
the samples is the idea of treatment and effect: the period from t-2 to t defines the
job transition, which is the “treatment,” while the overall change in wages over the
4-10 year period is the “effect” associated with a particular job transition.

The only exception to this coding of job transitions comes from workers who 
appear in the data set at time t-2 and at the end of the analysis period (t+2, t+4 or t+6)
but not at time t, either because they were out of the labor force or had missing 
wage data. In these cases, we code their job mobility based on whether their em- 
ployer tenure indicates a new job within the past 4, 6 or 10 years depending on 
the time frame of the analysis. Cases that have valid wage and tenure data at time 
t, t+2, t+4, or t+8 but no wage data for time t-2 are excluded from the analysis of 
the specific year and time window (2, 4, 6 or 10 years), as we can’t calculate their 
change in wages over the specified time period. 


Results 


Table 3 provides descriptive information on the effect of each type of job transi- 
tion on the change in wages for individual workers over 2-, 4-, 6-, 8- and 10-year 
periods. The type of job transition is defined by employer tenure and wage change
over the first two years (t-2 and t), and then we follow the impact of the job
transition over various intervals based on subsequent wage data. In Panel A, we
see that workers who do not change employers in the first period have an average
gain in (residual) log wages of .026 in the first two-year period, and .044 over

Table 3: Effect of Job Mobility on Change in Log Wages

                                                      Change in Log Wages
Mobility from t-2 to t      Time Period    Cases      Mean           Variance

[Panels A (Same Employer) and C (Upward Job Mobility) are illegible in the source.]

B. Downward Job Mobility
                            2              2,106      [illegible]    .362
                            4              1,604      [illegible]    .431
                            6              1,364      [illegible]    .444
                            8              1,203      [illegible]    .442
                            10             1,048      [illegible]    .464

Note: See Table 2 for the definition of the time periods for job transitions and wage changes.


four years. While the average wage gain doesn't increase significantly after four 
years, the variance of log wages does, which indicates that inequality among these 
workers increases over time. 

In Panel B of Table 3, we show the impact of downward job mobility on wages. 
Over a two-year period, moving to a new employer with lower wages is associ- 
ated with a wage loss of .355 log points. Although much of this is recovered over 
time, particularly between time t and t+2, the medium-term effect of downward
mobility is still substantial four years later at time t+4, where the average wage loss
is .145 log points. Over 8- and 10-year windows, the negative impact continues 
to decline, to -.115 and -.080, respectively. Finally, Panel C of Table 3 shows the 
effect of “upward” mobility from t-2 to t on wages over time. At time t upward 
mobility increases wages by .339 log points, but this declines to .272 at time t+4. 

Table 4 shows the variance-covariance matrices for the 2-, 4-, 6- and 10-year 
periods of analysis using all available cases from our PSID sample. First, we note that 
the results using all of the cases pooled together are potentially misleading given that 
the underlying wage distributions at different times gradually change as inequality 
increases (i.e., the distributions at time t-2 and t are not constant over time). 

Panel A of Table 4 shows the variance-covariance matrix for the change in 
inequality between time t-2 and t. Diagonal elements are variance terms and
off-diagonal elements are covariance terms. The final column of each panel shows the
estimated overall effect of each job transition, calculated using Equation 5 as the 
variance of the change in wages plus two times the covariance with initial wages. 
The variance terms will differ from those in Table 3, which presents the effect for 

Table 4: Variance-Covariance Matrices of Change in Inequality

Panel A: 2-year period
              ln w(t-2)    Δsame      Δdown      Δup        Effect
ln w(t-2)      .252[?]
Δsame         -.03500      .07549                            .0055
Δdown         -.00692      .00077     .02671                 .0129
Δup           -.01741     -.00082     .00176     .02580     -.0090
N = 15,361

Panel B: 4-year period
              ln w(t-2)    Δsame      Δdown      Δup        Effect
ln w(t-2)      .24084
Δsame         -.03706      .08951                            .0154
Δdown         -.00820      .00070     .02417                 .0078
Δup           -.01737     -.00114     .00077     .03132     -.0034
N = 12,831

Panel C: 6-year period
              ln w(t-2)    Δsame      Δdown      Δup        Effect
ln w(t-2)      .23236
Δsame         -.03814      .10910                            .0328
Δdown         -.00789      .00052     .02360                 .0078
Δup           -.01950     -.00123     .00059     .03701     -.0020
N = 10,635

Panel D: 10-year period
              ln w(t-2)    Δsame      Δdown      Δup        Effect
ln w(t-2)      .22504
Δsame         -.04170      .14279                            .0594
Δdown         -.00918      .00035     .02748                 .0091
Δup           [illegible]
N = 8,125

Note: See Equation 5 for definition of the effect on inequality.


workers who experienced a particular type of transition, while $\Delta same_i$, $\Delta down_i$
and $\Delta up_i$ include 0s for workers who did not experience the specific job transition.
To begin with, the variance of log wages at time t-2 is .251. To calculate the effect
of each type of job transition on the change in inequality from time t-2 to t we
calculate the terms in the bracket on the right-hand side of Equation 5, which is
the variance in the change in wages for each type of job transition plus two times
the covariance with wages at time t-2. For example, the effect of staying with the
same employer is .07549 (the variance of $\Delta same_i$) plus 2 × (-.035), where -.035 is
the covariance between $\Delta same_i$ and log wages at time t-2. As a result, staying with
the same employer increases overall wage inequality by .0055 in the pooled data.
The negative covariance between $\Delta same_i$ and log wages at time t-2 indicates that
same-employer wage growth is stronger among workers with less than average wages,
and this negative covariance washes out most of the inequality-generating effect
due to the non-zero variance of $\Delta same_i$.

In contrast to same-employer wage growth, the impact of downward employer
mobility on inequality is relatively large over a two-year interval. Although the
variance in $\Delta down_i$ is smaller than the variance of $\Delta same_i$, the covariance term
is closer to zero as well, and the overall effect is 2(-.0069) + .0267 = .0129, or
about 2.5 times the effect of $\Delta same_i$. In contrast to $\Delta same_i$ and $\Delta down_i$, the
effect of upward job mobility over a two-year period is close to zero in the pooled
PSID data because two times the negative covariance term almost completely cancels
out the variance of the change in wages of $\Delta up_i$. Finally, the covariance terms
between $\Delta same_i$, $\Delta down_i$ and $\Delta up_i$ have a joint effect on inequality, which is
relatively small (2 × (.00077 - .00082 + .00176) ≈ .0034).
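
The arithmetic behind the “Effect” column can be reproduced directly from the
Panel A entries of Table 4. The sketch below is a worked check that applies the
bracketed term of Equation 5 (variance plus twice the covariance with initial
wages) to the published figures. The variable names are illustrative.

```python
# Panel A of Table 4: variance of each change variable and its covariance
# with log wages at time t-2.
panel_a = {
    "same": {"var": 0.07549, "cov_w": -0.03500},
    "down": {"var": 0.02671, "cov_w": -0.00692},
    "up":   {"var": 0.02580, "cov_w": -0.01741},
}

# Effect of each transition type: variance + 2 * covariance with initial wages.
effects = {g: v["var"] + 2 * v["cov_w"] for g, v in panel_a.items()}

# Joint effect: twice the sum of the covariances among the three change variables.
joint = 2 * (0.00077 - 0.00082 + 0.00176)

for g, e in effects.items():
    print(g, round(e, 4))
print("joint", round(joint, 4))
```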

In panels B and C of Table 4, we decompose the impact of job transitions
between time t-2 and t on inequality over a 4- and 6-year period. We find that the
impact of downward mobility on inequality declines as the time period increases.
Over a four-year time period (Panel B), the effect is 2(-.0082) + .02417 = .0078,
and over a six-year time period it is 2(-.00789) + .0236 = .0078. The declining
effect of downward job mobility on inequality from a two-year to four- and
six-year periods reflects the fact that some of the decline in wages associated with
downward job mobility is transitory, as indicated in Table 3. The overall decline
in the magnitude of the effect is about 40 percent as we go from a two- to a
six-year window in Panel C. On the other hand, it is important to point out that the
effect does not go away, which is consistent with the persistence of average wage
losses for downwardly mobile workers noted in Table 3. This result demonstrates
that the variance components models that estimate wage trajectories by assuming
that the effect of job loss on inequality is transitory are misleading: in contrast, we
find there is a persistent medium-term effect of downward mobility on inequality.

The impact of same-employer wage growth on inequality, which was smaller 
than the effect of downward mobility over a two-year period, turns out to be con- 
siderably more important when we follow workers over a four- or six-year period. 

Table 5: Results by Year

Panel A: 2-year period
           Same        Downward Job    Upward Job    Joint
Year       Employer    Mobility        Mobility      Effect     N
1977        .0089       .0148          -.0053         .0057     1,300
1979        .0091       .0037          -.0253         .0065     1,367
1981       -.0001       .0134          -.0095         .0029     1,290
1983       -.0015       .0122          -.0037         .0025     1,215
1985        .0066       .0145          -.0104         .0022     1,172
1987        .0199       .0202          -.0201         .0027     1,166
1989       -.0039       .0104          -.0066         .0032     1,153
1991       -.0002       .0210          -.0176         .0033     1,129
1993        .0332       .0145          -.0157         .0028     1,063
1995       -.0186       .0078          -.0151         .0039     1,145
1997       -.0043       .0132          -.0032         .0024     1,094
1999        .0179       .0067           .0014         .0025     1,113
2001        .0070       .0162           .0072         .0051     1,151
2003       -.0009      -.0027           .0036         .0042     1,078
Average     .0052       .0119          -.0086         .0036

Panel B: 4-year period
           Same        Downward Job    Upward Job    Joint
Year       Employer    Mobility        Mobility      Effect     N
1977        .0065       .0046          -.0006         .0025     1,165
1979        .0109      -.0011          -.0187         .0030     1,142
1981        .0079       .0081          -.0083         .0012     1,063
1983        .0097       .0085          -.0051         .0002     1,002
1985        .0190       .0098           .0003        -.0007     980
1987        .0345       .0150          -.0111         .0004     986
1989        .0047       .0031           .0007        -.0001     954
1991        .0246       .0127          -.0106        -.0005     906
1993        .0222       .0042          -.0095         .0007     886
1995        .0073      -.0029          -.0043         .0012     950
1997        .0099       .0111           .0077         .0005     922
1999        .0335       .0168           .0096         .0006     950
2001        .0085       .0097          -.0007         .0011     925
2003        .0215      -.0161          -.0064         .0013     955

Panel C: 6-year period
           Same        Downward Job    Upward Job    Joint
Year       Employer    Mobility        Mobility      Effect     N
1977        .0134       .0054          -.0049         .0027     1,008
1979        .0162       .0006          -.0091         .0026     1,000
1981        .0301       .0034          -.0083         .0002     927
1983        .0365       .0110           .0003        -.0015     910
1985        .0425       .0069          -.0011        -.0032     884
1987        .0489       .0036          -.0049        -.0017     877
1989        .0291       .0117          -.0025         .0002     815
1991        .0227       .0073          -.0060        -.0003     838
1993        .0633       .0103          -.0153         .0000     799
1995        .0345       .0053          -.0073         .0003     898
1997        .0232       .0106           .0183         .0003     861
1999        .0357       .0157           .0098        -.0012     818
2001        .0262       .0084           .0051         .0007     909
Average     .0325       .0077          -.0020         .00006


In panel C, the effect over a six-year window is 2(-.03814) + .1091 = .0328 or
about 4.5 times the effect of downward job mobility. This is due to the fact that
the variance of $\Delta same_i$ increases over time (from .0755 after two years to .1091
after six years). Another way to look at the comparison between the impact of
$\Delta same_i$ and $\Delta down_i$ is to note that while the variances of the change in wages
are fairly similar in Table 3, the effect on inequality of $\Delta same_i$ is larger because
only a minority of cases experience downward job mobility in a given year. Finally,
in panel D of Table 4, we extend our window of analysis to a 10-year time period.
The key finding from Panel D reiterates the basic findings from panels B and C
that while the impact of involuntary mobility does not disappear, it is dwarfed in
size by the effect of staying with the same employer between t-2 and t.

While Table 4 pooled all the years of data together into a single analysis for 
each time interval, Table 5 calculates the effect of each type of job transition on 
inequality separately for each year of our sample. This is a more accurate approach 
than the pooled data, but small sample sizes (i.e., 1,200-1,500 per year) and year- 
to-year fluctuations in inequality make the results harder to interpret. 


Calculating the Overall Effect of Job Displacement 


The transparency of our approach in tables 4 and 5 is based on Equation 5 and the 
“treatment” and “effect” depiction of the sample illustrated in Table 2. Nonetheless, 
this approach has two shortcomings. First, it is possible that workers who stay with 
the same employer between time t-2 and t lose their jobs two, four or six years in 
the future. Hence, it is possible that Panel C in Table 4 overestimates the relative 
importance of the effect of staying with the same employer between t-2 and t on 
inequality at time t + 4: the worker may have lost his job later, and his effect on 
inequality should be attributed to job displacement. In addition, the overlapping 
windows of time in the different panels of Table 5 mean that it is difficult to
calculate the overall effect of displacement two, four, six and eight years in the 
past on current inequality. Although a comparison of different panels in Table 4 
shows that the relative importance of job displacement on inequality diminishes 
over time, the “turbulence” in the wage distribution caused by a continual level 
of job displacement means that the overall effect on inequality is a combination 
of the effects at different time periods. Here we adapt Equation 5 to measure the 
overall effect of job displacement at different time intervals on current inequality 
by calculating the change in inequality between time t-2 and t using six categories 
of job mobility: same employer or upward mobility over the past two years, and 
four categories of job displacement indicating whether the job loss occurred two, 
four, six or eight years ago. 

We opt to calculate the cumulative effect of job mobility on inequality over an 
eight-year period for two reasons. First, the effect of job displacement declines 
over time, as workers recover wages that were lost, as indicated by the average 
change in wages by job mobility in Table 3. Second, as the sample changes over
time due to retirement and new entrants to the labor market (and non-retirement
panel turnover) it is misleading to connect past job mobility of respondents who
are no longer in the sample to the current level of inequality. The choice of eight
years for the “window” is long enough for key short- and mid-term effects of job 
displacement to play out, while being short enough that turnover in the labor 
market doesn't diminish the impact of past mobility on current inequality. 

Even within the eight-year window, however, we want to account for the effect 
that sample turnover has on the change in inequality across consecutive two-year 
periods. There are four types of movement in and out of the sample. Because we 
know the first and last year of participation in the PSID (up to 2005), we can 
identify new entrants into the sample (INSAMPLE) and respondents who are 
in the sample for the last time (OUTSAMPLE). In addition, we can identify
respondents who temporarily move out of the sample (OUT) and those previous
members who move back in (RETURNING, labeled IN in Table 6).

Accounting for change in the sample is depicted as a series of steps in Table 
6. First, we start with the wages for all workers observed at time t-2. In steps 2 
and 3, we remove the workers who are observed at time t-2 but not at time t 
(OUTSAMPLE and OUT). Consequently, in Step 3, we are left with the
sample of workers who are observed at both time t-2 and time t (BOTH). In Step
4 we add the most recent wage observation prior to time t-2 for workers who are
rejoining the sample after not being observed at time t-2 (RETURNING) after
an absence of no more than eight years. We do this because we want to include 
the inequality-generating effect of workers who lose their job and drop out of the 
labor force temporarily but return at a later date. Previously sampled workers who 
are returning to the sample after an absence of more than eight years are included 
in the INSAMPLE group on the grounds that they have no prior record in the 

Table 6: Accounting for the Effect of Changes in the PSID Sample, Time T-2 to T

                                                   Change in Inequality
Step    Time    Sample                             (current step - previous step)
1       T-2     All, time t-2                      n/a
2       T-2     Step 1 - OUT                       Churn1
3       T-2     Step 2 - OUTSAMPLE = BOTH          ΔSample1
4       T-2     Step 3 + IN = BOTH+IN              Churn2
5       T       BOTH+IN                            ΔEquation 5
6       T       Step 5 + INSAMPLE = All, time t    ΔSample2

OUT(t-2) = cases observed at time t-2 but not at time t that will return to the
sample in the future.
OUTSAMPLE(t-2) = cases that are leaving the sample at time t-2.
IN = previously sampled cases that were not observed at time t-2 and are returning
to the sample at time t.
BOTH = cases observed at time t-2 and t.
INSAMPLE = new cases observed at time t.

Churn1, Churn2 = change in inequality due to movement of continuing sample members
in and out of the sample.
ΔSample1, ΔSample2 = change in inequality due to the addition of new members and
the departure of old members.
“ΔEquation 5” indicates the change in inequality that can be calculated directly
from Equation 5.


window we are analyzing. The change in inequality between Step 3 and Step 4
measures the impact of including prior wages for returning sample members on 
the overall level of inequality at time t-2. 

Step 5 includes the time t wages of the sample of workers who were observed 
at both time t-2 and time t and the returning workers mentioned in Step 4. The 
difference in inequality between Step 4 and Step 5 is captured by Equation 5 as the 
change in inequality between time t-2 and time t, keeping in mind the addition of 
the returning workers described in Step 4. Finally, in Step 6 we add new members 
of the sample who are observed for the first time at time t (INSAMPLE). When we 
account for the change in inequality due to the changing composition of our PSID 
sample in steps 2, 3, 4 and 6 and the change in wages for BOTH and IN groups 
in Step 5, we can write the overall change in inequality between time t-2 and t as: 


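\sigma^2(\ln \text{wage}_t^{ALL}) - \sigma^2(\ln \text{wage}_{t-2}^{ALL}) = (\Delta\text{Sample1} + \Delta\text{Sample2}) + (\text{Churn1} + \text{Churn2}) + \Delta\text{Equation 5} \qquad (6)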

where the superscript ALL on the left-hand side of Equation 6 indicates that the
time t and time t-2 inequality includes all workers, not just those observed at both
time periods. ΔSample1 is the change in inequality between steps 2 and 3 in Table
6 due to the loss of sample members, and ΔSample2 is the corresponding change
in inequality between steps 5 and 6 due to the addition of new members. Similarly,
“Churn1” and “Churn2” represent the stepwise change in inequality due to returning
and (temporarily) departing sample members. Δ Equation 5 indicates the change in
inequality attributable to wage changes in Equation 5 (Step 4 to Step 5).
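
The stepwise accounting in Table 6 can be illustrated with a minimal sketch. It assumes log wages are stored in plain lists keyed by sample group; the function name, the dictionary layout and the toy numbers are hypothetical and are not taken from the PSID. Step 4 follows the description in the text (the BOTH group plus the prior wages of returning workers). Because each term is the difference between adjacent steps, the five terms sum to the overall change in the variance of log wages between time t-2 and time t.

```python
from statistics import pvariance


def stepwise_changes(prev, curr):
    """Return the variance at each Table 6 step and the change between steps.

    prev / curr map a sample group name to a list of log wages observed
    at time t-2 / time t (hypothetical layout, for illustration only).
    """
    steps = [
        prev["BOTH"] + prev["OUT"] + prev["OUTSAMPLE"],  # Step 1: all, time t-2
        prev["BOTH"] + prev["OUTSAMPLE"],                # Step 2: drop OUT
        prev["BOTH"],                                    # Step 3: drop OUTSAMPLE
        prev["BOTH"] + prev["IN"],                       # Step 4: add returners' prior wages
        curr["BOTH"] + curr["IN"],                       # Step 5: same workers, time t
        curr["BOTH"] + curr["IN"] + curr["INSAMPLE"],    # Step 6: all, time t
    ]
    variances = [pvariance(s) for s in steps]
    names = ["Churn1", "dSample1", "Churn2", "dEquation5", "dSample2"]
    changes = {name: variances[i + 1] - variances[i] for i, name in enumerate(names)}
    return variances, changes


# Toy log wages, invented for the example.
prev = {"BOTH": [2.0, 2.5, 3.0], "OUT": [1.5], "OUTSAMPLE": [3.5], "IN": [1.8]}
curr = {"BOTH": [2.1, 2.4, 3.4], "IN": [1.6], "INSAMPLE": [2.2]}

variances, changes = stepwise_changes(prev, curr)
total_change = variances[-1] - variances[0]
```

In this sketch `total_change` equals the sum of the values in `changes` up to floating-point error, which is the accounting identity that Equation 6 expresses.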

Table 7 shows the number of cases in each category by year. In panel A we 
distinguish our rotating longitudinal panel (BOTH + IN) from the other cases. 
On average, the sample consists of 982 cases observed in both year t and t-2, 79 
previously observed cases rotating in or out, and about 140 cases joining or leaving
the sample (excluding INSAMPLE for 1975 and OUTSAMPLE for 2005).
Because the number of cases in BOTH is large compared to the number of re- 
turning cases, it seems likely that the majority of the effect of job mobility on the 
change in inequality will be due to cases observed in both years. However, even 
if it isn’t, we believe that including the returning workers in Step 4 of Table 6 is 
important, as it is possible that discouraged workers who drop out of the labor 
force after losing their jobs could have different wage dynamics than those who 
are in the sample continuously after a job loss. 

Panel B of Table 7 shows the job mobility classification of workers who are
included in the BOTH+IN sample defined by steps 4 and 5 of Table 6. Each of 
the displacement variables DISP(N) for N = 2,4,6 or 8 is coded as 1 if the worker 
lost his job N years ago. If the worker has multiple job displacements within each 
eight-year period, then the most recent displacement variables are recoded to 0 for 
that period. Past displacement, within the eight-year window, takes precedence over 
the two-year record of job mobility; i.e., a worker who lost his job eight years ago is
coded as DISP8 = 1, and UP2 = SAME2 = 0 regardless of what happened over the 
previous two years. This is because we want to measure the mid-term effect of job 
displacement, which may include subsequent wage growth with a new employer. 
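
The coding rules in this paragraph can be sketched as follows. This is an illustrative reading of the rules, not the authors' code: the function name and the 'down' / 'up' / 'same' event labels are hypothetical. When a worker has several displacements in the eight-year window, only the earliest is kept, and any displacement in the window overrides the two-year record.

```python
def code_mobility(history):
    """Code one worker's mobility dummies for a single eight-year window.

    history maps a lag in years (2, 4, 6 or 8) to a hypothetical event label:
    'down' (displacement), 'up' (upward employer change) or 'same' (same employer).
    """
    codes = {"DISP2": 0, "DISP4": 0, "DISP6": 0, "DISP8": 0, "UP2": 0, "SAME2": 0}
    displaced_lags = [lag for lag in (2, 4, 6, 8) if history.get(lag) == "down"]
    if displaced_lags:
        # Keep only the earliest displacement; more recent ones are recoded to 0,
        # and the two-year record (UP2, SAME2) is left at 0.
        codes[f"DISP{max(displaced_lags)}"] = 1
    elif history.get(2) == "up":
        codes["UP2"] = 1
    elif history.get(2) == "same":
        codes["SAME2"] = 1
    return codes


# A worker displaced eight years ago who moved up in the last two years.
example = code_mobility({8: "down", 2: "up"})
```

Here `example` has DISP8 = 1 and UP2 = SAME2 = 0, matching the case described in the text.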

Tables 8 and 9 and Graph 1 show the effects of job mobility and changes in 
the composition of the sample on changes in inequality over time. First, Table 8 
shows the effect of each of our categories of job mobility on the two-year change 
in inequality. Overall, the average effect of job loss two years ago on inequality 
is +.005, and the effect declines to +.0022 and +.00005 over four and six years. 
Finally, the effect of job loss eight years ago on the change in inequality between 
time t-2 and t is actually negative, -.0004. As a result, we can conclude that the 
impact of job loss on inequality is largely confined to the first four years, and, 
despite the different construction of the samples, this is consistent with our find- 
ings from the previous set of analyses in tables 3 and 4. In contrast, the average 
effect of staying with the same employer on inequality is .0088, and the impact 
of upward job mobility is close to zero at -.00016. 

To provide an overall estimate of the impact of job mobility on inequality over
each eight-year period in the data, Table 9 presents the cumulative impact of each
type of job mobility on the change in inequality over the previous eight years. We

Table 7A: Sample Sizes, by Year

                  Temporary Turnover    Sample Changes
Year      Both    Return     Out        Insample   Outsample
1975         0         0      82           1,411         139
1977     1,185         0      49             226         134
1979     1,219        70      15             164         189
1981     1,177        47      92             132         156
1983     1,095        70      87             101         141
1985     1,038        82      73             107         138
1987     1,020        97      66             107         133
1989     1,027        72      76             117         141
1991     1,003        74     112             109         133
1993       937        69      83             131         124
1995       927        98    1417             199         107
1997     1,001        98     115             130         106
1999     1,010       121     100             152         134
2001     1,058       110     147             123         154
2003       998        99       0              1s         227
2005     1,032       167       0             156       1,362
Average  982.4      79.6    79.6          140.5¹      143.7²

Note: ¹Excluding 1975. ²Excluding 2005.
See Table 7 for definition of the sample types.
BOTH, RETURN and INSAMPLE define the sample size of log wages for time t.


Table 7B: Number of Cases by Category of Job Mobility

Year     Disp2  Disp4  Disp6  Disp8    Up2  Same2
1975         0      0      0      0      0      0
1977       191      0      0      0    192    802
1979       139    212      0      0    161    803
1981        65    146    210      0     75    791
1983        66     90    142    205     48     13
1985        54    103    112    180     82    701
1987        54      1    129    131     69    750
1989        67     78     98    138     84    710
1991        61     93     95    105     88    708
1993        47     84    100     95     65    696
1995        49     76     96    108     70    119
1997        46     73     91    109     96    764
1999        74     76     80     98    137    740
2001        85    110     85     86    114    748
2003        65    113    123     88     94    706
2005        70    109    120    122    110    731
Average   70.8   90.0   92.6   91.6   92.8  692.6


Table 8: Effect of the Type of Job Mobility on the 2-year Change in Inequality

Year      Disp2   Disp4   Disp6   Disp8     Up2   Same2
1977      .0125       0       0       0  -.0048   .0101
1979      .0063  -.0043       0       0  -.0109   .0046
1981      .0059  -.0073   .0052       0  -.0032   .0072
1983      .0039   .0043       0   .0018    .006   .0041
1985      .0052   .0083   .0009   .0024  -.0005   .0067
1987      .0078   .0015   .0026   .0017  -.0026   .0254
1989      .0048   .0015   .0041  -.0044    .007   .0033
1991      .0116   .0052   .0017  -.0078  -.0019   .0085
1993      .0022   .0023  -.0036   .0034   -.006   .0306
1995      .0053   .0044  -.0067  -.0042  -.0059  -.0017
1997      .0071   .0032  -.0005  -.0032   .0017   .0044
1999      .0017   .0034   .0043  -.0012   .0057   .0185
2001      .0014   .0132  -.0001  -.0001   .0087   .0122
2003      .0044  -.0024  -.0033   .0033   .0068  -.0018
2005      .0041  -.0007  -.0026  -.0012  -.0026   .0008
Average   .0051   .0022   .0001  -.0006  -.0002   .0089

Notes: For sample sizes, see Table 8b.


add the combined effect of the job displacement variables over the past eight years, 
although Table 8 indicates that most of the effect will be due to Disp2 and Disp4. 
In addition, we include the cumulative effect of the rotation of sample members 
in and out of the panel (“Churn”) and the impact of sample turnover due to
adding and losing sample members (ΔSample). Overall, Table 9 shows that the
average effect of the rotation of sample members, CHURN, is fairly small, with
a cumulative effect of -.0069 over the average eight-year period. In contrast, the
impact of the addition and attrition of sample members, ΔSample, is nontrivial. On
average it changes inequality by -.0283 over each eight-year period. The reason
ΔSample is negative is that the dispersion of wages increases by age (or potential
labor market experience), even though we have controlled for the basic life cycle 
increase in average wages by replacing log wages with the residual of a regression 
on experience and dummy variables for year. The increase in wage dispersion over 
time at the individual level reflects the accumulated impact of the different types of 
job mobility on inequality as calculated via Equation 5. Thus, as cohorts of sample 
members retire, the “memory” of their collective effect on inequality is lost on the 
sample, and overall inequality, on average, is reduced. 

Finally, Table 9 and Graph 1 show the accumulated eight-year effect of the 
different types of job mobility on inequality. Our key contribution here is that 
we can measure the overall impact of job loss by calculating the combined effect 
of the displacement variables. Job displacement increases inequality by .0308
over the average eight-year period, which represents 39 percent of the increase
attributable to wage changes in the rotating two-year samples.⁹ This compares to
an average effect of .0407 for workers who stayed with the same employer and
.0025 for upward mobility. Graph 1 depicts these same trends over time. The most
recent years of data, in 2003 and 2005, show that the effects of all three types of 
job mobility on inequality are fairly equal, although more years of data would be 
needed in order to determine whether this represents an actual time trend or just a 
random fluctuation. Overall, the results in tables 8 and 9 confirm the basic results 
presented earlier in tables 4 and 5: Job displacement increases inequality, but the 
effect declines over time since displacement, and the magnitude of the effect is 
smaller than the impact of differential wage growth for workers who stay with the 
same employer. Nonetheless, the “turbulence” in the wage distribution caused by 
job loss two and four years ago has a very real impact on the level of inequality at 
almost all time points in the PSID data. 


Discussion and Conclusions 


In this article, we estimate the impact of job mobility on wage inequality by calculat- 
ing the effect of wage changes attributable to each type of job transition on changes 
in overall inequality. Because there is no consistent coding of the reasons for job
mobility in the PSID data, we use a measure of downward between-employer mo- 
bility— moving to a new employer with a loss in wages—as a proxy for involuntary 
employer changes. While some involuntary job mobility might result in an increase 
in wages, we believe that our measure captures what is meant by the negative effect 
of involuntary employer changes on wages and inequality. Overall, the PSID is the 
best available dataset for this analysis because it covers multiple cohorts of workers 
over long periods of time, permitting an analysis of the effects of job mobility that 
allows us to differentiate between transitory, short-term and more permanent effects. 
Our key finding is that longitudinal data from the PSID indicate that the 
majority of the increase in wage inequality for men in the United States from 1977 
to 2005 is due to differential rates of wage growth within firms. We do find that 
over a two- or four-year period, downward employer mobility has a significant 
effect on inequality, as indicated by panels A and B of Table 4. Indeed, in Table 10 
we find that the inequality-generating effect of the short-term “turbulence” in the 
wage distribution caused by downward job mobility is significant, representing 39 
percent of the increase in inequality attributable to wage changes over the average 
eight-year period between 1985 and 2005. However, the size of the effect of downward
between-employer mobility is reduced by about 30 percent when we go
from a two- to six-year window of observation in Panel C of Table 4. In contrast,
the inequality-generating effect of wage growth for workers who stay with the same
employers is much larger when we extend our period of analysis from two to six 
years in Panel C of Table 4, as well as in the overall results reported in Table 10. 
The result that within-employer wage growth accounts for most of the increase
in wage inequality raises serious questions about the validity of arguments that
greater employment instability was primarily responsible for the observed growth
in wage inequality. Instead, the growth in wage inequality examined here appears
to be happening largely within firms rather than through the movement of workers
across firms. While between-employer mobility or job displacement can have an
impact on individual wage trajectories, our results show that it has less of an
effect on overall trends in inequality.

What might account for the importance of differential rates of wage growth 
within employers on trends in inequality? While our PSID data lack the informa- 
tion on firms and employment practices needed to answer this question defini- 
tively, we speculate that some of this may result from growing inequality among 
employers in profitability and other factors associated with their ability and will- 
ingness to pay high wages. 

Overall, the implications of our results are clear: any comprehensive explana- 
tion of the rise in wage inequality in the United States over the past three decades 
must focus on the role that within-firm wage growth—i.e., differential returns to 
firm tenure—plays in labor market inequality. We must explain more systemati- 
cally the sources of heterogeneity among employers and firms in order to account 
for the growth in wage inequality among employees. Linked employer-employee 
data sets (such as the Longitudinal Employer-Household Dynamics data collected 
by the U.S. Census Bureau) that contain information on both the characteristics 
of employers and their employees are needed to account for differences in the 
experiences and earnings trajectories of employees in different organizations. 


Notes 


1. The differences between these studies may reflect in part differences in assumptions
about measuring year-to-year changes (see the discussion in Bernhardt et al. 2001:66;
and Gottschalk and Moffitt 1999).

2. Note that Gottschalk and Moffitt’s (1994) approach described above does a decent
job modeling the effect of job mobility on inequality if workers only have one job
change each and we divide each worker's observations into two samples, “before” and
“after” the job change.

3. This procedure misses employer spells that include 1979-1980 but end prior to the
1981 survey.

4. Potential work experience is defined as age - (years of education + 6). We use all years
of PSID data from 1975-2005 in this preliminary regression.


5. We thank an anonymous reviewer for suggesting this decomposition strategy. 
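6. \text{joint-effect} = 2[\mathrm{Cov}(\Delta same, \Delta down) + \mathrm{Cov}(\Delta same, \Delta up) + \mathrm{Cov}(\Delta down, \Delta up)]
= \sum_{a,b} \frac{2}{n} \sum (a - \bar{a})(b - \bar{b}) = \sum_{a,b} -(\bar{a} \times \bar{b})[1 + (p_a + p_b)],
where \sum_{a,b} indicates the possible combinations of Δsame, Δdown and Δup, and p_a and p_b
are the proportion of cases with nonzero levels of a and b respectively.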


7. Below in Equation 6 we also consider an alternative approach that calculates the effect
of job displacement over two-, four-, six- and eight-year periods simultaneously.

8. There are only 195 cases of workers returning after an absence of more than 10 years,
compared to a cumulative total of 1,912 new workers added to the sample after 1975.

9. The overall effect attributable to wage changes, as opposed to sample changes, is the
sum of the effect of job displacement, staying with the same employer, and upward
mobility.


Occupational Status and the Experience of Anger


Jessica L. Collett, University of Notre Dame 
Omar Lizardo, University of Notre Dame 


Current theories in the sociology of emotions posit contradictory expectations regarding 
the relationship between status and the relative experience of anger, with some predict- 
ing a negative relationship and others proposing a positive one. We test the compatibility 
of these opposing hypotheses by examining the relationship between anger and a key 
dimension of socioeconomic status — the occupational status score of an individual’s 
occupation — for a representative sample of Americans. We connect different strands of 
theory and research in the social psychology of emotions to posit a non-linear relation- 
ship between occupational status and the experience of anger. Analyses of data from 
the 1996 General Social Survey's emotions module (N = 1460) are consistent with this 
integrative account. Individuals located at the two opposite ends of the status and pres- 
tige hierarchy are more likely to experience anger than those of middle status. We use 
insight from Blau’s macro-structural theory to help elucidate this complex relationship. 


How is emotional experience shaped by an individual’s location in the power and 
prestige order? This question, while central to early work in the sociology of emo- 
tion (Collins 1975, 1990; Gordon 1990; Hochschild 1975, 1979, 1983; Kemper 
1978; Kemper and Collins 1990), has received relatively little empirical attention 
recently (but see Schieman 2003; Simon and Nath 2004; Turner and Stets 2005 
for a detailed theoretical treatment). This is unfortunate, because the link between 
emotion and patterns of interaction across structural positions, and the role that 
emotion has in the maintenance of social structures, are, in our view, the primary 
contributions and promise of a sociological approach to the study of emotion. 

In this article, we open a path in this direction by focusing on the relatively 
under-examined link between occupational status (Hauser and Warren 1997) 
and anger, an emotion that has been prominently studied in social psychologi- 
cal research (Collins 1990; Hegtvedt 1990; Johnson, Ford and Kaufman 2000; 
Lovaglia and Houser 1996; Ridgeway and Johnson 1990; Shelly 2004; Stets 2004; 
Stets and Harrod 2004). 

While the study of emotion and the study of structural patterns of stratification
have, to date, remained largely separate enterprises, we see many ways in which
these two research streams can complement each other. On the one hand, a focus
on emotional experience as it relates to patterns of micro-interaction can help 
refocus the study of status attainment and class (Grusky and Sorensen 1998), a 
tradition focused on large scale patterns of sorting into different positions (Blau 
and Duncan 1967; Featherman and Houser 1978). On the other hand, precisely 
because studies of micro-interaction sometimes lose focus of how the outcomes 
produced in those local settings feed back into the larger structure (Ridgeway and 
Smith-Lovin 1994; Smith-Lovin 2003), linking emotional experiences with occu- 
pational standing might clarify how this process operates outside of the controlled 
laboratory settings often used in research on status and emotion. 

We focus on anger because it is both an important product of, and contributor 
to, the maintenance and transformation of large scale patterns of inequality and 
stratification (Collins 1990). Anger has been the focus of detailed empirical and 
theoretical treatment in both the structural social psychology of small groups 
(Lovaglia and Houser 1996; Ridgeway and Johnson 1990) and more macro-oriented
analyses of the link between emotion and social structure (Collins 1975;
Turner 2002). In spite of all of the attention that anger has received, the
relationship between anger and status remains ambiguous at best.

In what follows, we specify both the expected relationship between status and 
anger and the mechanisms that may help explain that association. We draw from 
research on emotion rules (Hochschild 1990; Thoits 1990), perceptions of control 
(Mirowsky and Ross 1990), status organizing processes (Ridgeway and Johnson 
1990; Shelly 2004) and structural-relational theories of emotional experience 
(Collins 1990; Johnson, Ford and Kaufman 2000; Kemper 1978; Kemper and 
Collins 1990), while also taking into account an important implication of Blau’s 
(1977) macro-structural theory, to explore the possibility of a curvilinear relation- 
ship between status and anger. 


Anger and Status: Divergent Perspectives 
The Compatibility Hypothesis 


Status and emotion are generally thought to be compatible. That is, the higher 
one’s status, the more positive their emotion (Shelly 2004). Research and theory 
substantiate this view. For example, Lovaglia and Houser (1996) show that high 
status is compatible with positive emotion because it leads to higher degrees of 
influence and positive evaluations from others. Low status, on the other hand, is 
likely to be associated with negative emotion (especially anger) as lowered status 
decreases interpersonal influence and forecloses opportunities for advancement 
and personal enhancement (Conway, DiFazio and Mayman 1999; Lucas and 
Lovaglia 1998). Those of low status are more likely to be chronically situated in 
“aversive environments” that produce anger as a response (Agnew 1997; Berkowitz 
1982) or to experience a chronic state of under-reward for contributions, which 
fosters feelings of anger and distress (Hegtvedt 1990). Furthermore, low-status
individuals are more likely to experience feelings of “helpless anger” (Scheff 1986:74)
because they are situated in disadvantageous structural conditions.

In addition to these structural dynamics, any positive correlation between 
status and the experience of anger is likely partially driven by the extent to which 
individuals occupying status positions abide by rules that either discourage or 
promote the public expression of negative socio-emotional behavior (Hochschild 
1975, 1979, 1983, 1990). Individuals draw on emotion rules as they work over 
and direct their subjective interpretation of events. Engaging in surface and deep 
acting manages both emotional expression and the physiological arousal caused 
by certain social situations, helping ensure that individuals adopt culturally ap- 
propriate feelings and emotional behavior. 

Hochschild (1979:572) connects emotion management to employment and ar- 
gues that more emotion work occurs in settings where it is a worker's job to “make 
and sustain meaning.” While the original focus of the work on emotion manage- 
ment centered on the service sector, there is increasing attention to other occupa- 
tions where concerns with maintaining professionalism lead those of middle and 
upper classes to carefully manage their demeanor, including often suppressing 
negative emotion (Lively 2000). With parents’ child-rearing so closely tied to 
employment experiences (Kohn 1963), these middle- and upper-class parents are 
more likely to encourage children to engage in emotion work than working class 
parents. Therefore, with an emotion rule that discourages negative emotion, status 
should be negatively related to anger (Hochschild 1979, 1983; Smith-Lovin 1995; 
C. Stearns and P. Stearns 1986; P. Stearns and C. Stearns 1985). 

Considering this research alone, the association between increasing status and the 
experience of anger is expected to be negative, with individuals of lower status the 
most likely to experience anger. Nevertheless, focusing on other, more interaction- 
centered, research (Collins 1990; Ridgeway and Johnson 1990) suggests that high 
status individuals experience more situation-specific and episodic anger than those 
of lower status, resulting in a more complex relationship between status and anger. 


The Emotion-Domination Link Hypothesis 


Insofar as anger can be considered an emotion that signals mastery and domina- 
tion over others, there are reasons to expect that high status may also be positively 
linked to the experience of anger. The observation that higher status members of 
groups are more likely to produce negative socio-emotional behavior (actions ac- 
companied by external signals of emotional arousal) in the context of task groups 
is evident in the seminal experiments of Bales and Slater (1955). As Ridgeway and 
Johnson (1990:1192-1193) point out, “task leadership requires disagreement with 
and negative evaluation of others... [as a result] negative socio-emotional behavior 
is more common from high status members than from low status ones.” 

The primary mechanism offered to explain the association between high status
and negative socio-emotional behavior in task groups concerns the higher
performance expectations held by high-status individuals. These expectations
lead high-status individuals to consider their behavior more legitimate and to 
value their contributions over those of lower-status group members. In addition, 
high-status individuals may feel angry and frustrated if they perceive themselves 
as strong and see others as interfering, not following instructions that were given, 
or not showing proper respect (Conway, DiFazio and Mayman 1999). 

Research on emotion that draws on attribution theory suggests that when high- 
status individuals encounter socio-environmental triggers that lead to frustration 
they are less likely to assume responsibility for that state of affairs and more likely 
to blame others (Turner 2002). This leads to a higher likelihood of experiencing 
the negative emotion of anger instead of shame (Scheff 1988) or sadness. This 
is particularly true when interacting with lower-status alters (Lively 2000). As 
Ridgeway and Johnson (1990:1198) observe: When high-status individuals (in 
the context of task groups) attribute “disagreement from another to that other, 
they are likely to experience anger and be inclined to express negative socio- 
emotional behavior toward the other.” 

Collins’ (1990) Interaction Ritual Model of emotional dynamics is similar 
to this account. According to Collins, high-status individuals are more likely to 
have a chain of interaction rituals that raises their emotional energy levels and 
lowers those of their lesser-status interaction partners. High emotional energy 
then leads to the experience of anger when faced with an obstacle (Collins 1990). 
Consistent with this, Sloan (2004) finds that members of high-status occupations 
are more likely to experience (and subsequently express) anger directly toward an 
interpersonal target during social interaction at work. In line with Collins’ domi- 
nation hypothesis, she finds that “high status workers most frequently reported 
experiencing anger at work due to the behavior of others that interfered with their 
own endeavors.” (Sloan 2004:65)

According to Identity Control Theory, negative emotion arises out of the failure 
to confirm identity expectations during interaction (Cast and Burke 2002). Because 
high-status individuals are likely to have their identities confirmed during everyday 
interaction (Stets 2004; Stets and Harrod 2004), the specific situations when their 
high-status identities fail to be confirmed are especially salient, meaning they are 
more likely to report such instances, and these situations produce intense nega- 
tive emotion. This negative emotional arousal is elevated when “the source of the 
disconfirmation is a low status other.” (Stets 2004:53, italics added) In other words, 
while any failure to confirm identity expectations in interaction produces negative 
emotion, such failures are particularly salient and distressing for those of high status 
and when the source of the anger-producing experience is a low-status other. 

In contrast to the “compatibility hypothesis,” which suggests a negative relationship
between status and anger, these theoretical perspectives suggest that in
certain situations, especially those involving interaction with low-status alters
(Lively 2000; Ridgeway and Johnson 1990; Sloan 2004; Stets 2004; Stets and
Harrod 2004), high status should be positively connected to anger. The gist of this
second perspective is that among the “negative” emotions, anger in particular will
be experienced by those of high status whenever an interactional challenge to their
privileged position arises on the part of formal subordinates or others perceived as
lower status (Collins 1990; Shelly 2004).¹


An Integrative Account 


Of course, according to the emotional domination model outlined above, the 
effect of these anger-inducing encounters on the relationship between status and 
anger partly depends on the frequency of status-dissimilar interactions. If high- 
status individuals almost never came into contact with lower-status alters, the set 
of mechanisms highlighted above would never be activated, and we might expect 
a simple linear negative association between status and anger as suggested by the 
compatibility hypothesis. However, Blau’s (1977) and Ridgeway and Balkwell’s 
(1997) formulation regarding the macro-structural distribution of encounters 
suggests that high-status individuals might possess, by virtue of their position 
in the social structure, a higher likelihood of having encounters with low-status 
alters — the type of interactions most likely to result in the experience of anger. 

From this perspective, individuals on the top rung of the status ladder, as 
a relative minority group, should be structurally more likely to experience 
encounters with individuals of lower status who comprise a relative majority 
(Ridgeway and Balkwell 1997). Thus, higher status leads to a higher likeli- 
hood of encounters with lower-status alters and these encounters increase the 
probability of the experience of anger on the part of the high-status member 
through the proximate situational and interactional mechanisms of status loss 
(Kemper 1978), status challenge (Collins 1990; Ridgeway and Johnson 1990) 
and identity disconfirmation (Stets 2004). 
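
The structural argument can be made concrete with a small numerical sketch. It assumes that encounters are drawn uniformly at random from the rest of the population, which is a simplification introduced here for illustration and not a claim made in the sources cited; the function name and group sizes are hypothetical.

```python
def cross_status_shares(n_high, n_low):
    """Share of a person's encounters that cross the status line.

    Assumes each encounter partner is drawn uniformly at random from
    everyone else in the population (an illustrative simplification).
    """
    others = n_high + n_low - 1
    high_meets_low = n_low / others   # for a member of the high-status minority
    low_meets_high = n_high / others  # for a member of the low-status majority
    return high_meets_low, low_meets_high


# Hypothetical population: 100 high-status and 900 low-status individuals.
high_share, low_share = cross_status_shares(n_high=100, n_low=900)
```

With these numbers, about 90 percent of a high-status person's encounters involve a low-status alter, while only about 10 percent of a low-status person's encounters involve a high-status alter. The smaller the high-status group, the larger this asymmetry becomes.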

Therefore, we suggest that after a certain threshold of status is crossed, the 
point at which those of high status become a minority in relation to the rest of 
the population, high-status individuals begin to acquire a higher likelihood of 
experiencing status-dissimilar encounters in which they occupy the dominant po- 
sition. As status continues to increase, the anger-eliciting effect of status-dissimilar 
encounters — the “ecology of encounters” (Ridgeway 2000) — begins to outweigh 
the negative first-order effect on the experience of anger (the compatibility per- 
spective). If the above is correct we should expect the experience of anger to be 
unevenly distributed in the status hierarchy, with those at the extreme low and 
high ends more likely to feel anger than those in the middle: 


Hypothesis 1: Individuals located at the extreme ends of the
occupational status hierarchy are more likely to experience
frequent feelings of anger than those in the middle.


Explanatory Mechanisms


The mechanisms that produce increasing feelings of anger should be different for
the groups at the extremes. While the first-order negative effect should be explained
by trans-situational factors such as adherence to emotion rules and a sense of
control over life events, the second-order positive effect is purely structural and
should be explained by the higher likelihood of high-status individuals to have
had a recent anger-eliciting interaction. The data allow us to test for the possible
intervening role of two broad classes of explanatory mechanisms implicated in the
former set of processes: perceptions of control (Mirowsky and Ross 1990, 1991)
and emotion rules (Hochschild 1979, 1983). Controlling for these should therefore
partially account for any relationship between occupational status and anger.


Perceptions of Control 


Gecas (1989) notes that, in most sociological research, individuals’ beliefs about
causality are systematically related to social structure, as one’s sense of control is
often attributed to an efficacy-building middle- and upper-class workplace (Kohn
1963). Not surprisingly, “along with the various undesirable consequences of lower
income, poorer education, poorer work conditions and more uncertain employment
status... [an] absence of control is one of the ‘hidden injuries’ of social class.” (Gecas
1989:304) Status is important to perceptions of control because of its relation to the
opportunities for action afforded to individuals (Mirowsky and Ross 1990).
From this perspective, the feeling of anger is usually triggered by some form 
of frustration or inability to control the external physical or social environment 
(Agnew 1997; Kemper 1978). If this account is correct, we should expect that: 


Hypothesis 2a: As occupational status increases, the 
concomitant sense of control increases. 


Hypothesis 2b: Sense of control should partially explain 
any negative association between the experience of anger 
and occupational status. 


Thus, perceptions of control should stand as a reasonable intervening factor which 
may help explain a negative association between occupational status and anger. 


Adherence to Emotion Rules 


A second plausible mechanism in the negative relationship between status and 
anger is that high-status individuals are more likely to abide by emotion rules that 
encourage anger suppression. According to Stearns and Stearns (1986) these types 
of anger-reducing emotion rules became institutionalized in the upper-middle 
class following the 19th century separation of home from the workplace. From 
this point of view, internalization of cultural codes that restrict negative emotion 
leads high-status individuals to engage in “deep” management (Hochschild 
1979, 1983) of their emotional reactions in potentially anger-producing settings, 
ultimately decreasing the likelihood of experiencing anger. If this is correct, we 
should expect that: 


Hypothesis 3a: Persons of high occupational status should 
be more likely than persons of low occupational status to 
report agreement with an emotion rule that encourages the 
suppression of negative emotion. 


Furthermore, we should find that: 


Hypothesis 3b: Holding constant allegiance to specific 
rules that regulate the expression of anger should partially 
explain any negative association between the experience of 
anger and occupational status. 


Ecology of Encounters 


While the higher probability of persons of low occupational status to experience 
negative emotion is tied to chronic and durable features of low-status positions 
such as lower perceptions of control and lower adherence to emotion rules govern- 
ing the suppression of negative emotion, the higher likelihood of those of high 
occupational status to experience negative emotions is tied to more situational 
features of their interactional ecology. Data limitations preclude us from directly 
testing all of the empirical implications of this second-order effect. Most impor- 
tantly, we cannot directly demonstrate that the ecology of encounters of high- 
status individuals is biased in the way that we propose. However, with the data at 
hand, we can use suggestive indirect evidence to explore the salience, recency and 
target of the anger-producing events for high-status individuals to home in on the 
potential of the “ecology of encounters” argument. 


Data and Variables 


We test our hypotheses using data from the 1996 General Social Survey (N = 
1460). The GSS is administered biennially by the National Opinion Research 
Center to a nationally representative sample of non-institutionalized, English- 
speaking, American adults. The 1996 wave included an emotions module with 
questions regarding the respondents’ recent experience of several emotions. A 
portion of the emotions module was dedicated to asking respondents about 
anger, focusing in particular on the target of the feelings of anger and whom 
they held responsible for the situation that occasioned the anger (self or other 
person). Respondents were also asked about various ways to cope with the anger 
(cognitively, behaviorally, etc.) and their attitudes toward emotional expression 
and the public management of emotion. 


Dependent Variable: The Experience of Anger 


The experience of anger is measured with a series of questions that asked respon- 
dents to report “On how many days in the past 7 days have you... (felt a certain 
emotion)?” We constructed a single “number of angry days in the past week” vari- 
able by taking the maximum response from either of two items asking respondents 
whether they had “felt angry at something or someone,” and had “felt mad at 
something or someone.” The variable has a mean of 1.85 and a standard deviation 
of 2.00, and ranges from a minimum of zero days in the past week to a possible 
maximum of seven days. The distribution of respondents across this variable is 
skewed, following a slightly over-dispersed Poisson distribution, with 57 percent 
of individuals in the 0 to 1 range, and 73 percent in the 0 to 2 interval (thus the 
median of about one day per week having experienced anger is much more indica- 
tive of the emotional history of the modal individual than the mean). 
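
The construction of the dependent variable described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function name `angry_days` and the example respondents are hypothetical, and the sketch does not use the actual GSS item names.

```python
def angry_days(felt_angry_days, felt_mad_days):
    """Combine two 0-7 day counts by keeping the larger of the two."""
    for value in (felt_angry_days, felt_mad_days):
        if not 0 <= value <= 7:
            raise ValueError("day counts must lie between 0 and 7")
    return max(felt_angry_days, felt_mad_days)


# Hypothetical respondents: (days felt angry, days felt mad)
respondents = [(0, 1), (3, 2), (7, 7), (0, 0)]
scores = [angry_days(angry, mad) for angry, mad in respondents]
print(scores)
```

Taking the maximum rather than the sum keeps the combined variable on the same zero-to-seven scale as the original items.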


Independent Variable: Occupational Status 
As Hauser and Warren (1997), Hauser (1998) and Miech et al. (2003) point out, 
occupational status is an important component of overall socioeconomic status. 
Occupational status thus serves as a shorthand summary for those characteristics 
of a person's social position that determine his or her capacity to create and 
access valued material and social resources. Occupational status is also highly cor- 
related with widely recognized symbolic indicators of social standing (Hope 1982). 
Occupational status thus serves to link an individual’s position within markets 
for material resources with membership in exclusive groups of acquaintance and 
sociability (Bourdieu 1986; Weber 1994). Empirical research tends to support 
the notion of occupational status as a distinct component of socioeconomic sta- 
tus, which contributes predictive power net of income and education in models 
predicting a host of important outcomes, including those associated with health 
(Dahl 1994) and social connectivity (Lin 2001). 

Hauser and Warren (1997) and Hauser (1998) show that using one 
or the other of the two most commonly specified components of occupational sta- 
tus — occupational education or occupational earnings — leads to very different sub- 
stantive conclusions in everyday practice. They suggest that the use of a weighted 
measure incorporating both education and economic indicators is “scientifically 
obsolete” and note that future research interested in using occupational status as 
a predictor of a given outcome should not combine them into an omnibus SES 
measure. Separating these two indicators of occupational status when looking at 
their effects on a given outcome is therefore the more appropriate analytic strategy 
because they tend to yield different information about the underlying cultural and 
material resources of interest (Miech et al. 2003). 
The bulk of evidence suggests that occupational education is a more valid (in 
terms of criterion validity) indicator of occupational standing when it comes to 
most outcomes of interest to sociologists, including intergenerational mobility 
(Hauser and Warren 1997) and health outcomes (Miech and Hauser 2001). As 
Hauser (1998:8, italics added) concludes, “one can plausibly regard occupational 
education as the central dimension of intergenerational occupational stratification 
and specify occupational prestige and occupational income each as weak indica- 
tors of that construct.” Consistent with this line of research, we should expect 
occupational education to be a more consistent predictor of emotional experience 
than occupational earnings. 

We follow Hauser and Warren (1997) and Miech et al. (2003) in using two 
separate scalar indicators of occupational rank. As already noted, one scale ranks 
occupations according to empirically observed levels of education of individuals 
holding that title, and the other ranks occupations according to typical earnings 
in that occupation. More specifically, occupational education is the percentage of 
an occupation’s incumbents in the total labor force who had one or more years of 
college education as reported in the 1990 U.S. Census, and occupational earnings 
is the percentage of incumbents in the total labor force who earned $14.30 or 
more per hour in 1989 (Hauser and Warren 1997). Occupational earnings and 
occupational education were transformed into started logits (Hauser and Warren 
1997). Given $p$ as the percentage of respondents above a threshold level, the started 
logit transformation is $\ln[(p + 1)/(100 - p + 1)]$. 
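
A minimal sketch of the started logit transformation, assuming percentages on a 0 to 100 scale and a start value of 1 as in the formula above. The function name `started_logit` and the `start` parameter are illustrative choices, not names taken from the source.

```python
import math


def started_logit(p, start=1.0):
    """Started logit of a percentage p: ln((p + start) / (100 - p + start))."""
    if not 0 <= p <= 100:
        raise ValueError("p must be a percentage between 0 and 100")
    return math.log((p + start) / (100.0 - p + start))


# The start value keeps the transformation finite at 0 and 100 percent,
# where an ordinary logit would be undefined.
for pct in (0, 25, 50, 75, 100):
    print(pct, round(started_logit(pct), 3))
```

The transformation is symmetric around 50 percent, so an occupation with half of its incumbents above the threshold scores zero.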


Intervening Mechanisms 


Adherence to Emotion Rules 
We use one item to measure allegiance to emotion rules regarding anger: “When 
I’m angry I let people know.” Respondents were asked to choose from a five-point 
Likert scale that ranged from “strongly agree” (1) to “strongly disagree” (5) with 
“neither agree nor disagree” as the midpoint (3). These types of statements are 
consonant with Hochschild’s (1990:122) conceptualization of emotion rules that 
“...[refer] specifically to the display or masking of feeling.” Because higher scores 
indicate that respondents are less likely to report expressing anger to others, higher 
levels of this ordinal variable indicate increasing levels of allegiance to the “anger- 
suppression” emotion rule characteristic of middle-class Americans (Stearns and 
Stearns 1985). We should thus expect this variable to have a negative effect on the 
frequency of anger experience. 


Sense of Control 
The sense of control variable is the estimated regression score for the first factor 
(Eigenvalue = 2.32, proportion of variance accounted for = 58%) extracted from 
a principal components factor analysis of the polychoric correlation matrix of the 
four variables that comprise Mirowsky and Ross's (1991) sense of control scale 
(Cronbach’s α = .68): (1) “Most of my problems are due to bad breaks,” (2) “The 
really good things that happen to me are mostly luck,” (3) “I have little control 
over the bad things that happen to me,” and (4) “There’s no sense planning a lot 
— if something good is going to happen, it will.” Higher values on the predicted 
factor indicate higher levels of perceived control over life events. 


Control Variables 


Gender, which has been shown to influence the frequency, intensity and experi- 
ence of anger, with women scoring higher on all (Haukkala 2002; Simon and 
Nath 2004; Thomas 2002), is coded (1) if the respondent is a woman and (0) for 
men. Race, which has been linked to feelings of anger with the external attribu- 
tion of prejudice against members of minority groups (Wong 1996), is a three- 
category variable designating whites, blacks and “other.” Dummy-variable coding 
uses “white” as the reference category. Previous research (Birditt and Fingerman 
2003; Schieman 2003) found that adolescents and young adults are more likely 
to experience and express more anger while older respondents are less likely to 
experience potent emotions such as anger (Lively and Heise 2004), so we also 
controlled for age and the square of age in years. 
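
The coding of the control variables can be illustrated with a short sketch. The function and key names below are hypothetical; only the coding scheme itself (women coded 1, “white” as the reference category, age plus its square) comes from the description above.

```python
def control_row(is_woman, race, age):
    """Build the control-variable codes for one respondent."""
    if race not in ("white", "black", "other"):
        raise ValueError("race must be 'white', 'black' or 'other'")
    return {
        "female": 1 if is_woman else 0,         # men are coded 0
        "black": 1 if race == "black" else 0,   # "white" is the omitted reference
        "other_race": 1 if race == "other" else 0,
        "age": age,
        "age_squared": age ** 2,                # allows a curved age effect
    }


print(control_row(True, "other", 45))
```

With “white” as the reference category, a white respondent has zeros on both race indicators.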


Results 


Poisson vs. Negative Binomial Model 

Because the dependent variable is a count of the number of days having felt angry or 
mad in the past week, we use models appropriate for this type of limited dependent 
variable (Long 1997). Preliminary analyses showed that the Poisson Model was 
not appropriate for these data, due to the presence of over-dispersion. A likelihood 
ratio test of the assumption of conditional mean-variance equality rejected the null 
hypothesis of equidispersion (χ² = 430.33, p < .01) for a model predicting frequency 
of anger experience using occupational education, race, gender and age. 
We followed standard practice in this case by fitting a Negative Binomial Regression 
Model instead of a Poisson Model. The NBR Model accounts for over-dispersion 
through the incorporation of an extra gamma-distributed parameter which allows 
the expected variance of the dependent variable to be different from its expected 
mean (relaxing the Poisson assumption of equi-dispersion). 
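
As a rough illustration of why equidispersion is implausible here, the sketch below compares the unconditional variance and mean reported earlier for the angry-days variable (mean 1.85, standard deviation 2.00). This is not the likelihood ratio test reported above, which is conditional on the predictors. The dispersion parameter computed here is a simple moment-based value that assumes the common variance function mean + alpha × mean², which the text does not specify.

```python
# Summary statistics reported in the text for the weekly angry-days count.
mean_days = 1.85
sd_days = 2.00

variance = sd_days ** 2

# Under a Poisson distribution the variance equals the mean (ratio of 1).
dispersion_ratio = variance / mean_days

# Moment-based dispersion under the assumed variance function
# variance = mean + alpha * mean**2.
implied_alpha = (variance - mean_days) / mean_days ** 2

print(f"variance/mean ratio: {dispersion_ratio:.2f}")
print(f"moment-based alpha:  {implied_alpha:.2f}")
```

A ratio well above one is consistent with the over-dispersion that motivates the Negative Binomial specification.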


Effect of Occupational Education vs. Occupational Earnings 

Hypothesis 1 suggests that the association between status and the experience of an- 
ger is not linear but parabolic (Stolzenberg 1980). Models 1 and 2 in Table 1 show 
the regression results for the model corresponding to the parabolic specification 
of the effects of occupational education and occupational earnings. As shown in 
Model 1, the results are clearly consistent with the composite model (Hypothesis 
1) when using occupational education as our indicator of occupational status. The 
effect of occupational status on the frequency of anger experience is non-linear 
with the maximum-likelihood estimate for the linear effect of status being less 
than zero (β₁ = -.078, p < .05) and the corresponding estimate for the quadratic 
effect being more than zero (β₂ = .026, p < .05). This pattern of results suggests 
that the expected distribution of the frequency of anger experience across the 
linear occupational status dimension should be a parabola with the “U” shape 
(Stolzenberg 1980). Had we limited ourselves to a linear specification of the occu- 
pational education effect, we would have erroneously concluded that there are no 
differences among individuals in different socioeconomic strata in the experience 
of anger (p = .20 for the status coefficient in the linear specification). 
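
The U shape implied by a negative linear and a positive quadratic coefficient can be checked with a short calculation. The sketch below uses the two Model 1 estimates quoted above and assumes a log link, as is standard for count models. The turning point is expressed on whatever scale the status variable was entered in the model; the text does not say whether that variable was centered, so no conversion back to percentages is attempted.

```python
import math

# Model 1 estimates quoted in the text.
b_linear = -0.078
b_quadratic = 0.026


def status_contribution(x):
    """Contribution of status x to the log of the expected count."""
    return b_linear * x + b_quadratic * x ** 2


# A parabola b1*x + b2*x**2 with b2 > 0 reaches its minimum at -b1 / (2*b2).
turning_point = -b_linear / (2 * b_quadratic)


def rate_ratio(x):
    """Expected count at status x relative to the turning point."""
    return math.exp(status_contribution(x) - status_contribution(turning_point))


print(round(turning_point, 2))
print(round(rate_ratio(turning_point - 3), 3), round(rate_ratio(turning_point + 3), 3))
```

Because the curve is symmetric around its minimum, positions equally far below and above the turning point have the same expected count, all else equal.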

Model 2 shows the results obtained using the same parabolic specification of the 
occupational status effect as in Model 1, but this time using occupational earnings 
as our indicator of occupational status. The results show that this variable has no 
statistically discernable impact on the experience of anger, although the coefficients 
are in the expected direction (β₁ = -.026, p < .57; β₂ = -.019, p < .50). These divergent 
results are consistent with previous research that shows occupational education 
to be a more valid indicator of occupational status than is occupational earnings 
(Hauser and Warren 1997; Hauser 1998). This adds confidence to our claim that it 
is something associated with overall social status (rather than income or possession 
of material resources) that generates the observed correlation between social position 
and emotional experience. In the rest of the analyses we therefore limit ourselves 
to the relationship between occupational education and emotion-related outcomes. 

To provide a more concrete picture of the magnitude of the estimated effects 
of occupational education on the frequency of angry feelings, as well as a tangible 
representation of the substantive meaning of the parabolic effect described above, 
Figure 1 shows a line plot of the expected number of days in a typical week an 
individual (white male, at the mean age of 45 years, and of average status) experienced 
feelings of anger. While respondents at the extreme ends of occupational 
education, either of high status (e.g., physicians) or low status (e.g., janitors), are 
expected to experience anger about two and a half days per week on average, for 
respondents toward the middle of the distribution (such as teacher's aides) this 
number drops to about 1.8. As shown by the simulated confidence intervals 
around the coefficient estimates (King, Tomz and Wittenberg 2000), high- and 
low-status respondents are statistically more likely to experience anger compared 
to middle-status respondents, but are indistinguishable from one another. This 
expected weekly difference becomes substantial when we project it toward longer 
stretches in time: in a given year, respondents at the top and the bottom of the 
status ladder are expected to experience anger 36 more days on average than 
middle-status respondents. 
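
The yearly figure follows from the approximate weekly expectations quoted above (about 2.5 days at the extremes and about 1.8 days in the middle). The sketch below reproduces that arithmetic, assuming 52 weeks per year; the variable names are illustrative.

```python
WEEKS_PER_YEAR = 52

# Approximate expected angry days per week, as quoted in the text.
days_at_extremes = 2.5
days_in_middle = 1.8

weekly_gap = days_at_extremes - days_in_middle
extra_days_per_year = weekly_gap * WEEKS_PER_YEAR

print(round(extra_days_per_year))
```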


Occupational Status, Emotion Rules and Perceived Sense of Control 


To explore the relationship between occupational status and (1) adherence to emotion 
rules that regulate the experience of anger and (2) sense of control as possible 
mechanisms behind the (negative) association between occupational status and 
the experience of anger, we first specified an ordered logit regression equation 
with the ordinal variable indexing adherence to an emotion rule that restricts the 
public display of anger to others (Model 3). We then specified an OLS regression 
equation with the sense of control principal factor score as the dependent variable 
(Model 4). All of the models include occupational education as the main predictor 
and age, gender, and race as controls. 

As shown in Model 3 of Table 1, we find that, consistent with Hochschild 
(1979) and work in the history of emotions and social class in the United States 
(Stearns and Stearns 1985), occupational status is indeed associated with an emotion 
rule that restricts the display of anger. As shown by the positive coefficient 
corresponding to occupational education (increasing values on this variable indi- 
cate higher levels of disagreement), persons of high occupational status are more 
likely to disagree with the statement “when I’m angry I let people know,” suggesting 
that they engage more often in conscious (and possibly habitualized) attempts 
to restrict their display of feelings of anger towards others. In addition, as shown 
in Model 4 of Table 1, and consistent with previous research (Mirowsky and Ross 
Figure 1. Expected Number of Days Having Experienced Anger by 
Occupational Status (vertical axis: number of days having felt angry in the past week) 
126.93, p < .001 for the hypothesis test that the coefficient corresponding to the 
effect of status is different from zero). As occupational status increases, a person's 
perception of control over everyday events also increases. 


Emotion Rules, Sense of Control and the Frequency of Anger Experience 


In accord with Hypothesis 2a, which suggests a connection between perceived 
lack of control over life events and the experience of negative emotion (Gecas 
1989), we find that persons who believe they possess high levels of control over 
their environment are significantly less likely to experience anger. This is shown by 
the fact that in Model 5, the subjective sense of control is a strong and significant 
predictor of anger frequency (t = 4.14). Individuals scoring at the top of the sense 
of control scale are expected (holding all other variables at their means) to experience 
anger only 1.5 times per week, while those with the lowest scores are more likely 
to experience anger (about 2.5 times per week). 
Adherence to an emotion rule regarding the suppression of anger has an independent 
(net of age, status, gender and race) effect on the experience of anger (p 
< .01), providing support for Hypothesis 3a. Those respondents who report being 
less likely to let other persons know about their feelings of anger are also less likely 
to report experiencing anger frequently. This is consistent with theories that point 
to the link between emotion expression and emotional experience — where at- 
tempts at managing and suppressing emotion feed back into a person's likelihood 
of experiencing that emotion — as formulated in Hochschild’s (1975, 1979) and 
Shott’s (1979) early work. 

More importantly, the magnitude of the association between occupational sta- 
tus and anger is affected by the inclusion in the equation of indicators of sense of 
control and adherence to an anger suppression rule. The estimate for the linear effect of 
occupational education is now reduced by 50 percent in comparison to the original 
estimate in Model 1. In addition, the linear negative component of the status/ 
anger experience association is no longer statistically distinguishable from zero (p = .15). 
Additional analyses which included both the anger-expression rule indicator and the 
sense of control scale showed that the latter does a much better job of accounting 
for the negative association between occupational status and anger (Gecas 1989; 
Mirowsky and Ross 1990), providing stronger support for Hypothesis 2b (sense 
of control as an important mediating factor) than for 3b (class-based adherence to 
anger-suppression rules as an important mediating factor). 


The Anger-Producing Encounters of High Status Individuals 


Our integration of the compatibility and emotion-domination accounts suggests 
that while the higher probability of persons with low occupational status to experience 
negative emotion may be tied to trans-situational and durable features of 
low-status positions, the higher likelihood of those with high occupational status 
to experience negative emotions is tied to more situational characteristics of their 
interactional ecology. In particular, we argue that high-status individuals will be more 
likely to experience anger (1) in direct person-to-person interaction (Collins 1990), 
(2) when those encounters feature relatively unfamiliar, identity-disconfirming others 
(Stets 2004; Stets and Harrod 2004), and (3) when those interactions are status-dissimilar, 
with alter being of lower status than self (Ridgeway and Johnson 1990). 
We argue that the ecology of encounters of those of high status will tend to become 
more biased with increasing status because of their very position as numerical minorities 
in the social structure (Blau 1977; Ridgeway and Balkwell 1997). 

Data limitations preclude us from testing all of the empirical implications of 
this model. Most importantly, we cannot directly demonstrate that the ecology 
of encounters of high-status individuals is biased in the way that we propose, as 
this would require direct “experience sampling” of the features of all interpersonal 
encounters of a representative sample of the population (Osborn and Stets 2007). 
a specific anger-producing event in the past week. The results show that net of 
socio-demographic controls, persons of high occupational status are more likely to 
remember such an event (t = 3.39). It is important to note that in contrast to the 
relationship between occupational status and anger experience, the relationship between 
occupational status and the odds of recalling an anger-producing interaction is 
linear and not curvilinear. Results of a logistic regression model predicting recall 
of an anger-producing event that includes a quadratic term for status suggest 
rejecting the curvilinearity hypothesis. 


Occupational Status and the Recency of Anger-Producing Events 

If high-status individuals are structurally “biased” toward experiencing relatively 
frequent anger-producing encounters, then we should find that, among those who 
report remembering an event, high-status individuals should be more likely to report 
remembering a more recent event. Figure 2 shows evidence consistent with this 
claim. The predicted probabilities in the figure come from a multinomial logit equa- 
tion with a self-reported estimate of how recent the anger-producing event recalled 
was as the dependent variable and occupational education (with the effect specified 
to be non-linear), age, race and gender as the predictors. The figure shows that 
indeed, as occupational education increases, the chances of remembering an event 
that happened very recently increase, while the chances of remembering a more 
distant event decline. This suggests high-status individuals are more likely than low- 
status individuals to be exposed to routine, recurrent anger-producing interpersonal 
interactions which remain salient in memory to a larger extent. 


Occupational Status and the Micro-Ecology of Anger-Producing Encounters 

We also anticipate that high-status individuals should be more likely to report 
directing their anger toward specific persons rather than toward unspecified crowds, 
events, circumstances and other impersonal objects. Furthermore we should find 
that within the category of persons, the anger-producing interactions of high- 
status persons should feature (relatively) unfamiliar others. Finally we should find 
that the targets of anger of high-status persons should be lower-status others, not 
similarly high-status alters. Models 2-8 in Table 2 speak to these questions. 

As Model 2 of Table 2 shows, among those individuals who reported recalling 
an anger-producing event (N = 1,091), as occupational education increases the 
odds of reporting having been angry at a person rather than at circumstances or 
other impersonal objects increase (t = 2.58). In addition, as shown by the coefficient 
estimate for occupational status in Model 3, we find that, among those 
respondents who recalled an event and who reported that the target of their anger 
was a person, high occupational status persons are much less likely to be angry at 
family members (t = -3.04). Model 4 shows that persons across the occupational 
status continuum are equally likely to have directed their anger towards friends, 
neighbors or acquaintances (t = -.74). This is in contrast to the result shown in 
Figure 2. Expected Probability of Having Anger-Producing Event Recall by 
Occupational Status (vertical axis: predicted probability) 
Model 5. The positive and statistically significant coefficient for occupational 
status in this model suggests that high occupational status persons are much more 
likely to have encountered the target of their anger at work (t = 2.35), not at home. 
This suggests a natural ordering of the likelihood that a person will be featured 
in an anger-producing interaction for high-status — and conversely for low-status 
— persons, with family being disproportionately least likely, friends being equally 
likely and persons encountered in the instrumental arenas of work and other 
public settings being disproportionately more likely. This also means that when 
lower-status individuals experience anger, they are disproportionately more likely 
to direct that anger toward familiar others (spouses, children and kin), who form 
a more substantial part of their immediate social network (Marsden 1987). 
What are the characteristics of those alters towards whom high-status persons 
direct their anger? As Models 6-8 show, we find that the relative status of the target 
(Johnson, Ford and Kaufman 2000; Ridgeway and Johnson 1990) in relation to 
the focal person affects the likelihood that a given encounter will result in an experience 
of anger. As shown in Model 6, among those individuals who report having 
recalled an anger-producing interaction at work (N = 369), high occupational 
status persons are statistically no more likely to have experienced anger than low-status 
persons in an interpersonal interaction involving a superior (e.g., boss or supervisor), 
although the coefficient is in the negative direction. However, as shown 
in Model 7, they are substantially more likely to report a relative status inferior at 
work as the target of their anger, suggesting that these status-dissimilar encounters 
are more likely to produce anger among high-status persons (Lively 2000). Following 
a similar dynamic, high-status persons are also statistically more likely to report 
recalling an anger-producing event involving a government worker or someone 
who provided a service. These are the types of encounters classically documented 
by Hochschild (1983), and which can be interpreted here as involving the local 
interactional ingredients (relative lack of familiarity, status differentials) likely to 
generate anger among high-status individuals. 

Taken together, these results suggest that when encounters produce the experience 
of anger for high-status persons, (1) these tend to feature relatively unfamiliar others 
outside of intimate sites of sociability (for example, in the workplace and other public settings), 
and (2) those others tend to be of lower (not of equal or higher) status than self. 


Discussion and Conclusion 
Summary of the Results 


The finding that those at extreme ends of the occupational status hierarchy are 
more likely to experience anger points to the uneven distribution of anger in the 
status structure (Collins 1990). Our analysis of the possible intervening mechanisms 
between occupational status and anger experience shows that when the two 
primary trans-situational factors responsible for the negative effect of status on the 
experience of anger are held constant, namely, the sense of control over life events 
and allegiance to specific emotion rules, the relationship between occupational 
status and anger experience is no longer described by a “U-shaped” curve, but is 
instead monotonically and exponentially increasing with status. 

We argue that the positive association between status and anger experience 
that remains (indicated by the quadratic component of the occupational educa- 
tion effect) is at least partially the result of factors associated with the features of 
the micro-ecology of encounters as these are shaped and biased by the structural 
position of high-status persons. In examining data from the last anger-producing 
encounter recalled by a representative sample of individuals, we find that (1. high- 
status individuals are more likely to have anger-producing encounters outside 
of the private realm, among unfamiliar others (Stets 2004), and (2. anger is sys- 
tematically directed downwards in the status ladder as high-status individuals are 
more likely to have felt anger against an alter of lower status than against a higher 
status target (Johnson, Ford and Kaufman 2000; Ridgeway and Johnson 1990). 
Our results in this last regard are only suggestive, but they are consistent with 
previous research and theory (Collins 1990; Ridgeway and Johnson 1990). We 
believe these findings warrant further exploration. 

Our main finding of a U-shaped distribution of the relative frequency of anger 
experience across the occupational status dimension is consistent with (and serves 
to partially reconcile) the two leading perspectives linking status and anger in 
structural social psychology and the sociology of emotions (the compatibility and 
domination accounts). Our integrative account, however, also qualifies them and 
specifies their range of applicability. We suggest that anger is likely produced by 
different mechanisms for the two status groups, with the compatibility mecha- 
nisms being more plausible descriptors of the chronic anger-producing experiences 
among low-status persons, and the domination mechanisms being more accurate 
descriptors of the situational anger-producing experiences of high-status persons. 

We find that the relatively trans-situational factor of sense of control is — in 
addition to adherence to an emotion rule on anger suppression — an important 
intervening link between occupational status and anger for low-status individuals 
(Aneshensel 1992; Gordon 1990; Mirowsky and Ross 1990; Stearns and Stearns 
1985). Chronic stressors that are personally threatening, such as those lower-status 
individuals experience, are more conducive to psychological distress than the “hassles” 
(Thoits 1995) most likely experienced by those of higher status (Umberson, 
Williams and Anderson 2002). The higher likelihood of those in low-status positions 
to experience anger appears to be tied to the typical day-to-day experiences of those 
in low-status positions; in particular, the sense of not being in control of one’s life 
leads to frustration and anger (Berkowitz 1982). Given the connection between 
negative emotional experience and health-related outcomes (Aneshensel 1992), the 
occupational status/experience of anger linkage may be an important member of
the constellation of factors that serve to reproduce the relationship of “fundamental
causality” between socio-economic status and health (Lutfey and Freese 2005). 

Further, reports by low-status individuals that the targets of their anger are
most often family members are consistent with previous research. Individuals
lacking personal control outside the home tend to attempt to regain control by 
exerting it within the home (Umberson, Williams and Anderson 2002), which 
some argue is related to higher incidences of domestic violence in lower-status 
households (Umberson, Anderson, Glick and Shapiro 1998). In other words, even 
if the threats to control come from outside intimate relationships, individuals 
experiencing this type of stress often express it toward family members, partially 
explaining the relationship between low socio-economic status, life strain and 
violent behavior (Aneshensel 1992; Umberson, Anderson, Glick and Shapiro 
1998; Umberson, Williams and Anderson 2002). 


Limitations of the Study and Future Directions 


Although we were unable to provide a direct test of the “ecology of encounters” 
argument with the General Social Survey emotions module, our results tentatively
suggest that high-status individuals’ pronounced rates of anger appear to
be connected to specific episodes of interaction. Future research should address 
this head-on, and we believe that collecting the data required for such a venture 
is possible and desirable. One potential source might be an experience sampling 
method (Osborn and Stets 2007). It is our hope that our current research encour- 
ages others to no longer assume an either-or relationship between status and anger 
while taking into account the various sources of anger. 

In a social world where few cross-status interactions take place, the associa- 
tion between occupational status and anger might well look, as the compatibility 
perspective suggests, like anger is concentrated in the lower levels of the status 
structure. However, due to social structural arrangements (e.g., economically 
mediated transactions involving organizational representatives, interaction across 
levels of hierarchical differentiation in the bureaucratically organized workplace) 
that induce high-status individuals to interact with lower-status alters and the 
propensity for such interactions to result in anger, individuals of ultra-high status 
are induced to experience anger. 


Contributions and Broader Implications 


This article expands and contributes to theory and research in several areas. 
Following Collins (1975, 1990; Kemper and Collins 1990) and recent advances 
in the interactionist view of inequality and stratification (e.g., Schwalbe, Godwin, 
Holden, Schrock, Thompson and Wolkomir 2000), we view socio-economic status
as leading not only to material or structural rewards (Featherman and Hauser
1978), but also to a higher likelihood of exercising emotional domination over
others (Collins 1990), especially lower-status alters encountered in the public 
sphere. However, we acknowledge that part of this dynamic is the product of the 
structural position of those in high-status positions, insofar as they are forced by 
the distribution of individuals to come into contact with individuals over whom
they can exercise interactional dominance through the expression of negative 
socio-emotional behavior. 

Conversely, our analyses support previous literature’s assertion (Gecas 1989) 
that chronic anger may be an important component of the “hidden injuries of 
class.” Those at the bottom of the power and prestige hierarchy are more likely to 
feel continuous anger for longer periods. Further, the feelings of anger of those 
at the bottom of the status hierarchy appear to be disconnected from specific 
situations and are more likely to be directed toward impersonal targets. The fact 
that they are more likely to take objects or circumstances, rather than persons, as 
a target may account for their endurance. In this way anger seems to play a role 
in the “chronically instantiated” reproduction of low-status positions in the social 
structure (Giddens 1984; Ridgeway and Smith-Lovin 1994), insofar as “objectless” 
anger may be less likely to find resolution or to lead to self-assertive action than
situational anger in concrete settings.

Our guiding imagery throughout is largely consonant with that employed by
Ridgeway and Smith-Lovin (1994) when they propose the link between micro- 
structure and macrostructure. Ridgeway and Smith-Lovin view microstructure as 
shaped and informed by larger cultural and structural factors, which the outcome 
of microinteraction goes on to reproduce, and in some instances modify. From 
this perspective, individuals bring into the interactional setting information, bi- 
ases and habits related to their position in the larger macro-setting. The dynamics 
associated with the face-to-face encounter, as well as its organization in time and
space (Giddens 1984), are indelibly affected by these resources. 


Notes 


1. While our concern here is experience, the subsequent expression of anger by high- 
status individuals can then be seen as a strategy to sustain their position in the status 
structure and to enact the power of institutionalized roles. 


2. Respondents were queried about whether they had experienced a variety of primary 
(anger, sadness, happiness, etc.) and secondary (shame, embarrassment, outrage, 
pride, etc.) emotions in the past week. The specific item reads: “Now I’m going to
read a list of different feelings that people sometimes have. After each one, we would like 
you to tell me on how many days you have felt this way during the past seven days.” 


3. Out of the 1,451 respondents who participated in the 1996 emotions module, 56 
(4%) did not report an occupation. Among these 56 respondents with missing data 
on occupation, 35 (54.7%) were classified (according to the work status variable
WRKSTAT) as “keeping house,” 15 (23.4%) as “in school,” 3 as “working, fulltime,” 
2 (3.1%) as “unemployed,” and 1 (1.5%) as “other.” We included these individuals 
in the analysis by assigning them a value of occupation using the following rule: if 
the individual was married, we assigned them the value of their spouse’s occupation. 
If the individual was not married, we assigned the value of their father’s occupation, 
and if that value was not available that of their mother’s occupation. Excluding these 
individuals from the analysis changes none of the substantive conclusions. 
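
The fallback rule described in this note can be sketched as a small function. This is only an illustration of the stated rule, not the authors' code: the function name, the argument names and the use of `None` for a missing value are hypothetical, and the note does not say what was done when a married respondent's spouse also lacked an occupation, so the sketch simply returns the (possibly missing) spouse value in that case.

```python
from typing import Optional


def impute_occupation(
    own: Optional[int],
    married: bool,
    spouse: Optional[int] = None,
    father: Optional[int] = None,
    mother: Optional[int] = None,
) -> Optional[int]:
    """Return an occupation code using the fallback rule in Note 3.

    All names are hypothetical; None stands for a missing occupation.
    """
    # Respondents who reported an occupation keep their own value.
    if own is not None:
        return own
    # Married respondents receive their spouse's occupation.
    # Assumption: if the spouse value is also missing, the result stays missing.
    if married:
        return spouse
    # Unmarried respondents receive the father's occupation,
    # or the mother's occupation when the father's is unavailable.
    return father if father is not None else mother
```

For example, `impute_occupation(None, False, father=300, mother=400)` returns the father's code, while the same call without `father` falls back to the mother's code.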


4. Note that these scores are defined at the detailed three-digit 1980 census classification 
level (OCC80). For details concerning the design, construction and external 
validation of these scores, see Hauser and Warren (1997).


5. Respondents who were missing on one of the items were assigned the
average of the remaining non-missing items in the final scale. Respondents who were
missing on all items were coded as missing.
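
As a rough illustration of the rule in this note, the sketch below replaces a missing item with the mean of the respondent's observed items and codes the respondent as missing when no item was answered. The function name and the use of `None` for a missing response are hypothetical; the sketch also applies the same fill when more than one item is missing, which the note does not address.

```python
from typing import List, Optional, Sequence


def fill_missing_items(items: Sequence[Optional[float]]) -> Optional[List[float]]:
    """Fill missing scale items with the mean of the observed items.

    Returns None when every item is missing (respondent coded as missing).
    """
    observed = [x for x in items if x is not None]
    if not observed:
        return None
    mean = sum(observed) / len(observed)
    # Keep observed responses; substitute the respondent's own mean otherwise.
    return [mean if x is None else float(x) for x in items]
```

With three items scored 1, missing and 3, the missing item is filled with 2.0, the mean of the two observed responses.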


6. This result is also consistent with the identity-control theory proposal that such
anger-producing encounters are particularly salient for high-status individuals because they
are identity disconfirming.
Political Capital in a Market Economy


Victor Nee, Cornell University 
Sonja Opper, Lund University 


This research applies a transaction-focused institutional analysis to compare the value 
of political capital in different institutional domains of China’s market economy. Our 
results show that the value of political capital is associated with institutional domains 
of the economy in which agents can use political connections to secure advantages. 
Political capital is most fungible in institutional domains where government restricts 
economic activity. In this sense, the value of political connections in China does not 
differ fundamentally from patterns observable in established market economies. We 
interpret this as evidence suggesting China may have experienced a tipping point in 
its transition to a market economy around the turn of the new century. 





Introduction 


That political connections grease the wheels in all economic orders is a near-universal
supposition. In non-market economies, production and distribution turn
on the power of principals at the center of a hierarchy. As cadres of a redistributive 
state, political actors directly set the prices; hence, agents seeking comparative 
advantage compete for positional power in the hierarchy and cultivate political 
connections (Szelényi 1983). In market economies, political capital also matters. 
Political actors impose restrictions on economic activities, giving rise to rents 
for which agents compete. Whether to control entry of new competitors or to 
promote strategic interests, desire to capture or channel the coercive power of the 
state motivates the demand for regulation favorable to an industry or firm (Stigler 
1971). Given the fungibility of political capital in all types of economies, it is not 
surprising that its value in transition economies has motivated cross-disciplinary 
research in the social sciences. 

Recent empirical research on transition economies indicates that at the firm- 
level there are no simple answers about the overall economic value of political 
capital. On the one hand, there is evidence indicating that state-owned firms 
with connections to the political elite perform worse than private firms (Fan et al. 
2007). Firms with partial state ownership are forced to maintain higher employment
levels and often serve as tools for private enrichment of politicians (Shleifer
and Vishny 1994). Moreover, political screening is a persistent feature of party 
recruitment for the managerial elite (Bian et al. 2001). On the other hand, there 
is some indication that political capital may still have a positive impact on firm eq- 
uity value and measures of firm performance (Peng and Luo 2000). This suggests 
that politically connected firms can secure positional advantages over unconnected 
firms, even if overall firm productivity might be lower. Such advantages may be 
most pronounced under tight resource constraints. 

To identify whether economic actors and firms actually secure advantage 
through reliance on political capital, our analysis examines in what institutional 
domains political connections might be substantively helpful to the firm’s chances 
for survival and profitability. This approach also allows us to identify what features 
of the institutional environment contribute to either preserving or reducing the 
value of political capital. We define institutional domains as distinct institutional 
arrangements that enable and guide economic transactions. These include eco- 
nomic markets ranging from competitive markets to state-controlled markets, and 
political markets such as public service provision and regulatory supervision of 
markets. Broadly, our use of institutional domain is similar to Scott and Meyer's 
(1991) conception of “societal sectors” in their analysis of the variability of institu- 
tional elements that shape the environment in which organizational actors interact. 

Two distinct entrepreneurial types coexist, and both seem to prosper in their
own ways: There are purely competitive players — often active in privately organized 
markets and knowledge-intensive sectors — who refuse any type of political network- 
ing. The general manager of a computer company in Zhejiang province rejected the 
idea of playing the “game of politics,” and explained, “In my sector, the government 
cannot give me much, not much tax breaks, and not much government contracts.” 
In contrast, there are many who invest heavily in political networking through 
party donations, active involvement as party secretaries or vice deputies, invitations 
to government officials to serve on so-called “expert committees” and maintaining 
close social ties with local government authorities. These entrepreneurs often regard 
their political connections as their most important asset. 

Evidently, assessments of the pervasiveness and sources of political capital 
advantages need to go beyond aggregate claims of whether political capital is 
diminished or augmented in departures from state socialism. We focus on distinct 
transaction outcomes across a broad spectrum of company activities to compare 
the value of political capital in different institutional domains of China’s market 
economy. A transaction is defined as any action that involves an exchange of an en- 
tity — such as a commodity, reward, sentiment, opinion or information — from one 
actor to another (Homans 1974; Greif 2006). A transaction-focused institutional 
analysis shifts attention to examine the nature of the institutional arrangements 
in which economic actors compete and cooperate to secure rewards. An advantage 
of a focus on distinct transactions is that it allows for a more nuanced view of the 
fungibility of political capital. This also provides a fresh approach to assess the
claim that in transition economies political connections continue to be highly 
fungible “not only despite, but because of marketization” (Parish and Michelson
1996:1045). Aggregate, society-wide claims about the value of political connections
would be rejected if the fungibility of political capital varied in the context
of discrete institutional domains in which firms actually transact. We argue instead 
that in a market economy the fungibility of political capital corresponds with the 
extent to which government controls economic activity. 


What is Political Capital? 


Political capital shares with other forms of capital the quality of being productive
insofar as it makes possible the realization of interests that would not be
achieved otherwise; and its fungibility is, like that of physical capital, limited to specified
conditions. Although it shares these attributes with physical and human capital, 
political capital is more similar to social capital in that these forms of capital ac- 
cumulate in relational ties. As Coleman (1988:98) observed, the differentiating 
feature of social capital is that it “inheres in the structure of relations between 
actors and among actors.” Unlike social capital, however, political capital has the 
additional feature of being linked to the positional power of the politician, and 
thus it is rooted in institutional structures of the political order. When politicians 
transact with other politicians, their cumulative investment in political capital 
embeds itself both in the institutional arrangement of political exchange and in 
ongoing relationships among political actors. For example, Shepsle and Weingast 
(1987) show that the custom of deference and reciprocity among politicians in 
Congress is enforced by the institutional mechanism of ex post veto by committees
as a way of explaining the effectiveness of committees. An anecdote they use
illustrates the exchange: “Sure, I let those people over on Education and Labor
do pretty much what they think is reasonable. And they do the same for us on
Armed Services. That’s the way things are done around here” (100). This custom
of deference and reciprocity is maintained by institutionalized norms and rules 
that enable enforcement of subcommittee autonomy. Thus the quality of informal 
and formal institutions contributes to the productivity and fungibility of political 
connections (Evans 1995; Nee and Opper 2007). 

Another dimension of political capital’s existence in relationships has to do 
with the ties that connect politicians with their constituencies. The commonsense
rendition of this aspect of political capital emphasizes the public’s approval
or disapproval of a politician’s performance. A politician gains political capital 
through successful policies. But the approval rating of a politician is ephemeral, 
whereas political capital is embedded in a more enduring manner in concrete 
relationships with constituents. In these relationships, obligations, expectations 
and trust stem from social exchange in which a politician does something for 
a constituent and trusts that he or she will reciprocate in the future. This, over 
time, establishes an expectation. If a politician accumulates a large volume of
credit via these political favors, then the analogy to economic capital is clear. 
Political capital in this view is both productive and fungible, and importantly, 
because it inheres in the relationship, both politician and constituent can se- 
cure gains by maintaining the relationship. Further, political connections serve 
as conduits of information, with information flow favoring rent-seekers with 
the inside track to the politician. The institution of lobbying operates on the 
fungibility and productiveness of political connections as the basis of mutual 
expectations, obligations, trust and timely information. 

As a form of capital, the distinguishing feature of political capital is that it is 
embedded in social relationships and in political institutions. This duality, existing 
as an attribute of both positional power and relational ties, means that analysis 
of political capital must incorporate a focus on the institutional context in which 
transactions are conducted. To specify the productivity and fungibility of political 
capital, the focus of analysis needs to examine transactions between economic and 
political actors in concrete institutional domains of the economy. 


Of Markets and Political Connections 


Our approach draws on new institutionalism in economic sociology in its fo- 
cus on the effect of the institutional environment on firm behavior (Nee 2005). 
Organizations are responsive to the nature of the resource and power arrange- 
ments in the institutional environment (Pfeffer and Salancik 1978; DiMaggio and 
Powell 1983; Fligstein 2001). This suggests variability in the responses of firms 
to their institutional environment. For example, Meyer, Scott and Strang (1987) 
and Strang (1987) examine the effects of expanding federal and state involvement 
in education on the administrative complexity of local school districts. They show 
that the level of school district administrative complexity depends on the relative 
level and scope of federal, state and local funding. An analogous pattern of close 
coupling of administrative complexity and fiscal budgets of central, provincial and 
local governments exists in China (Whiting 2000). 

In command economies, the state assumed monopoly power over the alloca- 
tion of all resources. Productive assets from farmland to factories were owned and 
managed by the state, which set prices by administrative fiat to control the al- 
location of resources. Clearly, under the central plan government bureaucrats and 
party officials maintained an overwhelming advantage in power over economic ac- 
tors. A firm's access to resources and its bargaining over production quotas mainly 
depended on positional advantage stemming from political connections with 
planning authorities. In decentralized market transactions, terms of exchange are 
determined by both sellers and buyers. The gradual replacement of state planning 
by market allocation gives rise to a self-reinforcing empowerment of the consumer 
as an economic actor. This endogenously motivates strategic adjustments by firms 
to the emergent market economy, which in turn undermine the previous institutional
foundations of firm survival. Whether firms choose to rely on organizational
improvements, superior exploitation of assets, higher innovativeness or minimized
“time to market,” markets shift the focus to the importance of capabilities, as opposed
to investing in positional advantage through political capital.

The greater importance that firm managers put on the development of firm ca- 
pabilities is evidenced by rapid strategic changes in China’s economy. Widespread 
experimentation with new organizational forms, gradual divestiture of state own- 
ership and the emergence of new property arrangements illustrate the search for 
a better fit between firm strategy and external environment (Nee 1992). Before 
the start of economic reforms in 1978, China’s industrial sector was made up of 
only two organizational forms (state-owned enterprises and collective enterprises). 
Now, China's national statistics differentiate 10 distinct organizational forms 
in the urban industrial sector (National Bureau of Statistics of China). Rapidly 
expanding research budgets, employee training, patenting activities and shortened 
product life cycles signal that firms actively seek to develop new capabilities to gain 
and secure competitive advantages in market niches. China's national research and 
development expenditures, for example, increased from 0.8 percent to 1.3 percent
of gross domestic product between 1999 and 2003. Further, more than 60 percent 
of R&D funds are provided by firms themselves (National Bureau of Statistics/ 
Ministry of Science and Technology 2005). 

Concurrent with these trends, organizational autonomy embedded in decen- 
tralized markets enables economic actors to construct informal arrangements 
that build from the ground up the informal institutions of a private enterprise 
economy. From informal lending arrangements that provide private capital for 
start-up firms to far-flung supply and distribution networks, informal economic 
institutions are constructed to facilitate the expansion of private-sector entre- 
preneurial activities challenging the state-controlled economy from below. With 
the continuing expansion of markets, the economic success of firms becomes 
increasingly independent of the direct involvement of politicians. Vertical ties 
linking economic actors in firms with the state decline in significance as horizon- 
tal ties — interfirm networks and network ties between buyers and sellers based 
on repeat exchange — gain in importance. In other words, the embedded nature 
of economic transaction shifts from self-enforcing reliance on vertical political 
connections characteristic of state socialism (Whyte and Parish 1984) to self- 
reinforcing investment in horizontal network ties that sociologists emphasize as 
the basis of social capital in market economies (Uzzi 1996). 

Our core hypothesis is: The more competitive markets replace state allocation of
scarce resources and services, the less the value of political capital.

Political capital, as a fungible form of capital, has greatest value in those insti- 
tutional domains where government restricts economic activity. The stronger the 
government's commitment to introduce competitive markets in an industrial and 
commercial sector, the more the value of political capital will decline in that sector. 
During departures from state socialism, the declining value of political capital
is triggered by shifts in parameters that combine and interact to cause change in 
the firm’s institutional environment. On the one hand, subsidies for loss-making 
state-owned enterprises, for example, decreased from 25 percent of government 
expenditures in 1985 to 2 percent in 2000 (National Bureau of Statistics of China 
2001). On the other hand, by 2005 private firms produced more than 50 percent of 
China’s GDP, and accounted for more than 75 percent of exports (National Bureau 
of Statistics of China 2006). Our hypothesis predicts that political capital will be less
fungible in transactions that are based on market competition. The shift from state 
to market allocation did not occur at the same speed and to the same extent across 
the spectrum of economic activities and sectors, however. The value of a firm's politi- 
cal capital may therefore vary across distinct types of institutional domains and may 
continue to play a decisive role in distinct transactions. In partly liberalized markets 
where the state tightly controls market entry or remains involved as a market actor, 
firms with political capital are likely to secure preferential treatment. 

Parameter shifts in the institutional environment, however, do not rule out
that political actors also adapt and profit from new elite opportunities stemming
from marketization. In political markets, political actors can still provide
positional advantages by channeling government spending to clients and using tax
and industrial policies to regulate market entry and exit. As Walder (2003:900)
asserts, the “decline of bureaucratic allocation in the face of market reform does
not imply a decline of elite opportunity.” He argues that the state’s still considerable
redistributive and regulatory power allows the creation of new sources of
power, “as regulators of markets and private enterprises, brokers and middlemen
for market transactions, managers or consultants in public, market-oriented
enterprise...” (Walder 1996:1063). Preferential treatment in rule enforcement
and regulatory markets would then provide the political elite with vast opportunities
to create “new market value for official discretion” (Walder 2003:901).
This line of reasoning reinforces the view that politically connected managers
are advantaged in operating companies (Róna-Tas 1994). Despite considerable
empirical research on the effect of political capital on individual- and household-level
earnings, results remain inconclusive (Nee 1989, 1996; Róna-Tas 1994;
Bian and Logan 1996; Xie and Hannum 1996; Gerber 2006; Zhou 2000; Wu
2002; Zang 2002; Walder and Nguyen 2008). In particular, initial conditions, extent
of markets, structural change and industrial growth seem to have produced
different effects on income (Keister 2009).

To detect more directly the inherent value of positional advantages stemming 
from political connections requires a focus on the specific institutional domains 
where firms compete for survival and profits, and transact to secure resources. Our
empirical tests examine transactions across a broad spectrum of firm activities
in different institutional domains representing varying levels of marketization
and state control. Analysis of distinct types of transactions allows us to discern
variability in the fungibility of political capital across the different institutional
domains in which firms engage in routine economic activities.


Institutional Domains in China’s Market Economy 


Our analysis of political capital seeks to cover a representative set of distinct trans- 
actions and their outcomes in various institutional domains to examine whether 
the value of political capital varies with the extent to which resource allocation 
remains bounded by the redistributive economy. Our focus is on typical transac- 
tions in daily business operations, which can substantively affect a firm’s survival. 

The transition to a market economy is an uneven evolutionary process. Thus, 
we selected three institutional domains, which reflect high, moderate and low 
levels of marketization (see Figure 1). First, we include the product market as the 
most competitive institutional domain of China’s economy. Private enterprise 
participation in the product market is consistently high, with only a few state mo- 
nopolies remaining, such as that for tobacco. Our dataset includes none of these 
monopolies. A provincial marketization index developed by China’s National 
Economic Research Institute confirms that the product market is the most liberal- 
ized, with a mean level of 7.84 points out of 10 across all provinces in the year 
2003 (Fan and Wang 2003). In addition, we examine two partially liberalized 
market sectors sequentially according to increasing degrees of state controls in 
resource allocation: the credit market and the market for government contracts. 

In the credit market, liberalization of commercial banking has proceeded 
slowly. In spite of the market-entry of foreign-invested banks and private do- 
mestic banks, the credit market is still heavily state-controlled. At the time of 
the survey, more than 70 percent of deposits and loans were accounted for by 
the four largest state-owned commercial banks. Only 1 percent of loans by state 
commercial banks were allocated to private-sector entrepreneurs, signalling a 
severe imbalance given the high proportion of the industrial and commercial 
output by private enterprises (Datastream). The NERI marketization index confirms
slow liberalization. With a mean value of 2.67 out of 10 in 2002, credit
allocation occupied the lowest level of marketization of all 22 categories covered 
by the index. 

In the market for government contracts, political connections easily undermine 
competitive bidding guidelines. In this sense, the market for government contracts 
can be readily used to redistribute rents to most-favored bidders with political 
connections. This market thus exemplifies an institutional domain where the 
government acts as a monopolist. China’s firm managers are generally interested 
in securing government contracts because the government is a trustworthy client. 
Firms with regular government contracts substantively reduce their market risk.
The market for land use rights is an alternative example of this type of institutional 
domain. Both markets display similar features wherein government bureaus act as 
monopolists and enjoy undivided control rights over resources.

Figure 1. Institutional Domains in the Economic Market

In spite of considerable regional variation, the relative ranking of marketization for
these three institutional domains is invariant across China. Hence, product markets 
are in general the most marketized regardless of whether we explore product markets
in Ningxia or in Zhejiang provinces. Similarly, markets for government contracts are
in general ranked lowest in terms of marketization. If marketization and the value of
political capital are negatively correlated as specified in our core hypothesis, we would 
expect to find little or no effect of political capital in the highly marketized product 
market, while political capital can be expected to still provide positional advantages 
in institutional domains of the economy where the state remains either the dominant 
supplier (credit market) or customer (government contracts). 

Naturally, economic success of firms does not only depend on markets for private 
goods. Equally important, regulatory markets can critically affect firm performance 
and survival chances. Our comparative institutional analysis therefore also incorpo- 
rates institutional arrangements for rule enforcement and regulatory markets. The 
expectation is that network advantages and political ties are more likely to secure 
preferential treatment (Róna-Tas 1994; Parish and Michelson 1996). 

First we look at whether firms with political capital receive better government 
services, and benefit from improved rule enforcement (see Figure 2). Specifically, 
we explore how transacting with political capital affects the days needed for 
custom clearance, fines from regulatory inspection and the bribery by local tax 
inspectors. All these transactions contribute to calculability and facilitate strate- 
gic planning. Preferential treatment in these areas could substantively increase a 
firm’s survival chances. In the mid-1980s, for example, cursory reports indicate 
that local cadres and tax officers squeezed entrepreneurs to increase their office 
or personal income. Often managers cultivated political connections to protect 
their businesses from overtaxing and expropriation (Parris 1993). 

Figure 2. Institutional Domains in the Political Market 

An even stronger test case is the market for government regulation. Similar 
to markets for private goods, the regulatory market and related opportunity for 
rent-seeking have a competitive element. Firms invest time, effort and financial 
resources to compete for permits and licenses. If, in addition, government officials 
enjoy some discretionary power in deciding on license allocations, the firm's efforts 
as well as its political connections may become decisive factors for success (Krueger 
1974). Typical cases of transactions in the regulatory market we review include the 
firms’ access to export licenses, tax exemptions and the cost of business licenses. 
If our tests indicate that firms do not compete on a level playing field and that 
political connections provide a competitive edge in firm-government transactions, 
the regulatory market mechanism is bound to be suspect. 


Research Design and Empirical Test 


Resource dependence shapes a firm’s interactions with its environment (Pfeffer 
and Salancik 1978). This suggests that firm-level responses will vary across a diverse 
sample of organizational forms and industrial sectors. We use data from the 2003 
World Bank Investment Climate Survey, which provides an in-depth account of 
2,400 firms of different legal status — state-owned, collective, private and foreign. 
The survey is based on a random sample of cities and firms, stratified first on sub- 
sectors, which were selected to represent the most important (in terms of contribu- 
tion to national GDP) industries and service sectors, and then on location. Overall, 
18 middle-sized and large cities were chosen, broadly covering all regions of China, 
including highly marketized cities in the coastal areas as well as less competitive, 
lagging industrial cities in the northeast and northwestern regions. 

Part one of the survey covers factual information on company activities, politi- 
cal capital and government relations. Questionnaires were filled out in face-to-face 
interviews with senior managers of the respective establishments. Part two of the 
interview was conducted with the firm’s accountant, who provided quantitative 
information on costs, expenditures, and asset evaluations (for a thorough discus- 
sion of the survey, see Dollar et al. 2004). 

The overall ownership structure of the sample of firms resembles the general 
distribution of legal ownership forms (Table 1). Due to the urban focus of the 
World Bank survey, however, the proportion of state-owned firms is significantly 
higher than in the total firm population, whereas the residual category of “others,” 
including small individual firms, is considerably smaller. The industrial structure 
provides a mix of labor-intensive, capital-intensive and knowledge-driven production, 
which approximates the country’s overall production structure. 

Table 1: Distribution of Legal Ownership Type of Firms, in 2002 

                                                         Sample      Total Firm 
                                                         Firms %     Population % 
State-owned enterprise                                     26           16 
Collective and cooperative firm                            16           21 
Non-publicly traded shareholding companies                 18           17 
Private, non-listed firms                                  28           27 
Joint ventures and subsidiaries of multinational firms     10            8 
Others*                                                     2           11 

*A residual category which includes wholly foreign owned companies, limited liability 
companies, individually owned companies. 
Source: Investment Climate Survey 2003 and National Statistical Bureau of China. 

In this study, we focus our analysis on the non-state firms in the sample 
(n = 1,764). By virtue of property rights, government authorities seek and maintain 
direct ties with state-owned enterprises. A focus on non-state firms provides a 
more reliable measure of the value of a firm’s acquired stock of political capital. 
The total number of observations varies across our estimation models due to 
varying occurrence of firm-level transactions and incomplete data on some items. 


Dependent Variables 

We test for positional advantage stemming from political connections in a range 
of institutional domains: economic markets, administrative service provision and 
regulatory markets. We measure outcomes through a set of three discrete 
transactional variables in each domain. Our aim was to define quantifiable outcome 
variables in order to contain the risk of subjective perception bias. Table 
2 provides summary statistics of the selected variables. 


Economic Markets 


(1) Sales growth is a key transaction-focused measure assessing a firm’s ability to 
win market share and to survive in the respective product market. (2) Obtaining 
formal credit signals a firm’s access to secure external finance in the banking 
sector. Given the standard assumption in lending research that firms generally need 
external finance, one can assume that any firm would have formal bank credit 
if credit could be secured (Uzzi 1999). (3) Getting government contracts reduces 
market risks by securing reliable and solvent customers. Moreover, government 
contracts typically allow for higher profit margins. With less than 20 percent of 
public projects going through a public bidding process in 1998, rents distributed 
through the market of government contracts are substantial (Yang 2004). 

Table 2: Summary Statistics for Dependent Variables 

Variable                                                    Observations   Mean          SD 
Private Goods 
  Sales growth in 2002*                                     1,746          60.58         805.9 
  Credit access (dummy = 1 if firm has a formal bank loan)  1,714          [illegible]   .42 
  Sales to government (dummy = 1 if firm secures 
    government contracts)                                   1,765          [illegible]   .42 
Government Quality and Rule Enforcement 
  Average days for custom clearance                         698            7.70          11.54 
  Inspection fines (total costs of fines, in 1,000 RMB)     1,752          15.43         250.71 
  Local tax inspectors require gifts/bribes 
    (dummy = 1 if yes)                                      1,765          .27           .444 
Regulatory Markets 
  Firm holds an export license (dummy = 1 if yes)           1,685          .23           .420 
  Firm enjoys tax exemptions (dummy = 1 if yes)             1,763          [illegible]   .434 
  Price of getting a business registration (total of fee 
    and bribes, in 1,000 RMB)                               598            126.13        1,482.826 

*We use sales growth in the most recent year covered in the survey (instead of using an average 
over all survey years) in order to be able to include absolute sales volume in preceding years as a 
control variable. 

Source: Investment Climate Survey 2003 


Government Quality or Public Service Provision and Rule Enforcement 

(1) Days needed for custom clearance involves transactions enabling a firm to export 
goods in a timely and efficient manner. Delays can lead to loss of clients and 
weaken the firm’s competitiveness in international trade. (2) In China, inspection 
fines were a common instrument to discriminate against new private start-ups and 
other non-state firms (Parris 1993). The absolute amount of inspection fines 
therefore indicates whether political capital is still helpful in dealing with inspections by 
local administrators. (3) Corruption is an endemic feature of China’s bureaucracy. 
Whether local tax inspectors require gifts or bribes provides a measure of “hidden” 
transaction costs assessing the illegitimate and illegal use of political connections 
in standard administrative procedures. Political capital can lessen a firm’s reliance 
on bribery to secure resources and service from government (Cao et al. 2005). 


Regulatory Markets 

(1) Export licenses allow a firm to conduct international trade. If a firm is unable 
to secure an export license, it must rely on authorized state trading companies, 
resulting in higher transaction costs. (2) Tax exemptions are a common industrial 
policy instrument to support firm development. If a firm can secure tax exemptions 
from local government, this helps the firm’s chances of survival during the 
critical start-up stage. (3) If regulators drive up registration costs or discriminate 
against applicants based on economic or political factors, the cost of getting a 
business license can serve as a market entry barrier. 


Independent Variables 


Rent-seeking activity involves firms using their political capital to secure most-favored 
treatment from government as a means to increase profits (Krueger 1974). Previous 
studies measured political capital by indicating whether the CEO is a party member 
or current or former government official (Fan et al. 2007). Political capital at the 
firm level, however, is likely to be embedded in multiple dimensions of the firm's 
organizational structure. A broader set of measures allows us to examine the various 
channels through which political capital can work to a firm’s competitive advantage. 

In our dataset, about 31 percent of CEOs indicate that they hold a position 
as party secretary or deputy party secretary, which is consistent with the national 
trend of political screening and communist party recruitment of the managerial 
elite (Bian et al. 2001). 

Former government bureaucrats who leave their positions as part of China’s 
xia hai policy are likely to continue to have extensive ties with government. We 
therefore use a dummy variable to indicate whether the firm’s CEO previously 
held a government position. Overall, 5.6 percent of the CEOs in the sample had 
held government positions. 

Government routinely appoints the top managers of state-owned firms. The 
practice is also commonplace in corporatized and privatized firms in which the 
government retains partial ownership. Managers appointed directly by government 
are likely to have good political connections. In our sample, the government was 
involved in 16 percent of the recruitment decisions for managers of non-state firms. 

Another measure of political capital is whether a firm receives “government 
support” in acquiring resources and services (Ayyagari et al. 2008). Not every 
firm benefits from government assistance; hence a firm's ability to secure direct 
government assistance signals it has fungible political capital. The survey includes 
six typical fields of local government support (locating foreign technology; ob- 
taining bank financing; identifying foreign investors, foreign clients, foreign sup- 
pliers and domestic clients). Assistance in these areas is among the most sought 
after services, which companies need to support domestic and global growth and 
development. For these measures of political capital it does not matter whether 
government assistance was provided through formal channels and procedures, or 
rests on informal network-based support. We construct an index (ranging from 
0 to 6) of government assistance received in all six fields of support. Overall, 32 
percent (546) of respondents reported receiving some kind of government as- 
sistance over the past business year. Most of these respondents report that they 
have received government assistance in identifying potential domestic clients (n = 
308). It should be noted that government assistance does not guarantee successful 
outcomes. In this sense, we are not running the risk of including a tautological 
independent variable. In sum, government support signals that the firm is on 
good terms with bureaucrats and politicians. It can also be interpreted as a signal 
of otherwise not explicitly revealed political capital. 
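
The index construction described above can be sketched in a few lines of Python. This is a minimal illustration, assuming each of the six support fields is coded as a 0/1 indicator; the field names and the example record are hypothetical and are not the survey's variable names or data.

```python
# Hypothetical field names for the six support categories named in the text;
# the survey's actual variable names are not given in the article.
SUPPORT_FIELDS = (
    "foreign_technology",
    "bank_financing",
    "foreign_investors",
    "foreign_clients",
    "foreign_suppliers",
    "domestic_clients",
)


def support_index(firm):
    """Count the fields (0 to 6) in which a firm reports government assistance."""
    return sum(1 for field in SUPPORT_FIELDS if firm.get(field, 0) == 1)


# Illustrative firm record, not survey data.
example_firm = {"bank_financing": 1, "domestic_clients": 1}
print(support_index(example_firm))  # 2
```

A simple count treats all six fields as equally weighted, which matches the 0 to 6 range reported in the text; any other weighting would be a separate modeling choice.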

Pairwise correlation coefficients ranging from -.01 to .23 confirm that these 
selected measures capture different dimensions of firm-based political capital. 
Table 3 provides summary statistics for all independent variables. We have checked 
Pearson correlation statistics and variance inflation factors, and can rule out multi- 
collinearity concerns. Throughout, we have ensured that the joint use of these 
measures does not have a dampening effect on any single measure. Hence, we only 
report complete models instead of stepwise results. 
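
A collinearity check of this kind can be sketched as follows. The article does not report which software or exact procedure was used, so this is only an illustration: it computes pairwise Pearson correlations and derives variance inflation factors as the diagonal of the inverted correlation matrix, a standard identity. The function names and the toy data are assumptions.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def correlation_matrix(columns):
    """Matrix of pairwise correlations between predictor columns."""
    return [[pearson(a, b) for b in columns] for a in columns]


def invert(matrix):
    """Invert a square matrix by Gauss-Jordan elimination with partial pivoting.

    No check for singularity: perfectly collinear predictors raise ZeroDivisionError.
    """
    n = len(matrix)
    aug = [list(row) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def vifs(columns):
    """Variance inflation factor of each predictor (diagonal of R inverse)."""
    inv = invert(correlation_matrix(columns))
    return [inv[j][j] for j in range(len(columns))]


# Toy predictors (not survey data) with a pairwise correlation of 0.8.
columns = [[1, 2, 3, 4], [1, 3, 2, 4]]
print([round(v, 2) for v in vifs(columns)])  # [2.78, 2.78]
```

With correlations no larger than .23, as reported above, the corresponding two-variable inflation factor would be close to 1, which is why multicollinearity is unlikely to be a concern.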


Control Variables 


We introduce a set of control variables to differentiate positional advantages stem- 
ming from political capital from other potentially confounding effects that may 
influence transactions in the institutional domains we examine. 


Company-specific Capital 

Firm age: The firm’s age may affect outcomes in regulatory and economic markets. 
Older firms, for example, will naturally have a more extensive inter-firm network 
than younger, recently founded firms. 

Firm size: Not only may firm size influence a firm’s market position, but larger 
firms are also more attractive targets for rent extraction. In order to capture 
potentially confounding effects of firm size, we include the lagged natural logarithm 
of a firm’s sales value (in Model 1, in order to control for the sales level in the 
preceding year) and assets (Models 2-9) of the preceding period. 

Capital structure: Financial leverage is positively correlated with a firm's ca- 
pability development. Because financial leverage may stem from a firm’s use of 
political connections in securing loans from state-owned banks, we include the 
debt-to-asset ratio. 

Industry: Because many regulations and industrial policies are sector specific, 
we included industrial dummy variables indicating 14 different sectors. Another 
dummy variable signals whether a firm is located in an industrial or technology 
park. Such parks benefit from favorable government treatment and have other 
political capital advantages. Further, we include a firm’s capital-to-labor ratio. As 
China’s industrial policy shifts the industrial structure away from labor-intensive 
production, a firm’s production technology may have an impact on the value of 
political connections. We control for whether a firm is listed on one of China’s two 
stock exchanges. Listed firms comprise key enterprises, singled out to be the 
mainstay of economic development. We also control for government ownership, 
as public ownership shares may strengthen a firm’s ability to seek political support. 
Politicians and government officials are usually more willing to distribute rents to 
fully or partly state-owned enterprises (Shleifer and Vishny 1994). Ten percent of 
the firms included in our sample indicate partial state-ownership shares. 


Human Capital and Incentive Structure 
We also control for professional competence of firm managers and for manage- 
ment’s incentive structure. 

Manager education: Whether managers act in an entrepreneurial and innovative 
manner or as rent-seekers may be tied to educational attainment. Moreover, edu- 
cation is positively associated with high-ranking party positions and the ability to 
secure political rents. To separate educational attainment from political power, we 
include a dummy variable which indicates whether the CEO has college education. 

Manager tenure: The managers’ tenure may not only determine firm-specific 
human capital, but long-time managers may be in a better position to secure gov- 
ernment help — they have had more time to develop a broader political network 
reflecting firm-specific needs. 

Incentive contract: Incentive contracts are a common corporate governance 
device, but are uncommon when governments remain involved in recruitment 
decisions or when former government employees are recruited. 


Regional Control Variables 


We included a set of regional, provincial and city-level controls that could affect 
outcomes in distinct transactions and also the market value of political capital. 

Regional context: Five control variables sort China into main regions (coastal, 
central, northeast, northwest and southwest). Coastal China, the most liberalized 
and developed region, is our benchmark. 

Provincial-level marketization: Provincial control variables for marketization 
(NERI-Index) are useful, as most of our survey firms operate at the provincial 
and even national level. 

City-level economic development: Finally, we include the natural logarithm of 
per capita GDP at the city level, as local economic development may affect not only 
outcomes in various transactions but also a firm’s ability to benefit from 
political capital. 


Model Specification 


Table 3: Summary Statistics for Independent Variables 

Variable                                                  Observations   Mean      SD 
Political Ties 
  CEO holds formal position in the party 
    (dummy = 1 if yes)                                    [illegible]    .31       .46 
  CEO previously held government position 
    (dummy = 1 if yes)                                    1,752          .06       .25 
  Government involvement in CEO appointment 
    (dummy = 1 if yes)                                    1,744          .16       [illegible] 
  Government support index (over 6 categories)            [illegible]    .64       1.21 
Company 
  Log of firm age                                         1,764          2.24      .68 
  Log of sales value in 2001                              1,747          8.93      2.25 
  Log of total assets in 2001                             1,692          8.15      2.40 
  Debt equity ratio in 2001                               1,678          2.86      19.91 
  Capital labor ratio                                     1,719          161.65    2,399.04 
  Location in industrial park (dummy = 1 if yes)          1,739          .30       .46 
  Listed firm (dummy = 1 if yes)                          1,744          .03       .17 
  Government ownership                                    1,763          7.20      23.79 
Management 
  CEO has college education (dummy = 1 if yes)            1,754          .80       .40 
  Tenure of CEO                                           1,747          6.19      4.44 
  CEO has incentive contract (dummy = 1 if yes)           1,764          [illegible]  .45 
Regional Controls 
  Provincial level marketization of product market (NERI) 1,764          8.43      1.03 
  Provincial level marketization of credit market (NERI)  1,764          3.34      2.93 
  Overall provincial level marketization index (NERI)     1,764          6.08      [illegible] 
  Log value of per capita GDP (City level)                1,764          9.65      .60 

Source: Investment Climate Survey 2003; Fan and Wang 2003; National Statistical Bureau of China 
2003. 

The stratified nature of the survey, with 100 to 150 observations each in 18 
municipalities, calls for caution not to overlook confounding city-level effects. 
Firms within the same municipality may well be more similar to one another than to 
firms in other municipalities, due to local economic, political and also cultural 
factors. Within-city correlation in the estimated errors would thus potentially bias the 
standard errors downwards. Multi-level analysis is a standard approach 
to separate regional from individual effects. The asymptotic maximum likelihood 
estimation methods used in multi-level analysis, however, require a minimum 
sample size. Simulation studies confirmed that the standard errors of the second 
level variances (here city effects) are estimated too small when the number of 
groups is substantially lower than 100 (Maas and Hox 2005), although 30 clusters 
are often indicated as a minimum number of groups (Kreft and De Leeuw 1998). 
Given the relatively small number of only 18 cities, we therefore chose to estimate 
city-clustered standard errors instead of using multi-level estimations. Our choice 
is further justified as we find mostly low intraclass correlation in our 
sample (between 0 and 4 percent) (Twisk 2006). Low intra-class correlation is 
most likely caused by broad organizational heterogeneity at the individual level, 
which overrules regional effects. 
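
The concern about understated standard errors can be illustrated with a small sketch. It is not the authors' estimation code: it treats the simplest case, the standard error of a sample mean (an intercept-only regression), and applies the cluster-robust sandwich formula with a G/(G-1) finite-sample correction, which is one common choice. The function names and the toy data are assumptions.

```python
import math


def naive_se_of_mean(clusters):
    """Conventional standard error of the mean, ignoring the grouping."""
    values = [v for cluster in clusters for v in cluster]
    n = len(values)
    mean = sum(values) / n
    var = sum((v - mean) ** 2 for v in values) / (n - 1)
    return math.sqrt(var / n)


def clustered_se_of_mean(clusters):
    """Cluster-robust standard error of the mean.

    Residuals are summed within each cluster before squaring, so errors that
    move together inside a cluster are not treated as independent information.
    Uses the G/(G-1) correction (an assumption; other corrections exist).
    """
    values = [v for cluster in clusters for v in cluster]
    n = len(values)
    g = len(clusters)
    mean = sum(values) / n
    cluster_sums = [sum(v - mean for v in cluster) for cluster in clusters]
    var = (g / (g - 1)) * sum(s ** 2 for s in cluster_sums) / n ** 2
    return math.sqrt(var)


# Toy data: two "cities" whose firms are identical within each city.
cities = [[1.0, 1.0], [3.0, 3.0]]
print(round(naive_se_of_mean(cities), 3))      # 0.577
print(round(clustered_se_of_mean(cities), 3))  # 1.0
```

In this extreme toy case the naive standard error is too small because it counts four independent observations where there are effectively two. When intraclass correlation is low, the two estimates move closer together.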
We use the following reduced form model. 

$$Y_i = \alpha + \beta P_i + \gamma Z_i + a_r + a_s + \varepsilon_i$$

Y denotes the specified outcome variable of transactions in nine distinct 
institutional domains. P denotes a vector of dimensions of political capital. Z is a vector 
of firm-level control variables covering company characteristics and management 
features likely to influence the outcome variables. In addition we control for 
region (a_r) and sector (a_s). Depending on the type of outcome variable, we 
estimate OLS or probit models. 
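
For the binary outcomes estimated by probit, the reduced form is usually read as an index inside the standard normal distribution function. The following is a sketch under that standard assumption, not a formula printed in the article; \alpha, \beta and \gamma are assumed labels for the intercept and coefficient vectors, and a_r and a_s stand for the region and sector controls.

```latex
\Pr(Y_i = 1 \mid P_i, Z_i) = \Phi\left(\alpha + \beta P_i + \gamma Z_i + a_r + a_s\right)
```

Here \Phi is the standard normal cumulative distribution function, so a probit coefficient shifts the index rather than the probability directly.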


Results 


Table 4 presents our findings for the three institutional domains in the economic 
market. For sales growth in the highly competitive product market (Model 1), 
firms with political capital seem not to have positional advantage. None of the 
different dimensions of political capital has a significant positive effect on sales 
growth. Firms with CEOs appointed by the government even have a disadvantage 
in sales development. Clearly, this finding does not yet suggest a negative causality. 
Without longitudinal data we cannot rule out that poorly performing companies 
simply invite government involvement in recruitment decisions to help mitigate 
bad company performance. Our results indicate that political capital is not associ- 
ated with positive growth effects in the product market. To confirm the robustness 
of the model we experimented with log values of sales growth, but our results 
remain unchanged. We also explored whether political capital may potentially 
work through other forms of priority treatment. The inclusion of additional ex- 
planatory variables such as the availability of tax exemptions, sales contracts with 
government and trade licenses, however, does not affect our findings. Also the 
explanatory power of Model 1 remains the same. 

Another consideration is that gains from political connections could also be 
linked to a firm’s market-based competencies. On the one hand, local govern- 
ments might be more willing to support economically capable firms; on the other 
hand, capable firms might be in a better position to transform political support 
into market success. In order to explore a potential relation between market- 
based competencies and the value of political capital, we have calculated different 
measures of firm innovativeness to proxy firm capabilities. First, we included 
in our benchmark model a firm’s share of new products in total sales (t - 1) 
and added corresponding interaction effects with our political capital measures. 
Secondly, we have conducted the same exercise with an index of firm innovative- 
ness (covering the opening of new business lines: product, process, management 
and quality control innovations over a three-year period from 1999 to 2001). 
Our estimation results did not indicate that firms with stronger market-based 
competencies realize higher payoffs from their political capital. 

Table 4: Economic Markets and the Role of Political Ties 

                                            Model 1         Model 2         Model 3 
                                            Sales Growth    Credit Access   Sales to Government 
                                            Coefficient     Coefficient     Coefficient 
                                            (t-ratio)       (z-ratio)       (z-ratio) 
Political Ties 
CEO holds a party position                  26.60           .15             .03 
                                            (1.13)          (1.60)          (.43) 
CEO previously held government position     79.31           -.03            [illegible] 
                                            (.82)           (-0.17)         (1.94) 
Government involvement in CEO appointment   [illegible]     -.13            .002 
                                            (-2.41)         (-1.06)         (.02) 
Government support index                    -11.52          [illegible]     [illegible] 
                                            (-1.17)         (2.54)          (3.32) 
Company 
Log firm age                                12.00           -.09*           [illegible] 
                                            (1.21)          (-1.86)         (-2.20) 
Log sales (lagged)                          -76.85*         -               - 
                                            (-2.10) 
Log assets (lagged)                         -               [illegible]     -.002 
                                                            (6.72)          (-.10) 
Debt equity ratio (lagged)                  [illegible]     -.002           -.001 
                                            (.91)           (-1.75)         (-0.48) 
Capital labor ratio                         [illegible]     -.0003*         -.001** 
                                            (3.44)          (-1.81)         (-2.50) 
Located in industrial park                  107.38          [illegible]     -.14* 
                                            (1.16)          (1.14)          (-1.66) 
Listed company                              [illegible]     .48**           .48** 
                                            (-.51)          (2.30)          (2.04) 
Government ownership shares                 [illegible]     -.002           .002 
                                            (2.04)          (-.28)          (1.45) 
Sector (14 industries)                      YES             YES             YES 
Management 
CEO holds university degree                 59.15           -.19            [illegible] 
                                            (1.37)          (-1.34)         (3.21) 
Log of CEO tenure                           -45.58*         [illegible]     -.003 
                                            (-1.77)         (2.13)          (-.06) 
CEO has incentive contract                  -21.08          [illegible]     [illegible] 
                                            (-.54)          (1.77)          (3.71) 
Location 
Region                                      YES             YES             YES 
Log pc GDP (city level)                     31.04           [illegible]     -.103** 
                                            (.93)           (-1.13)         (-1.97) 
Marketization of commodity market           88.54 
                                            (1.39) 
Marketization of credit market                              [illegible] 
                                                            (2.39) 
Market index                                                                -.06 
                                                                            (-1.45) 
Constant                                    413.42          -2.01           .49 
                                            (-.71)          (-1.21)         (.66) 
Method                                      OLS             Probit          Probit 
R2 / pseudo R2                              .518            .16             [illegible] 
N                                           1,503           1,447           1,496 

*p<.10 **p<.05 ***p<.01; standard errors are clustered on city. 

In contrast, political capital has a significant impact on economic transactions 
in the state-controlled credit market (Model 2). Government support is 
positively associated with success in securing formal bank loans. And as expected, 
political connections provide the strongest competitive advantage in the market 
for government contracts (Model 3). One of our interviewees in the Yangzi delta 
region indicated, “Competitive bidding is just a form... Political connections are 
still as important as before... If some senior government official gives a signal 
we will get the project. Sometimes we lose bids, because someone else gets the 
nod from a senior official.” Two measures of political capital confirm significant 
positive effects on a firm's percentage of sales to the government. First of all, good 
standing with local government as indicated by the government support index 
provides advantages in securing government contracts. Moreover, managers with 
a previous career as a government bureaucrat increase a firm's ability to secure 
government contracts. Our interviews confirmed that many former bureaucrats 
who left their government positions and took jobs as CEOs systematically built 
on their insider networks to cultivate political connections in order to secure 
government contracts. Some of these businesses were even founded on the idea of 
becoming a major government supplier and thereby reducing market risk in the 
highly competitive commodity market. 

The persistent value of positional advantage in non-marketized or only partially 
marketized sectors of the economy may invite inferences on persisting or even 
increasing value of political capital in the political market. Our results on rule 
enforcement (Table 5), however, indicate that firms with politically connected CEOs do 
not enjoy a systematic advantage when it comes to rule enforcement. Only in one 
case do former government officials seem to enjoy advantages, when it comes to 
timely custom clearance of export goods (Model 4). However, in the same model, 
our estimates indicate that government-appointed CEOs have to expect on average 
three days more for custom clearance. We suspect that the delay in custom clearance 
may be related to specific product types. The negative effect actually disappears if we 
attach a lower weight to outliers and run robust regressions. Estimation results of 
Model 5 indicate that the total amount of inspection fines depends neither on politi- 
cal capital nor on key characteristics of the company or management. Estimations 
with alternative dependent variables such as the ratio of total inspection fines 
to total business costs confirm these results. Model 6 displays another 
unexpected finding. Firms with a good standing with the government, as indicated 
by the government support index, seem to experience more cases of bribery in 
their dealings with local tax inspectors. However, both the level of significance and 
the marginal effect are relatively low. It is notable that neither political capital nor 
company features such as size, age and industry help to explain how individual firms 
are treated by administrative authorities. 

The extremely low explanatory power of models 5 and 6 requires some clarifica- 
tion. Based on extensive interviews we conducted with municipal bureaucrats and 
entrepreneurs, we interpret these findings as a signal of increasing fairness and 
unbiased treatment of economic actors. Ultimately, in a rule-based economy, the 
total amount of inspection fines is determined by the individual cases of detected 
misdemeanor, but not by general company characteristics or political capital. In 
this sense the low explanatory power could reflect a case-by-case treatment based 
on a review of the specific facts and circumstances. The focal finding with respect 
to our analysis of rule enforcement, however, is that management network ties 
with government and firm characteristics obviously capture little of the variation 
in specific transaction outcomes. 

Analysis of the effects of political capital in different regulatory markets pro- 
vides an even stronger test of its fungibility in political markets (Table 6). In 
contrast to the claim that the political elite rely on opportunities embedded in the 
regulatory roles of the state to compensate for a decline in redistributive power, 
we find no systematic evidence that companies with political connections enjoy 
advantages in the institutional domain of regulatory markets. 
In two cases, we find slight disadvantages for politically connected firms. Firms 
with government involvement in CEO recruitment decisions are less likely (the 
marginal effect is 8 percent) to hold an export license (Model 7), which would 
allow the firm to directly conduct foreign trade without involving trading com- 
panies. We have also re-estimated Model 7 for a reduced sample under exclusion 
of firms which do not currently export, but find this effect confirmed. While we 
do not expect any arbitrary discrimination of firms with government-appointed 
CEOs, we suspect that these managers feel a stronger pressure to continue to 
trade through the traditional foreign trade system operating through state-run 
trading companies. Model 8 suggests similar disadvantages for former government 
officials in securing tax exemptions. 
Overall, firm age, size, production technology and industry are the central 
determinants explaining the granting of export licenses and tax exemptions. This 
is in line with China's industrial policy guidelines, which specify certain company 
features as prerequisites for access to trade and tax exemptions in order to actively 
promote the modernization of the economy. Model 9 complements this overall 
impression and underscores that the price of business registrations is not affected 
by company characteristics or political capital. 
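
The marginal effect reported for Model 7 is the kind of quantity obtained by converting a probit coefficient into a change in predicted probability. The sketch below shows the calculation for a dummy variable. The baseline index and the coefficient are hypothetical numbers chosen for illustration, not the article's estimates, and the function names are assumptions.

```python
import math


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def dummy_marginal_effect(baseline_index, coefficient):
    """Change in predicted probability when a dummy switches from 0 to 1.

    baseline_index is the probit index with the dummy set to 0.
    """
    return normal_cdf(baseline_index + coefficient) - normal_cdf(baseline_index)


# Hypothetical values, not taken from the article's tables.
effect = dummy_marginal_effect(baseline_index=-0.5, coefficient=-0.3)
print(round(effect, 3))  # -0.097
```

Because the normal distribution function is nonlinear, the same coefficient implies a different probability change at a different baseline index, which is why marginal effects are reported at a stated reference point such as the sample means.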

Skeptics might object that political connections came to full play at earlier stages 
of market transition and have helped to build up specific company characteristics 
that now, at an advanced stage of transition, independently yield positive effects 
on transaction outcomes. Company size, for example, might theoretically be the 
result of earlier asset appropriations in the course of privatization policies. While 
clear counter evidence would require time-series data covering earlier periods of 
China’s transition, some observations contradict this idea. First, company size does 
not create a general advantage, but is only associated with advantages in four out of 
nine models (see Tables 4-6). Secondly, company size and the examined measures 
of political capital are only moderately correlated, with a negative correlation for 
former government officials. Even exclusion of company size as a control variable 
would not substantially affect our results for political capital. Hence, we do not see 
evidence for lasting economic advantages due to potential asset accumulation of 
politically connected firms in prior periods. Clearly, the weak correlation between 
firm size and political capital in the survey year does not rule out that political 
connections may have played a more decisive role during the firms’ founding 
process. The main emphasis of our study, however, is to compare the current value 
of political connections in different types of institutional domains. As the qualities 
of institutional domains have gradually changed in the course of market transition, 
our results do not invite inferences on earlier stages of development. 

A final note of caution should be added, which we share with the whole em- 
pirical literature studying the economic value of political connections. While we 
believe that we have provided the best possible approximation of political capital, 
we cannot rule out that specific state-firm connections are simply not revealed by 
the available survey data. Firms may possess direct linkages through relatives or 
acquaintances of key personnel or owners, which are not covered by the World 
Bank survey instrument. However, “not revealed” or “hidden” political connec- 
tions will most likely be correlated with at least one of the dimensions of political 
capital captured in our model. It is unlikely that firms without any formal party 
association and government assistance systematically possess informal connections 
that would create otherwise unrecognised advantages. 


Robustness Test 


We consider three robustness issues. First, we checked whether our OLS results 
were driven by outliers or predictor variables with high leverage. Robust 
regressions, however, generally confirmed our findings and did not indicate systematic 
advantages for firms with political connections. 

Secondly, reverse causality is a potential risk when using cross sectional datasets. 
It is possible that political capital may have different effects depending on the 
firm’s economic performance. Political connections may be easier to activate for 
poorly performing firms facing bankruptcy, or alternatively, political capital might 
become more valuable for top performers thanks to the government’s interest 
in cultivating national brands and global players. To mitigate the risk of reverse 
causality, we have experimented with different lagged performance measures (such 
as return on assets) as additional control variables, but our substantive findings 
remained unaffected. Also, our findings are confirmed for a subsample of 
loss-making firms. Theoretically, there is also a chance that firms which are performing 
weakly in the specified market settings respond by recruiting new CEOs with 
viable government ties. If such a selection effect actually existed, our estimation 
results would possibly not reveal positive effects of political capital. To rule out 
such concerns, we have explored whether our results still hold up if firms with 
recently recruited CEOs (we experimented with tenure of up to 5, 3 or 1 years) 
are excluded from the sample. Overall, our general findings were confirmed. 

While our model specification controlled for economic development, we cannot 
rule out that advantages stemming from political connections may still be linked 
with it. To examine whether the economic advantages of political capital 
vary with economic development, we repeated our analysis for a subsample of 
cities with below-mean per capita GDP. To support a causal link between economic 
development and the role of political capital, we would expect stronger effects for 
positional power and political connections among the less developed municipalities. 
In order to explore the potential impact of structural contextual effects connected 
with the early stage of regional industrialization, another test reduced our sample 
to municipalities with more than 50 percent rural population. As before, political 
capital was only associated with competitive advantages in the credit market and the 
market for government contracts. Outside of these markets, political connections 
yield no systematic positional advantages for firms with political capital. 


Discussion and Conclusion 


In economic markets, advantages stemming from political capital are highest in 
state-dominated markets (such as the credit market and the market for govern- 
ment contracts), while we cannot detect a positive payoff for political connections 
in the competitive product markets. Our findings correspond with earlier work 
on the connection between political capital and market regulation, which finds a 
positive correlation between the representation of political directors on corporate 
boards and the level of industry regulation (Helland and Sykuta 2004). 

Surprisingly, we found no systematic advantage for politically connected firms 
in political markets. Instead, outcomes in administrative and regulatory markets 
seem to a large extent determined by company characteristics and firm location. 
Particularly, economic transactions, which are closely related to the government's 
ambition to launch an active industrial policy supporting promising and strong 
firms, depend in the first place on the firm’s organizational characteristics. We 
observe advantages for younger and large firms, not involved in capital-intensive 
production, and located in industrial parks and technology zones in the markets 
for export licenses (Model 7) and in applications for tax exemptions (Model 8). In 
contrast, administrative fees are not determined by company features or political 
capital (Model 9). In this sense, the development of the political market seems 
to move in the direction of a level playing field, where rule-based enforcement of 
regulatory standards prevails. 

Our findings may be indicative of a linked state and market evolutionary de- 
velopment (Zhou 2000), where progress in marketization triggers changes in the 
reward structure for political actors, which eventually leads to a decline in the 
value of political connections in the marketized sectors and provides impetus for 
the creation of a rational-legal bureaucracy (Yang 2004; Nee and Opper 2007). 
Most importantly, the decentralization and privatization of economic activities 
render the socialist profit retention system, built on central planning and state- 
ownership, ineffective. As lower-level governments increase their revenue to the 
extent that they succeed in promoting economic development in their jurisdiction, 
firms’ chances to benefit from their political connections are weakened. Like other 
markets, regulatory markets gradually become competitive markets. 

Overall, our findings suggest that by 2002 political connections in urban China 
provide advantages principally in economic markets where government restricts 
economic activity and where economic actors compete for rents. In the competi- 
tive product market, however, political capital — whether in the form of political 
connections or government support — does not seem to confer competitive advan- 
tage in economic transactions. The absence of systematic advantages for political 
capital in administrative and regulatory markets, moreover, suggests that a linked 
evolution of market institutions and bureaucracy might strengthen rational-legal 
authority and procedures in the political market (Guthrie 1999). This does not 
mean that politically connected entrepreneurs may no longer benefit from their 
political capital to build up large-scale firms. Our findings on political advantages 
in the credit market and in the market for government contracts suggest that po- 
litical capital still can yield valuable economic advantages. Entrepreneurs and firms 
who frequently transact in those markets will therefore have strong incentives to 
maintain and cultivate political ties. 

Many on China’s growing list of parvenus are politically connected, and a 
third of China’s richest 800 entrepreneurs are party members (Kwong 2007). 
Consistent with our analysis, anecdotal evidence suggests that positional advan- 
tages are usually pervasive in highly regulated sectors, such as the real estate sector, 
construction industry, banking or foreign trade, where the “right connections” 
can help to secure super profits. Also, our results do not rule out that rent-seeking 
with positional power still offers routine sources of opportunity for the political 
elite, as it continued to be manifest in bribe-taking by government officials. The 
mean expenditure for fees and bribes used to obtain government-issued business 
registration, for example, was 125,000 rmb (Model 9); the standard deviation 
(1,482,820 rmb) indicates that bribes in regulatory markets were not trifling. 

Some final caveats apply: Our analysis aims to compare the value of political 
capital in different types of routine transactions by firms across a variation of distinct 
institutional domains. Our analytical focus does not imply that political capital 
may not also serve as insurance — a mechanism that helps firms mitigate potentially 
unfavorable interference from political actors. When economic actors face uncer- 
tainty about political contingencies, investing in political capital is a prudent form 
of strategic action. We acknowledge that there may be incentives for economic 
actors to invest in the cultivation of political connections beyond the scope of our 
transaction-focused analysis. Similarly, our study does not incorporate the role of 
financial contributions in rent-seeking activities of economic actors. In light of sys- 
tematic evidence in mature market economies that documents the common use of 
financial contributions to political coffers to buy outcomes from legislative processes 
(Monardi and Glatz 1998) and in access to strategic resources (Claessens et al. 2008), 
such use of political connections is also likely to persist in China. Reports of financial 
payments disguised as contributions to social projects and payments to “political 
consultants” (Huang 2008) suggest that “political support” is also purchased. 

Finally, our predictions should not be interpreted as implying that the state loses 
power in the transition to a market economy. Indeed, with more relative autonomy, 
state power becomes less encumbered by rent-seeking activity (Evans 1995). Fuelled 
by dynamic market capitalism, economic development in China has generated 
growing revenues for state coffers, as reflected in the huge currency reserve held by 
the central bank. Growth in tax revenues augments the state’s wealth and power to 
make credible commitments to long-term national goals of modernization. 


How Socio-Economic Change Shapes Income Inequality in 
Post-Socialist Europe 


Nina Bandelj, University of California, Irvine 
Matthew C. Mahutga, University of California, Riverside 


Although income inequality in Central and Eastern Europe was considerably lower 
during socialism than in other countries at comparable levels of development, it in- 
creased significantly in all Central and East European states after the fall of communist 
regimes. However, some of these countries managed to maintain comparatively low 
inequality levels 10 years into the transition period while inequalities have skyrocketed 
in others. What explains this variation? This article presents one of the first longitudi- 
nal cross-national analyses of the factors that determine changes in income inequality 
in 10 Central and East European countries during the first decade after 1989. Results 
suggest that rising income inequality is principally related to (1) the expansion of the 
private sector, (2) retrenchment of the redistributive state, (3) the social exclusion 
of ethno-national minorities, and (4) penetration of foreign capital. Moreover, the 
analyses suggest that privatization strategies promoting foreign investment created 
more inequality than those promoting domestic investment. These findings reveal the 
social, political and cultural foundations of the income inequality dynamic during 
post-socialist transition in Central and Eastern Europe. 





Introduction 


The official ideology of socialism portrayed the system as one that ensured social 
justice and equality. The party-state would secure full employment and take care of 
the population's basic needs by providing universal education, health care, subsidized 
housing and cultural goods (Kornai 1992). While actual socialist systems did not erase 
inequalities (Szelényi 1978), scholars overwhelmingly agree that income inequality was 
substantially lower during socialism than inequality in other systems at comparable 
levels of industrial development (Boswell and Chase-Dunn 2000; Heyns 2005). 

With the collapse of communist regimes, state-level efforts to reduce inequality 
were largely abandoned. After 1989, Central and East European states quickly 
embraced market exchange as the governing economic principle and capitalism as 
the preferred system of economic organization. They also began to integrate into 
the global economy by opening their borders to foreign direct investment, which 
was restricted during the socialist period. 

Figure 1. Rising Income Inequality in 10 Post-Socialist European Countries 

Since the onset of these overarching transformations, social inequalities in 
CEE have, without exception, increased throughout the region (for review see 
Heyns 2005). The rising trends are depicted in Figure 1, which shows levels of 
income inequality from 1989 to 2001 for 10 CEE countries. In 1989, the aver- 
age Gini index was 22 across these countries, but increased to 34 only a dozen 
years later. The magnitude of this change becomes evident when we consider 
that the Gini index increased about 3 points in the United States, and declined 
slightly in a few West European countries during this same period (Alderson 
and Nielsen 2002). 
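
For readers unfamiliar with the measure, the Gini index reported above can be computed from a list of incomes with the standard rank-weighted formula. The following is only a minimal sketch: the function name `gini_index` and the toy income lists are hypothetical illustrations, not the data or procedure behind the figures cited in this article.

```python
def gini_index(incomes):
    """Return the Gini index on a 0-100 scale for non-negative incomes.

    Illustrative helper (hypothetical name); uses the rank-weighted formula
    G = 2 * sum(i * x_i) / (n * sum(x)) - (n + 1) / n, with x sorted ascending.
    """
    values = sorted(incomes)
    n = len(values)
    total = sum(values)
    if n == 0 or total <= 0:
        raise ValueError("need at least one positive income")
    # Weight each income by its rank in the sorted distribution.
    weighted = sum(rank * x for rank, x in enumerate(values, start=1))
    return 100 * (2 * weighted / (n * total) - (n + 1) / n)


# Hypothetical toy distributions, not survey data.
equal_incomes = [1, 1, 1, 1]         # everyone earns the same
concentrated_incomes = [0, 0, 0, 10]  # one person holds all income
print(gini_index(equal_incomes), gini_index(concentrated_incomes))
```

On this scale, a value of 0 indicates a perfectly equal distribution, and higher values indicate greater concentration of income.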

Despite this general rising trend, differences across post-socialist countries 
are quite substantial. As displayed in Figure 2, a dozen years into the transi- 
tion, some post-socialist states managed to maintain comparatively low income 
inequality levels, while inequalities skyrocketed in peer states. For example, Gini 
indices for Romania, Estonia and Lithuania in 2001 are close to the high levels 
we find in Anglo-Saxon countries. On the other hand, inequality levels in the 
Czech Republic and Slovenia after 10 years of market reform resemble those of 
Scandinavian countries, known for their relatively equitable income distribution. 
How can we explain this cross-country variation? Which aspects of the post- 
socialist transformation might account for these differences? 

Studying income inequality trends in CEE can enhance our understanding of 
post-socialist transformation. In addition, examining the determinants of income 
inequality since the collapse of communist regimes is also strategic in addressing 
how specific structural, political and cultural contexts shape social stratification. 
Low levels of inequality during socialism were largely due to the communist 
ideology that supported a socialist redistributive command economy. Likewise, 
political choices about privatization at the onset of market reform in post-socialist 
Europe, and societal changes that weakened the redistributive role of the state 
and legitimated the exercise of self interest in the market, influenced income 
inequalities. Highlighting the role of social forces is in line with research in eco- 
nomic sociology (Smelser and Swedberg 2005). Research in this field has made an 
important contribution by showing that economic action at the individual and 
organizational level is shaped by social forces. Nevertheless, there is an increasingly 
salient call for economic sociologists to more systematically uncover the social, 
political and cultural foundations of macroeconomic outcomes (Ingham 1996; 
Granovetter and Swedberg 2001) and pay more attention to stratification and 
inequality (Swedberg 2003). Studying changes in macro-level income inequality 
during large-scale transformations in CEE, therefore, provides an ideal opportu- 
nity to advance these lines of economic sociology. 


Socio-Economic Change and Income Inequality in Postsocialism 


Changes in CEE are marked by the simultaneity of economic, political and 
social transformations brought about by the revolutions of 1989. Thus, income 
inequality trends in this region should be viewed within the context of macro- 
level societal transformations involving the transition from a socialist command 
economy to a market-based economy. This brought a fundamental change to 
the institutional structure of society in which collective interests guarded by the 
redistributive state take backstage to private interests and individual self-reliance. 
Economic transformations were accompanied by declarations of political sov- 
ereignty from the Soviet Union and, in many cases, also national sovereignty 
and dissolution of multinational federations. This all amplified nationalist sen- 
timents in post-socialist states to the detriment of ethno-national minorities. 
Further, post-socialist changes happened during a time of intensified global 
capital flows worldwide. Therefore, we want to highlight four social processes 
that are central to post-socialist transformation and, we argue, also encourage 
differentiations in income inequality in post-1989 CEE: (1) privatization, (2) 
redistributive state retrenchment, (3) ethno-nationalist discrimination, and (4) 
foreign investment penetration. 


Privatization 


After the revolutionary events in 1989, CEE states were set on building capitalism. 
Stabilization, liberalization and privatization were three key pillars adopted to 
accomplish the daunting task of market reform. Privatization —the conversion of 
a system where private ownership was absent to one where economic actors have 
property rights—was universally recognized as necessary for market transition. 
Indeed, if free economic agents (buyers and sellers) are to exchange property stakes 
on the market, then private property needs to be allowed, and property rights 
clearly defined and protected. 

Nevertheless, privatization was not a uniform and straightforward byproduct 
of economic transformation. Rather, it was a social and political process that was 
contested and played out differently in individual post-socialist states (Stark 1992; 
Stark and Bruszt 1998; Verdery 2004; Bandelj 2008). Countries implemented 
a variety of privatization strategies, including selling assets as in auctions to the 
highest bidder, encouraging management-employee buyouts of formerly state- 
owned firms, or distributing vouchers to citizens that they then could invest (like 
in the stock market) in privatizing firms or managed funds that invested on their 
behalf (Megginson and Netter 2001). Likewise, there were significant differences 
in the pace of private sector expansion, characterized as a shock therapy approach 
vs. gradualist privatization (Spicer, McDermott and Kogut 2000). For example, 
in Bulgaria and Slovenia the private sector share in GDP by 1996 was 50 percent, 
while for Estonia and Hungary this figure stood at 80 percent. 

Has the size of the private sector influenced the differences in income inequal- 
ity across post-socialist countries and over time? We argue, yes. Privatization is 
principally related to capital accumulation, and the more extensive the privatiza- 
tion, the greater the opportunities for differentiation among households based on 
their private sector activities. In general, individuals can garner higher incomes 
from jobs in the private sector than in the public sector, which creates greater dif- 
ferentiation at the top of the income distribution. On the other hand, those in the 
private sector can also lose jobs more easily (and end up among the unemployed 
poor), which adds to differentiation at the bottom of the income distribution 
(see Milanovic [1999] for a similar argument based on individual-level data). We 
argue that in the aggregate, the greater the size of the private sector, the greater the 
opportunity for individual differentiation and the higher the overall income inequality. 


Redistributive State Retrenchment 


A major factor explaining cross-national differences in inequality is variation in 
the institutions that mediate the labor market outcomes. The institutions that 
encourage an equitable distribution of income in the West include the presence 
of unions (Western 1997; Streeck 2005), corporatist arrangements (Hicks 1999; 
Swank 2002; Wilensky 2002), social-democratic parties and other facets of what 
has generally been termed the welfare state (Esping-Andersen 1990). For example, 
Alderson and Nielsen (2002) find that income inequality is significantly higher in 
countries where wage setting coordination is absent and labor decommodification 
is low (cf., Korpi and Palme 1998). 

Recently, decreases in the welfare state provisions associated with the rise of 
neoliberalism (i.e., welfare state retrenchment) have been shown to contribute to 
income inequality increases in the Western nations (e.g., Alderson and Nielsen 
2002). Similarly, we argue that the shrinking redistributive role of the formerly 
socialist states, or redistributive state retrenchment, has consequences for income 
inequality in CEE. The socialist system put equality as a top political priority, and 
the provision of universal healthcare and education, along with full employment, 
was integral to state redistribution efforts (Kornai 1992). The neoliberal market 
transition reforms more or less widely embraced by post-communist governments 
posed great challenges to the legitimacy of a redistributive socialist state (Standing 
1996). Despite these general trends of a declining redistributive state, cross-national 
differences in the equalizing activities of the state within the post-socialist 
region do exist because some states preserved a greater redistributive capacity than 
others (Haggard and Kaufman 2008). For example, Garner and Terrell (1998) 
report that since the early 1990s, the Czech Republic put several “safety net” and 
welfare state programs in place, including the establishment of unemployment 
benefits and retirement pensions. The authors conclude that the social insurance 
programs, particularly retirement pensions, played a large role in mitigating the 
unequal effects on the poor of the transition to a market economy in the Czech 
Republic (cf., Zaidi 2009). In contrast, unemployment benefits and pensions have 
been quite low in Romania, and the real value of allocations to government-run 
social welfare programs decreased steadily after 1989 (Rotariu and Popescu 1999). 
Romania is also one of the CEE countries with the highest income inequality. 

In short, the extent to which the state retrenches from its historically active role in 
redistribution, or maintains social protections, should have a significant impact on 
patterns of income inequality during post-socialist transition. We would expect that 
the greater the state expenditures in a country, the lower the overall income inequality. 


Social Exclusion of Ethno-National Minorities 


The collapse of communism also brought political-cultural transformations that 
heightened social differentiation between people. Increasing marketization and 
democratization privileged individual rights and responsibilities over collective in- 
terests. One key aspect of differentiation became ethno-national identities, distin- 
guishing those who belong to the core-national populations from ethno-national 
minorities, such as the Roma (Gypsies) who live throughout CEE, Russians in the 
Baltic states, Slovaks in the Czech Republic and Turks in Bulgaria. 

Discrimination against ethnic minorities can be linked to the amplification of 
nationalist sentiments that gained widespread popularity after the collapse of 
communism and were “widely available and resonant as a category of social vi- 
sion and division.” (Brubaker 1996:21) The world witnessed a proliferation of 
national movements and national sentiments in post-socialist Europe after the 
end of Communist Party rule and assertion of national sovereignty of several 
newly-established post-socialist states (Calhoun 1993; Cohen 1999). Competing 
parties used nationalism as a tool of political mobilization and support so that in 
a number of countries, “the rhetoric and symbols with the greatest electoral appeal 
were national(ist) ones.” (Verdery 1998:294) Overall, nation-oriented idioms had 
a prominent place in the cultural repertoires of people experiencing and making 
sense of budding democratization and marketization. 

However, any differentiation can easily serve as a basis for social exclusion and 
thereby lead to inequality. Consequences can be particularly stark when ethno- 
national exclusion becomes institutionalized and “postcommunist governments 
take the view that they do not represent citizens but the [core] nation” (Schöpflin 
1996:153), a situation faced by ethnic Russians in the Baltic States of Estonia and 
Latvia, countries which have the largest minority populations in CEE. Ethnic 
Russians, who were encouraged to migrate to the Baltic States when these were 
annexed to the USSR, were denied automatic citizenship in Estonia and Latvia 
upon the break-up of the Soviet Union as were their offspring born after 1940. As 
Kirch (1997) reports, ethnic Russians in Estonia could acquire citizenship through 
naturalization, which required knowledge of the local language and history. This 
represented an obstacle for Russians who grew up in Soviet Estonia because they 
were only proficient in Russian, which was the de facto official language in the 
USSR. However, Estonian language proficiency became a requirement for employ- 
ment after Estonia's independence. The Russian-speaking minority is therefore 
more vulnerable to economic disadvantage than ethnic Estonians (Budryte 2005). 

Moreover, as Smooha (2001:6-7) asserts, “there is a strong tendency for indig- 
enous minorities to be non-assimilating [and] for majorities to be intolerant of 
cultural diversity” in many post-socialist states. One key example of social exclusion 
in Central and Eastern Europe is the case of Roma, who, unlike other minorities, are 
often treated as belonging to different ethnic and racial categories (Emigh, Fodor and 
Szelényi 2001), often facing both nationalist and racialized discrimination. Roma 
experienced a relatively high degree of social inclusion under communism when 
“full employment, industrialization, socialist urbanization, the housing policy, ‘free’ 
health care, and mass education all worked to incorporate the Roma more fully in 
society.” (Magyari et al. 2001:136) However, this ethnic group has been among the 
worst affected by the transition. As Magyari et al. (2001:135) note, “in the years after 
1989, ethnic segregation has become rather common... [and the Roma] have been 
the most negatively affected by it. The ‘newly created segregation’ of the Romanian 
Roma is forced, built on Roma marginalization, experiences of discrimination, and 
their rejection from the majority society.” (See also Szalai 1999.) Similarly, ethnic 
Turks in Bulgaria have experienced rampant poverty (Todorova 1999). 

In short, the collapse of the communist system in CEE countries promoted 
an atmosphere of social differentiation that put ethnic minority groups at risk of 
reactionary policies, discriminatory employment practices and other problems. 
This dynamic has been particularly pronounced among the ethnic Russians in the 
Baltic States, and the Roma in other CEE countries. We argue that countries with 
substantial ethno-national minority populations (i.e., those of ethnicities other 
than nominal nationals) will have larger segments of their populations subject to 
some form of social exclusion. As a consequence, this will lead to impoverishment 
of ethno-national minorities and greater levels of income inequality in countries with 
more sizable ethno-national minority populations. 


Foreign Investment 


Post-socialist transformations have been influenced not only by domestic processes 
but also by the changing character of the world-system. In fact, post-socialist trans- 
formations and the intensification of globalization have significantly influenced 
each other (Bandelj 2008). The rise in foreign direct investment after the 1980s, 
one of the key indicators of the intensity of economic globalization (Guillén 2001), 
has been spectacular. The collapse of communist regimes and the liberalization of 
CEE provided ample opportunity for foreign investors. To emphasize the impor- 
tance of the simultaneity of the two processes of market transition and globaliza- 
tion, some observers describe the penetration of foreign capital after the collapse 
of state socialism as “the most momentous structural force in the transformation 
process.” (Böröcz 2001:1163) 

The coincidence of rising inequality and foreign capital inflows suggests some 
type of relationship between them. Analysts overwhelmingly agree that national-level 
outcomes are affected by the integration of individual countries into the 
network of global connections (e.g., Mahutga 2006), and there is a significant 
amount of evidence supporting the inequality-increasing effects of FDI, although 
the purported mechanisms behind the effects are quite disparate. Early foreign 
capital dependency researchers suggested that foreign capital penetration distorts 
the evolution of the labor force structure by increasing the tertiary sector, where 
“income distribution is extremely polarized.” (Evans and Timberlake 1980:534) 
Moreover, they believed that governments in less developed countries implement 
policies that lower the bargaining power of labor and eliminate provisions that 
encourage full employment and wage enhancement in order to attract foreign in- 
vestment (Beer and Boswell 2002; Bornschier and Ballmer-Cao 1979; DeMartino 
1998; McMichael 1996; Ranney 1998). 

A major aspect of early dependency thinking on the relationship between FDI 
and underdevelopment was that short-term “flows” tended to spur development, 
while the long-term accumulation of “stock” tended to harm it, but this distinction 
was not made with respect to the relationship between FDI and income inequality 
(Bornschier et al. 1978; Bornschier and Chase-Dunn 1985). Moreover, we propose 
that several unique characteristics of post-socialist CEE states set them apart from 
the prototypical underdeveloped country. First, inflows of FDI were practically 
non-existent prior to the fall of communism (Bandelj 2008), so there was little 
opportunity for the sort of long-term accumulation of FDI capital stock consistent 
with the FDI penetration literature. Second, post-socialist CEE countries had a 
very high degree of industrialization prior to the fall of socialism compared to the 
prototypical underdeveloped country (Kornai 1992). Finally, post-socialist 
countries had comparatively low levels of income inequality prior to the fall of socialism 
(Heyns 2005). Taking these specific features into account, we suggest that FDI 
increases inequality in CEE in the short term via three potential causal pathways. 

First, FDI in post-socialist Europe primarily entered the sectors of trade, busi- 
ness activities and financial services (UNCTAD 2006), and therefore contributed 
to a growing cadre of high wage managerial employment. Indeed, in his study 
of firm restructuring in several post-socialist countries, King (2001) finds that 
the wealth and prestige that go along with being a manager in a foreign multina- 
tional are quite substantial. Thus, irrespective of the issue of whether or not FDI 
depresses labor wages, it could increase overall inequality by increasing the wage 
premium to management within the foreign sector (Milanovic 1999). 

Second, FDI inflows to CEE might increase inequality by raising the wage 
premium to workers in the foreign sector. Indeed, there is some evidence for this dy- 
namic at work in countries outside of the CEE (Aitken, Harrison and Lipsey 1996; 
ILO 1998; Moran 2002), where the foreign sector is shown to be more productive 
than the domestic sector for various reasons, including a tendency for higher capital 
and scale intensity among foreign firms vis-a-vis domestic ones (Aitken, Harrison 
and Lipsey 1996; Lipsey and Sjoholm 2001), which increases the demand for skilled 
labor in the foreign sector relative to unskilled labor in the domestic sector, inducing 
divergence between the two (Feenstra and Hanson 1997). Indeed, the limited evidence 
from organizational-level studies in CEE shows that companies privatized with foreign 
capital are, on average, more productive than those privatized by domestic 
capital (Megginson and Netter 2001). This dynamic could also lower productivity 
in the domestic sector if the foreign sector absorbed the majority of the skilled labor, 
in turn lowering wages in the domestic sector vis-a-vis those in the foreign sector. 
This wage dynamic resonates with the finding that FDI penetration lowers the 
productivity of total capital investment through its reduction in the returns to the 
domestic investment rate (Dixon and Boswell 1996b). 

In short, FDI may increase the wages of management vis-a-vis labor within the 
foreign sector. It may also increase differences between the foreign and domestic 
sectors due to higher productivity-linked wages in the foreign sector, and/or the 
foreign sector absorbing a larger proportion of the skilled labor force than the do- 
mestic sector. Any or all of these mechanisms lead us to expect a positive association 
between FDI inflows and income inequality over the short term in CEE. 


Data and Methods 
Sample 


We assembled a dataset that includes 10 post-socialist countries, Bulgaria, the Czech 
Republic, Estonia, Hungary, Latvia, Lithuania, Poland, Romania, Slovakia and 
Slovenia, for the period from 1989 to 2001. These countries represent a set of 
most comparable post-socialist cases because of their common membership in the 
European Union and the similar political and economic changes that acquiring such 
membership implies. Because of data limitations, panels are unbalanced, with varying 
numbers of observations over time for different countries. (See Appendix A for 
the list of country/year observations.) This is a commonly used strategy in inequality 
studies and necessitates the estimation of models with fixed effects or random effects, 
rather than ordinary least squares, to correct for unmeasured country-specific 
heterogeneity. (See Alderson and Nielsen [1999, 2002] for a detailed discussion of 
these methods for analyzing unbalanced panel data.) To present models that allow 
for comparison across different specifications, we estimate all models on a sample 
of 75 observations for which data on all variables are available. 


Dependent Variable 


The dependent variable in this analysis is the Gini index for each country/year 
observation. The details of its operationalization are provided in Table 1. The 
data come from the TRANSMONEE dataset on CEE, which is the largest East 
European dataset with Gini indices across our countries of interest and over time. 
Researchers have noted that there are differences in measurement across datasets 
(for review, see Heyns 2005), so using a single source is most appropriate. A limitation 
of these data is that they refer to the formal sector of the economy and do 
not capture inequality arising from informal employment.⁵ Despite this limitation, 
the inequality data we analyze are the best data available and comparable to that 
used by other researchers (e.g., Alderson and Nielsen 1999; Lee 2005). 
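
To make concrete what the dependent variable measures, the following minimal sketch computes a Gini index on a 0-100 scale from a list of incomes. The function name and the income values are hypothetical and serve only as illustration; the TransMONEE operationalization summarized in Table 1 may differ in its income concept and unit of analysis.

```python
def gini_index(incomes):
    """Return the Gini index (0-100 scale) for a list of non-negative incomes."""
    values = sorted(incomes)
    n = len(values)
    total = sum(values)
    if n == 0 or total == 0:
        raise ValueError("need at least one positive income")
    # Rank-weighted formula: G = 2 * sum(rank_i * x_i) / (n * sum(x)) - (n + 1) / n
    weighted = sum(rank * x for rank, x in enumerate(values, start=1))
    return 100 * (2 * weighted / (n * total) - (n + 1) / n)


# Hypothetical incomes, for illustration only.
equal_society = gini_index([10, 10, 10, 10])    # perfectly equal distribution
unequal_society = gini_index([0, 0, 0, 100])    # one person holds all income
```

A perfectly equal distribution yields 0, while concentrating all income in one of four people yields 75, the maximum for a group of that size.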

In order to establish causal priority of predictors to the outcome, we lag the 
independent variables one year so that all right-hand covariates are measured at 
time t and the dependent variable is measured at time t+1. We opt for a one-year 
time lag for several reasons. First, a priori rationales for how long a lag should be 
do not exist, and studies of income inequality commonly use a one-year time 
lag (Alderson and Nielsen 1999, 2002; Lee 2005; Nielsen 1994; Nielsen and 
Alderson 1995), or a variety of longer time lags (e.g., Kentor 1998; Dixon and 
Boswell 1996a). Second, we base our decision about the length of the lag on both 
substantive and methodological considerations. Substantively, we are interested in 
estimating the effect of our independent variables on income inequality over the 
short term because the primary causal processes we identify began after the fall of 
Communism in 1989, so a short time lag is more appropriate. Methodologically, 
the issue of sample size is very relevant as each additional year lagged reduces the 
sample size by 10 percent. In our case, even a modest five-year lag would cut our 
sample size by more than half, bringing it to 33 observations and making it impos- 
sible for us to assess the robustness of our main variables of interest to alternative 
explanations of the differential increases of income inequality in CEE. Unreported 
analyses show that the coefficients that result from longer lags are substantively 
identical to those reported, except for the substantial decreases in statistical power. 
These are available upon request. 
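
The lagging procedure, and the way longer lags shrink an unbalanced panel, can be sketched as follows. This is an illustrative sketch under stated assumptions, not the procedure used in the analysis: the function name, the variable name `fdi_per_capita`, the country labels and all values are hypothetical.

```python
def build_lagged_pairs(gini, covariates, lag=1):
    """Pair covariates measured at year t with the Gini index at year t + lag.

    gini:       dict mapping (country, year) -> Gini index
    covariates: dict mapping (country, year) -> dict of predictor values
    Returns a list of (country, year_t, predictors_at_t, gini_at_t_plus_lag).
    """
    pairs = []
    for (country, year), predictors in sorted(covariates.items()):
        outcome = gini.get((country, year + lag))
        if outcome is None:
            continue  # no outcome observed `lag` years later, so the case is lost
        pairs.append((country, year, predictors, outcome))
    return pairs


# Hypothetical unbalanced panel, for illustration only (not the TransMONEE data).
gini = {("A", 1991): 24.0, ("A", 1992): 25.5, ("A", 1993): 27.0, ("B", 1992): 30.0}
covariates = {
    ("A", 1990): {"fdi_per_capita": 10.0},
    ("A", 1991): {"fdi_per_capita": 15.0},
    ("A", 1992): {"fdi_per_capita": 22.0},
    ("B", 1991): {"fdi_per_capita": 5.0},
}
one_year = build_lagged_pairs(gini, covariates, lag=1)
two_year = build_lagged_pairs(gini, covariates, lag=2)
```

In this toy panel the one-year lag retains four usable observations and the two-year lag only two, which mirrors the trade-off between lag length and sample size described above.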
Independent Variables 


The key independent variables of interest capture social processes of privatization, re- 
distributive state retrenchment, social differentiation and global integration in their 
impact on post-socialist income inequality, as indicated by the size of private sector, 
government expenditures, ethno-national minority size and FDI inflow, respectively. 

We also include a set of controls based on previous research on income inequal- 
ity that follows Kuznets’ (1953, 1955) theory of an inverted U-shape relationship 
between development and inequality. Hence, we control for GDP/capita and its 
square term. Nielsen (1994) extended Kuznets and specified the mechanisms 
underlying the inequality transition over the course of development: the size of 
the agricultural sector, sector dualism, the demographic transition and the spread 
of education (see also Alderson and Nielsen 1999, 2002). We control for the influ- 
ence of all these factors, as well as the level of unemployment, domestic investment 
(Alderson and Nielsen 1999) and changes in the share of employment in the 
service sector (Evans and Timberlake 1980). In addition, we include a variable 
that captures inequality levels in 1990, to control for initial country differences. 

We also fit a time trend. 

The list of all the independent variables used in the analysis, with operationalizations 
and descriptive statistics, is included in Table 1. Zero-order Pearson’s 
correlation coefficients appear in Appendix B. 


Pooled Cross-Sectional Time Series Analysis 


Our data set consists of a pooled cross section of time series. Pooling requires us 
to control for unmeasured heterogeneity across cases by estimating either random-effects 
models (REM) or fixed-effects models (FEM) (Amemiya 1985; Halaby 2004; for applications 
see Alderson and Nielsen 1999, 2002; Gustafsson and Johansson 1999). 
For methodological and substantive reasons, we report REM. First, while 
FEM are the most conservative approach because they control for any unmeasured 
time invariant variation across cases by removing all between-country variation, 
Tuma and Hannan (1984) showed that REM are asymptotically more efficient 
relative to FEM, which is especially helpful in the context of our small sample. 
Moreover, the FEM approach precludes the identification of parameters for time 
invariant covariates, such as the ethno-nationalism variable that we include. In 
addition, we are primarily interested in whether or not social processes explain 
the cross-case variation in inequality change over time. Thus, we want to include 
the initial value of inequality because it generalizes to a dynamic interpretation in 
which right-hand side variables predict change in income inequality from time 1 
to time T (e.g., Jackman 1980). 
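
The reason FEM cannot identify parameters for time-invariant covariates can be seen in a small sketch of the within (country-demeaning) transformation that underlies fixed-effects estimation. The panel below, including the variable names `fdi` and `minority_share`, is hypothetical and only illustrates the mechanics.

```python
from collections import defaultdict


def within_transform(rows, variable):
    """Subtract each country's own mean from its observations of `variable`."""
    by_country = defaultdict(list)
    for row in rows:
        by_country[row["country"]].append(row[variable])
    means = {c: sum(v) / len(v) for c, v in by_country.items()}
    return [row[variable] - means[row["country"]] for row in rows]


# Hypothetical panel: minority share does not vary over time within a country.
rows = [
    {"country": "A", "year": 1995, "fdi": 10.0, "minority_share": 0.25},
    {"country": "A", "year": 1996, "fdi": 14.0, "minority_share": 0.25},
    {"country": "B", "year": 1995, "fdi": 3.0, "minority_share": 0.5},
    {"country": "B", "year": 1996, "fdi": 5.0, "minority_share": 0.5},
]
fdi_within = within_transform(rows, "fdi")                   # [-2.0, 2.0, -1.0, 1.0]
minority_within = within_transform(rows, "minority_share")   # [0.0, 0.0, 0.0, 0.0]
```

After demeaning, the time-varying covariate retains within-country variation, whereas the time-invariant covariate collapses to zeros and so carries no information from which a coefficient could be estimated.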

While the REM approach is more efficient and more flexible across specifications, 
it is valid only if the assumption that the unmeasured unit effects are 
uncorrelated with the regressors is met (Halaby 2004). The Hausman test (1978), the 
textbook procedure to determine which approach is appropriate, is a test of the 
hypothesis that β_FEM − β_REM = 0, for all β. Differences across the two approaches raise 
concern that the unit effects are correlated with the regressors, and therefore that 
the REM estimates of β are plagued with heterogeneity bias. Should the FEM and 
REM specifications reveal no substantive differences in β, there is little reason to 
worry about such bias. We estimated Hausman tests across all specifications, and 
in no case were we able to reject the hypothesis of no significant difference in 
β across the two estimating procedures. 
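
The logic of the Hausman test can be illustrated for the simplest case of a single coefficient, where the statistic is the squared difference between the FEM and REM estimates divided by the difference in their variances, referred to a chi-square distribution with one degree of freedom. This is a simplified sketch, not the multi-coefficient test run in Stata for the analysis; the function name and all numeric inputs are hypothetical.

```python
import math


def hausman_single_coefficient(b_fe, var_fe, b_re, var_re):
    """Hausman statistic and p-value for one coefficient (chi-square, 1 df)."""
    var_diff = var_fe - var_re
    if var_diff <= 0:
        raise ValueError("var_fe must exceed var_re for the statistic to be defined")
    statistic = (b_fe - b_re) ** 2 / var_diff
    # Upper-tail probability of a chi-square(1) variable: P(X > h) = erfc(sqrt(h / 2)).
    p_value = math.erfc(math.sqrt(statistic / 2))
    return statistic, p_value


# Hypothetical estimates, not taken from the tables in this article.
stat, p = hausman_single_coefficient(b_fe=0.52, var_fe=0.05, b_re=0.45, var_re=0.03)
```

A large p-value, as in this hypothetical case, means the hypothesis of no systematic difference between the two sets of estimates cannot be rejected, which is the situation in which reporting REM is defensible.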

Finally, an additional requirement of pooled cross-section of time series models 
is that there be no serial correlation in the idiosyncratic errors. To the extent that 
this serial correlation does exist, it must be accounted for. We test the hypothesis 
that the idiosyncratic errors do not follow a first order autoregressive process, 
and these tests uniformly led to the conclusion that serial correlation in the idiosyncratic 
errors was not present (Drukker 2003; Wooldridge 2002). Still, we 
re-estimated the full models after estimating and adjusting for a first-order autoregressive 
process in the idiosyncratic errors, and the results were substantively identical. 
We also estimated a series of models to make sure that the linear time trend we 
include is preferable to T-1 dummy variables, the results of which suggested that 
the cost of the latter approach in terms of degrees of freedom was not justified 
(Greene 2000). These analyses are available upon request. 

We analyzed the data using Stata 9.2. In addition, we tested for potential outliers 
and influential cases. None of the individual cases was unduly influential, so we did 
not exclude any observations and ran all the models with a sample of 75 observations. 


Results 


The list of potentially relevant variables is long compared to the degrees of freedom. 
Hence, we conduct the analyses in two stages. In the first stage (Table 2), we fit 
a baseline model by examining the relevance of development indicators. Then 
(Table 3), we incorporate the statistically significant baseline model indicators 
into the analysis of the key variables of interest: privatization, redistributive state 
retrenchment, exclusion of minorities and foreign investment. 

The findings reported in Table 2 show that economic development is not 
related to income inequality trends in post-socialist Europe. We tried various 
specifications of this relationship, including a linear term (Model 1) and a 
quadratic term (Model 2) but neither was significant. Model 3 shows that most 
of the other covariates for the baseline development model do not significantly 
predict inequality changes in these post-socialist countries. The only factor 
from the baseline model that has a robust relationship to the income inequality 
upswing and explains a significant amount of variance is sector dualism. Sector dualism 
captures the wage gap between high industrial sector wages and low agricultural 
sector wages. Our results indicate that the greater this gap, the greater the 
subsequent inequality. However, none of our countries experienced major shifts 
in the agricultural sector during the transition period (in most countries this 
Table 2: Baseline Model: Unstandardized Coefficients from Random-Effects 
Regression of Income Inequality on Development Indicators 

                                   Model 1       Model 2       Model 3 
Baseline Development Indicators 
  GDP/capita (log)                 [illegible]   [illegible] 
  GDP²/capita (log)                              8.086 
                                                 (.70) 
  Labor in agriculture                                         6.401 
                                                               [illegible] 
  Sector dualism                                               31.269** 
                                                               (2.47) 
  Unemployment                                                 -.048 
                                                               (-.50) 
  Education level                                              -.016 
                                                               (-.24) 
  Population growth                                            -.637* 
                                                               (-2.25) 
Controls 
  Gini 1990                        [illegible]   [illegible]   [illegible] 
                                   [illegible]   (2.36)        (2.49) 
  Time trend                       [illegible]   [illegible]   .451** 
                                   (7.44)        (7.29)        (2.69) 
  Constant                         39.538        [illegible]   [illegible] 
                                   [illegible]   (.94)         (.75) 
R²                                 .490          [illegible]   .610 

Notes: Coefficients are obtained with random effects GLS regression. 
*p < .05 **p < .01 ***p < .001 (two-tailed tests), t-values based on robust standard 
errors are in parentheses, N = 75 


share remains relatively stable over time). Thus, it seems that a more plausible 
explanation for the dualism effect is the contraction of the contribution of the 
agricultural sector to overall GDP, which reduces the denominator in the dual- 
ism ratio and contributes to higher values on the dualism score. This can happen 
on account of the absolute decline in the agricultural sector as well as because 
of the relative decreased productivity of the agricultural sector compared to the 
non-agricultural sector. We propose that in post-socialist Europe the effect is 
due to both of these mechanisms. Some other studies provide evidence of strug- 
gling agricultural sectors, in absolute terms. For example, Greif (1998) reports 
that Eastern European agriculture faces a “subsistence crisis,” and Maddock 
(1993) and Hristova and Maddock (1995) provide evidence of this for Lithuania 
and Bulgaria. The robust effect of the sector dualism variable leads us to include 
it in subsequent models reported in Table 3. 

Table 3 presents the results of our key variables of interest. The significant posi- 
tive association between the size of the private sector and inequality reported in 
Model 1 suggests that the amount of privatization in CEE is associated with larger 
increases in income inequality.⁶ Only certain segments of the population benefited 
from the possibilities for income accumulation associated with market transition. 

In order to assess the effect of redistributive state retrenchment, we include a 
variable for overall government expenditures. Its effect is consistently negative in 
Table 3, as hypothesized, and in line with much literature that argues for the im- 
portance of institutions on social outcomes (e.g., Esping-Andersen 1990; Western 
1997; Korpi and Palme 1998). We find evidence that income inequality is lower 
in cases where government expenditures, including social transfers and provision 
of public goods, are higher. 

In addition to these state-institutional factors, we also highlighted the role of 
socio-cultural factors as predictors of income inequality by arguing that ethno- 
national minorities were especially vulnerable during post-socialist transition. We 
argued that inequality should be higher in societies with a larger share of minority 
population, such as sizable Russian minorities in Estonia, Latvia and Lithuania, 
Roma in Hungary and Romania, and Roma and Turks in Bulgaria. This is re- 
flected in the significantly positive effect of our covariate that captures the relative 
size of ethno-national minorities reported in Model 2 of Table 3. In short, the 
larger the ethno-national minorities in the population, the higher the income inequality 
in that country. It is important to note that the Roma minority in particular 
may be subject to indirect effects of ethno-nationalism that work through lower 
educational attainment and higher fertility rates among them, perpetuating their 
economic disadvantage (Emigh, Fodor and Szelényi 2001). 

Model 3 introduces foreign investment penetration as an explanation for in- 
come inequality in postsocialism. FDI was restricted in CEE during socialism, but 
the region opened up to FDI during market transition to varying degrees across 
individual countries. Does variation in FDI inflow explain variation in income 
inequality across these CEE states? The positive and statistically significant coef- 
ficient for the per capita inflow of FDI in Model 3 suggests that foreign capital 
penetration is indeed associated with greater subsequent income inequality levels 
and increases in the initial 10 years of market transition. 

Moreover, FDI seems to intervene in the relationship between privatization 
and income inequality. This is not surprising when we consider that size of the 
private sector is strongly correlated with FDI inflow. In fact, available research 
suggests that size of the private sector is a significant determinant of the amount 
of incoming FDI, reflecting the extent to which post-socialist transition presented 
opportunities for foreign investors to come and acquire privatizing or newly privatized 
firms in CEE (Bandelj 2008). Indeed, as Model 4 shows, a specification that 
includes FDI in place of privatization produces a higher R² than when privatization 
is included, prompting us to exclude privatization from Model 5. In short, the fact that the 
significant effect of privatization disappears once the FDI variable is added to the 
model suggests that it is the volume of FDI, not the size of the private sector per 
se, that is more consequential for increasing income inequality. 

Finally, Model 5 introduces domestic investment and changes in tertiary em- 
ployment as a final pair of control variables. With respect to tertiary employment, 
changes in the share of employment in the service sector have been shown to 
intervene in the relationship between FDI and income inequality (Evans and 
Timberlake 1980), so we want to control for this potential cause of rising income 
inequality in CEE. It is important to include domestic investment (Alderson and 
Nielsen 1999) because it is possible that FDI is acting as a proxy for domestic capital 
formation, in which case the inequality upswing would be related to processes 
of domestic rather than foreign capital formation. As Model 5 shows, neither 
change in the share of employment in the service sector nor domestic investment 
is a significant predictor of income inequality, and the effects of our four primary 
variables of interest remain robust to their inclusion. We also estimated a series of 
models in order to assuage concern over spurious “denominator effects” (Alderson 
and Nielsen 1999; de Soysa and Oneal 1999; Dixon and Boswell 1996; Firebaugh 
1992; Kentor 1998), which suggested that our estimates of the effect of FDI 
inflows per capita are robust to that potentiality.⁷ 


Discussion and Conclusion 


Socialist CEE states represented relatively egalitarian and industrially developed 
societies. The collapse of communist re- 
investment per capita also registered more income inequality. In fact, the similarity 
between the privatization and foreign capital penetration effects, coupled 
with the marginal inequality-reducing effect of domestic investment, suggests that 
privatization strategies favoring foreign capital created more inequality than those 
favoring domestic capital. 

Overall, these findings provide strong evidence for social foundations of in- 
come inequality trends. The inequality increasing effect of ethno-national minor- 
ity exclusion seems to be the most obvious of these social factors. But privatization 
and globalization related outcomes also have social bases. Researchers have argued 
that privatization is a neoliberal policy idea (Brune, Garrett and Kogut 2001), and 
that its adoption is heavily influenced by institutional factors (Henisz, Zelner and 
Guillen 2005). Scholars of post-socialist transition point to the role of politics and 
socialist legacies in decisions about the speed and forms of privatization (Stark and 
Bruszt 1998) and decisions to institutionalize and legitimize FDI as a desirable 
development strategy (Bandelj 2009). 

Ultimately, then, the key implication of our findings is that major upsurges 
in inequality are not an inevitable feature of the transition from socialism to 
capitalism. Post-socialist states that made political choices to preserve a larger state 
sector, limit FDI penetration and keep individualist orientations in check with 
institutional provisions for social protection experienced less income inequality 
after the collapse of communism than did their counterparts. 

Finally, we want to summarize the contributions that our research makes to 
three strands of scholarship. First, our results have implications for the market 
transition thesis about positive externalities that would (at least initially) accom- 
pany the spread of markets, which commands a great deal of attention among 
scholars of post-socialist transformations (Nee 1989; Róna-Tas 1994; Szelényi and 
Kostello 1996; Gerber and Hout 1998; Gerber 2002). We complement existing 
individual-level studies with a macro cross-national analysis and find that income 
inequality increases across and within CEE countries in the first decade after the 
fall of communist regimes. The effect of sector dualism suggests that farmers and 
agricultural laborers, who accumulated little political and social capital during 
socialism (Hanley 2003; Titma, Tooding and Tuma 2004), have been particu- 
larly left behind in the market transition and are a segment of the population 
particularly prone to poverty in many CEE countries (Szalai 1999; Todorova 
1999). In addition, our findings suggest that ethno-national minorities have been 
economically disadvantaged, likely due to various formal institutional and informal 
prejudice-based obstacles to fair social inclusion (Emigh and Szelényi 
2001). Moreover, declines in the redistributive role of post-socialist states further 
disadvantage those on the margins of society. 

Second, our research adds to the literature on income inequality and globaliza- 
tion, finding that inflow of FDI is significantly related to higher income inequality 
in the first decade of post-socialist market transition. Available qualitative evidence 
and the sectoral distribution of FDI lead us to believe that FDI increases inequality 
by increasing the wage premium to management within the foreign sector and by 
increasing the wage gap between the foreign and domestic sectors. In the latter 
case, the gap may increase if foreign firms provide higher wages or if large flows of 
FDI tend to decrease the productivity of the domestic sector through “brain drain.” 
Our analysis also suggests that these FDI effects already manifest themselves in 
the short/medium term. What the long-term consequences of this dependency on 
Western capital will be for Eastern Europe is yet to be discovered. 

Third, this research advances the economic sociology perspective on macro- 
economic phenomena, a line of inquiry deserving more attention by economic 
sociologists (Granovetter and Swedberg 2001). Our findings prompt us to recon- 
sider general explanations that assume income inequality is a direct by-product 
of economic development. Socio-economic processes that generate income in- 
equality in CEE (privatization, foreign capital penetration and ethno-national 
minority exclusion) and institutional factors that deter rising differentiation 
(the preservation of the redistributive capacity of the state) constitute the social-structural, 
political and cultural context of postsocialist stratification. Researchers 
should test these arguments on a larger set of cases, over a longer time span 
and with more refined measures. Such research may further explicate the social 
foundations of economic inequality trends. 


Notes 


1. In one of the very few studies that include socialist countries in their analyses of 
inequality, Alderson and Nielsen (1999) find that the presence of a Marxist/Leninist 
regime reduces the Gini coefficient by 5.3 to 10.1 points. 


2. Decommodification refers to the differing degrees to which individuals in different 
societies are able, given the same level of total social welfare expenditure, to opt out 
of the market while maintaining a socially acceptable standard of living (Esping- 
Andersen 1990; Alderson 1997). While previous work finds a fairly robust negative 
association between income inequality and decommodification, it is also possible, as 
noted by one of the reviewers, that some workers could use the time afforded them 
through decommodification to engage in entrepreneurial or other types of activity, 
which could generate rising inequality within working classes. 


3. Unfortunately, over-time data for more nuanced aspects of state redistribution, 
including proportion of social transfers and health and education spending, are 
not yet available for CEE, so we can test only the influence of a most general 
indicator— government expenditures on income inequality. 


4. Some intra-regional investment activities occurred between COMECON countries 
(the Soviet Union and its satellite states) (McMillan 1987). However, such efforts 
did not involve direct equity investment of one COMECON state into another, 
and thus would not qualify as foreign direct investment according to contemporary 
international organizations that collect information on FDI, including the United 
Nations Conference on Trade and Development, International Monetary Fund, and 
the Organization for Economic Cooperation and Development. 
5. Most inequality datasets based on official reports and surveys have this limitation. In 
order to address it we estimated fixed-effects models to capture differential country 
propensities for informal employment, and the results for our independent variables 
of interest remained robust. 


6. In unreported analyses, we also estimated fixed-effects models to control for any 
differentiation due to privatization strategies across countries that we may not be able 
to measure directly, which were substantively identical with the ones reported here, 
consistent with the non-significant Hausman tests. 


7. We estimated a series of models that included both FDI stock and FDI flow 
separately, finding that they produce substantively identical effects owing to their 
high correlation (.940 in these data). We estimated a series that included flow and 
stock simultaneously, which revealed no evidence of spurious denominator effects; 
while both coefficients were positively signed, neither effect was significant, lending 
further support to the interpretation that they have substantively identical effects. 
Finally, we estimated models that controlled for the rate of FDI inflow (flow/stock), 
which was neither significant nor altered the effect of FDI on income inequality. 
These tables are available upon request. 
Appendix A. Data Structure and Sources 


Sample: Pooled Cross-National Time Series 

Country/Year Observations included in these analyses: Bulgaria (1990-1994; 1996- 
2000), Czech Republic (1990-93, 1996-2000), Estonia (1995-2000), Hungary (1990, 
1992-1993, 1996-), Latvia (1992-2000), Lithuania (1997-2000), Poland (1990-2000), 
Romania (1991-2000), Slovakia (1996-2000), Slovenia (1993, 1995-2000) 


Dependent Variable 
Gini Coefficient 
TransMONEE 2003 Database, UNICEF IRC, Florence 


Economic Development 

GDP per capita 
Source: World Development Indicators online data base: http://devdata.worldbank.org/dataonline 


Internal Development Model 


Secondary School Enrollment 
Source: TransMONEE 2003 Database, UNICEF IRC, Florence 


Natural Rate of Population Increase 
Source: TransMONEE 2003 Database, UNICEF IRC, Florence 


Share Employed in Agriculture 
Source: World Development Indicators online data base: http://devdata.worldbank.org/dataonline 


Share of Agriculture in GDP (for Sector Dualism measure) 
Source: EBRD Transition Report, 1999 and 2003 


Unemployment 
Source: EBRD Transition Report, 1999 and 2003 


Privatization 
Private Sector Share in GDP 
Source: EBRD Transition Report, 1999 and 2003 
Redistributive State Retrenchment 

Government Expenditure 

Source: World Development Indicators online data base: http://devdata.worldbank.org/dataonline 


Social Differentiation 

Ethno-national Minorities Size 

Source: Freedomhouse Transition Reports http://www.freedomhouse.org/template.cfm?page=1 


Global Integration 
Inward Foreign Direct Investment Flow 


Source: EBRD Transition Report, 1999 and 2003 


Controls 
Domestic Investment 

Source: World Development Indicators online data base: http://devdata.worldbank.org/dataonline 


Change in Tertiary Sector 
Source: World Development Indicators online data base: http://devdata.worldbank.org/dataonline 
The Ecology of Technological Progress: How Symbiosis and 
Competition Affect the Growth of Technology Domains 


Gianluca Carnabuci, University of Lugano 


We show that the progress of technological knowledge is an inherently ecological 
process, wherein the growth rate of each technology domain depends on dynamics 
occurring in other technology domains. We identify two sources of ecological inter- 
dependence among technology domains. First, there are symbiotic interdependencies, 
implying that the rate of growth of one technology domain is driven by the advances 
made in other technology domains. Second, some technology domains compete with 
each other, implying that the rate at which a given technology domain advances varies 
inversely with the competitive pressure it receives from other technology domains. 
Based on all the technological knowledge patented in the United States between 1975 
and 1999, we find statistical support for our argument and hypotheses. 





The question of why some technology domains grow faster than others has drawn 
the attention of both sociologists and economists (Carnabuci 2006). As a result, 
several factors have been identified that bear on this question. Some scholars have 
argued that the uneven progress of technological knowledge reflects the chang- 
ing needs of the market (Schmookler 1966), while others have emphasized the 
importance of institutional regimes (Bonaccorsi and Thoma 2007). Also, it has 
been argued that the growth of technological knowledge depends on “supply-push” 
factors, such as the ease and precision with which experimental tests can be carried 
out in a given technology domain (Nelson 2003), its inherent technical bounds 
(Girifalco 1991), and its degree of specialization (Carnabuci and Bruggeman 
2009). Furthermore, the progress of technology domains has been argued to hinge 
on the social construction of what becomes legitimately accepted as “new” knowl- 
edge by the relevant epistemic community (Latour 1987; Knorr-Cetina 1999). 
An implicit assumption common to all these explanations is that technology 
domains grow independently of each other. Departing from this assumption, 
our main contention in this paper is that technology domains evolve in an eco- 
logically interdependent fashion. We reckon that two kinds of ecological pressure 
impact the growth of technology domains. First, there may be symbiosis among 
some technology domains. That is, the rate of growth of one technology domain 
may be accelerated by the advances made in other technology domains. Second, 
some technology domains may be in competition with each other. When this 
happens, the rate at which a given technology domain advances can be expected
to vary inversely with the competitive pressure it receives from other technology 
domains. Due to symbiosis and competition, technological progress may therefore 
be described as an ecological process wherein the growth rate of each technology 
domain depends on dynamics occurring in other technology domains. 


An Ecological View of Technological Progress 


Nowadays, there is widespread agreement that the generation of new technological 
knowledge is ultimately driven by the novel recombination of existing knowledge 
(e.g., Nelson and Winter 1982; Weitzman 1998; Jones 2005; Fleming 2007). As 
the historian Usher (1929:11) was early in recognizing, “[i]nvention finds its 
distinctive feature in the constructive assimilation of pre-existing elements into 
new syntheses, new patterns, or new configurations [...].” Reflecting the view 
that technological progress largely occurs within path-dependent trajectories of 
accumulation (Dosi 1982), received research has underscored that the bulk of 
knowledge recombination takes place within the boundaries of well-circumscribed 
technology domains. Consistent with this view, extant models and empirical
studies have focused on the historical development of single technology domains
(Girifalco 1991; Malerba et al. 1999).
Recently, however, a different and complementary perspective was offered by 
Carnabuci and Bruggeman (2009). To explain why certain technology domains 
grow faster than others, the authors built on the premise that a non-negligible 
share of the knowledge generated in any technology domain results from the 
recombination of knowledge developed in other technology domains. Consistent 
with this perspective, the authors proposed to conceptualize the growth of technological
knowledge as an evolving network of knowledge recombination patterns
across technology domains. Figure 1 presents a simple visual example of the
authors’ proposed network model (Carnabuci and Bruggeman 2009).

“Figure 1 is an example of such a network which depicts a cross-cut of the 
growth process of a hypothetical stock of public technological knowledge within a 
given time window. This stock consists of three technology domains, A, B and C, 
in which 30, 40 and 10 new inventions, respectively, have accumulated over the 
given time interval. The arcs point to the domains from which ideas are taken and 
are drawn in the direction of knowledge search. Knowledge flows in the opposite 
direction of the arrows, though, and in the literature on diffusion (Rogers 2003) 
arrows are in line with the flows. The inventions generated in A resulted from the 
recombination of inventions from A’s own knowledge base 25 times, from B’s 30 
times, and from C’s 20 times. Or, equivalently, knowledge spilled over 25 times
from past to current inventions in A, 30 times from past inventions in B to current
inventions in A and 20 times from past inventions in C to current inventions
in A. During the same time interval, the inventions generated in A worked as an
input for inventions in B 40 times, i.e., ideas spilled over from A to B 40 times.”


Figure 1. Recombination Patterns Within and Between Technology Domains
Source: Carnabuci and Bruggeman (2009)


In this article, we build on Carnabuci and Bruggeman’s network conception to 
argue and show that technological progress is an inherently ecological process. In 
particular, our contention is that two kinds of ecological interdependence, symbiosis
and competition, arise among technology domains due to the dynamics of
knowledge recombination that underlie technological progress.


Symbiotic Interdependencies among Technology Domains 


In general terms, symbiotic interdependencies exist between two entities to the extent
that they derive some kind of fit benefit from each other. In biology, for example,
the ocellaris clownfish is known to entertain a mutually symbiotic relationship with 
the Ritteri sea anemones. By residing among the anemones’ tentacles, the clownfish 
guards the anemone from anemone-eating fish, while in turn the tentacles of the 
anemone fence off the clownfish from its predators. Symbiotic phenomena occur 
outside the biological sphere as well. Indeed, in sociology the role of symbiotic
interdependencies was recognized early on as essential to many diverse social
dynamics (Durkheim 1997[1893]), including social influence (Park 1936), social 
cohesion (Gross 1956), organizational survival (Barnett and Carroll 1987), urban- 
ization (Hawley 1950), and state formation (Ingram and Simons 2000). Similarly, 
our contention is that symbiotic interdependencies may intervene in the process 
of technological knowledge growth, implying that the progress of one technology 
domain is driven by the progress of other technology domains. 

While the role of symbiotic interdependencies across technology domains has 
not yet been systematically investigated (Pistorius and Utterback 1997), the lim- 
ited available empirical evidence provides some support to this possibility. For 
example, in his analysis of technological change, Girifalco (1991:43) noticed that 
quite often “bottlenecks in one technology may call forth advances in another, 
advances in one may open up opportunities in another, or progress in one may
have to await improvements in the other.” Furthermore, the history of human 
inventions abounds with anecdotal evidence suggesting that symbiotic interac- 
tions across technologies may constitute an important and pervasive mechanism of 
technological progress. To name a few notable cases, the invention of the electric 
engine triggered new fruitful technological trajectories in the fields of refrigera- 
tion, washing and vacuum cleaning, among others. Similarly in more recent times, 
advances in the field of biotechnology have spawned new technological develop- 
ments in a wide range of technology domains from agricultural chemicals to phar- 
maceutical drugs, while important technological advances in the biotechnology 
field itself have resulted from technologies developed in the fields of biorobotics 
and information technology. 

Providing a general theoretical interpretation of these examples, the knowledge 
recombination view suggests that the growth of any given technology domain is en- 
hanced by developments occurring in other domains, granted that some aspects of 
these developments are productively combined into the focal technology domain. 
From this perspective, the existence of symbiotic interdependencies across technol- 
ogy domains is neither a marginal nor an exceptional phenomenon. Rather, it is an 
inherent characteristic of the process of knowledge recombination through which
technological knowledge advances. The argument that the process of knowledge 
recombination engenders interdependencies among technology domains has a 
straightforward and testable implication. Namely, the more heavily a technology 
domain depends on another as a source of knowledge recombination, the more 
the growth rate of the former will tend to be driven by the growth rate of the latter. 

Because it captures the extent to which technology domains are dependent on 
each other as sources of knowledge recombination, Carnabuci and Bruggeman’s 
network model provides a general framework through which the effect of symbiot- 
ic interdependencies across technology domains can be both formally represented 
and empirically tested. Let us consider again the illustration in Figure 1, wherein 
technology domain A recombines knowledge from both technology domain B 
and technology domain C, within a given time interval t. In this situation, our
proposed ecological perspective implies that A’s growth rate will be to some ex- 
tent dependent on the growth rate of both B and C, because the faster B and C 
develop, the more abundant will be the pool of new technological knowledge 
that can potentially spill over to A. By the same token, the more sluggish are B’s 
and C’s growth rates, the smaller the pool of potentially symbiotic technological 
developments that A can benefit from. While we expect A to be symbiotically 
interdependent with both B and C, however, we do not expect the strength of 
such interdependencies to be equal. Rather, because B is for A a more important 
source of knowledge recombination than is C, our contention is that A’s growth 
will be more heavily affected by the rate of development of B than by that of C.
Generalizing these arguments, our first hypothesis can be expressed as follows. 
H1: The growth rate of any technology domain depends on
the growth rates of the technology domains from which it
recombines knowledge; furthermore, the more important a
technology domain is as a source of knowledge recombination for
the focal technology domain, the stronger its impact on the
focal domain’s growth rate.
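As a rough illustration of what "importance as a source of knowledge recombination" could mean numerically, the sketch below turns the recombination counts from the Figure 1 illustration into shares. The function name `source_shares` is hypothetical, and excluding the focal domain's own knowledge base from the shares is an assumption made here for the illustration.

```python
# Recombination counts for focal domain A, taken from the Figure 1 illustration:
# A drew on its own knowledge base 25 times, on B 30 times, and on C 20 times.
recombinations_into_a = {"A": 25, "B": 30, "C": 20}


def source_shares(counts, focal):
    """Share of the focal domain's cross-domain recombinations taken from each source.

    ASSUMPTION: recombinations from the focal domain's own knowledge base are
    left out, so that shares describe dependence on *other* domains only.
    """
    external = {src: n for src, n in counts.items() if src != focal}
    total = sum(external.values())
    return {src: n / total for src, n in external.items()}


shares = source_shares(recombinations_into_a, "A")
print(shares)  # {'B': 0.6, 'C': 0.4}
```

Under these assumptions B accounts for a larger share than C, which is the sense in which H1 expects B's growth to weigh more heavily on A's growth.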


Competitive Interdependencies Among Technology Domains 


Unlike most economic resources, technological knowledge is a non-rival good, i.e., 
it can be used infinitely many times without depleting (Arrow 1962; Romer 1986). 
Given that the input to new knowledge is existing knowledge, it is therefore often 
assumed that the process of knowledge recombination is immune from the eco- 
logical pressures of competition. Consistent with this view, for example, economist 
Martin Weitzman proposed in an influential pair of papers that knowledge growth 
be represented as a combinatorial process wherein new ideas can be unbound- 
edly recombined from existing ideas (Weitzman 1996, 1998). Indeed, this view 
has had a substantial impact on both economic theory, leading to the research 
program often labeled New Growth Theory (see Jones 2005 for a critical review), 
and on growth and innovation policies worldwide. To the best of our knowledge, 
however, the assumption that the knowledge recombination process is immune 
from competitive interdependencies has never been tested empirically. Rather, 
in a study at the level of individual patents, Podolny and Stuart (1995) showed 
that whether a patented invention has an impact or, on the contrary, becomes
an inconsequential dead end depends to a considerable extent on the crowding of the
niche of source ideas that the invention recombined. Therefore, the authors concluded
that there is competition among technological ideas. 

Similarly, we suspect that there may be competitive interdependencies among
technology domains, and that these may stem from two sources.
First, as noted above, a new idea springs from the novel
recombination of existing knowledge. Therefore, even if knowledge inputs could 
in principle be recombined infinitely many times without depleting, one should 
expect the value of each recombination to be directly proportional to its novelty. 
To the extent that technology domains draw from the same source ideas, it is likely 
that they reduce the novelty of each other's recombinations. Second, ultimately all 
inventions are the product of boundedly rational human beings, whether operat- 
ing as lone inventors or, more typically, within an organization. Hence, technology 
domains may be in competition with each other for the limited attention of inven- 
tors (Podolny and Stuart 1995) and, relatedly, for the research and development 
investments made by the organizations they work for. 

But what determines the structure and the intensity of competitive interde- 
pendencies among technology domains? Elaborating on a well-established notion 
in sociology (Podolny et al. 1996; Popielarz and Neal 2007), we propose that 
whether technology domains compete with each other depends on the degree to
which they recombine knowledge from the same niche of source ideas. Our argument
is as follows. The more two or more technology domains draw from a similar
niche of source ideas, the more their knowledge recombinations, and therefore
their newly developed ideas, are likely to be similar. As a consequence, the average
portion of truly novel knowledge that each new recombination yields will decrease.
Furthermore, as similarity increases and novelty decreases, both investors and
inventors will tend to choose one technology and not a similar other, or they will
look for more fruitful opportunities in less crowded regions of the technological
landscape. This argument leads to our second hypothesis.


H2: The greater the niche overlap of a technology domain, the
lower its growth rate.


To disambiguate our concepts and hypotheses, in the next section we will 
develop a pair of network-analytic measures capturing symbiotic and competitive 
interdependencies among technology domains. 


Formalizing the Argument 


To formally model the structure of symbiotic and competitive interdependencies 
among technology domains, our starting point is what Carnabuci and Bruggeman 
(2009:612) define as a technology domain's recombinant niche, i.e., “the sub-network
comprising a focal domain, the domains from which it recombines ideas (the source 
domains), the valued and directed ties linking the focal domain to its source do- 
mains, and the valued and directed ties linking source domains among each other.” 


Modeling Symbiotic Interdependencies 


With regard to symbiotic interdependencies, we have argued that the more a tech- 
nology domain is dependent on another technology domain as a source of knowl- 
edge recombination, the more the growth of the latter will affect the growth of the 
former. To formalize and, subsequently, test this idea, we follow a consolidated 
approach in the social network literature and use a “network autocorrelation” model 
(Leenders 2002). Network autocorrelation models capture the extent to which the 
outcome variable of a focal node in a network varies as a function of the outcome 
variables of its direct network contacts. In this article, we use an autocorrelation 
model to capture the degree to which the growth rate of a focal technology domain 
is impacted by the growth rates of the technology domains in its recombinant niche. 
Following Doreian cum suis (2004), we use the following model: 

$$ s_{it} = \sum_{j \neq i} w_{ijt}\, y_{jt} \qquad (1) $$

where $y_{jt}$ indicates the growth rate of each technology domain $j$ in $i$’s recombinant
niche during $t$; $w_{ijt}$ is the impact, or weight, that each technology domain $j$ is
expected to exert on the growth rate of $i$; and $s_{it}$ is the overall impact exerted on
the growth rate of domain $i$ by the domains in its recombinant niche. Because our
theoretical argument is that $w_{ijt}$ varies with the relative importance of each $j$ as a
source of knowledge recombination for $i$, we further assume that the larger the
proportion of knowledge that $i$ recombines from $j$, the higher is $w_{ijt}$.

Let us now show how the model works by comparing two hypothetical technology
domains, $i$ and $k$. Of the knowledge that technology domain $i$ recombines
from other technology domains during $t$, suppose that 30 percent comes from
domain $j_1$, 20 percent from domain $j_2$, 40 percent from domain $j_3$, and 10 percent
from domain $j_4$. Also assume that during $t$, $j_1$ grows by 3 percent, $j_2$ grows by 7
percent, $j_3$ grows by 5 percent, and $j_4$ decreases by 9 percent, relative to $t-1$. Now
consider technology domain $k$. Of the knowledge that technology domain $k$ recombines
from other technology domains during $t$, let us assume that 10 percent
comes from domain $j_5$, 70 percent from domain $j_6$, and 20 percent from domain
$j_7$. Let us further assume that relative to $t-1$, $j_5$ decreases by 5 percent, while $j_6$ and
$j_7$ grow by 30 percent and 15 percent, respectively. Under these conditions, our
argument and model imply that:

$$ s_{it} = .3 \times .03 + .2 \times .07 + .4 \times .05 - .1 \times .09 = .009 + .014 + .02 - .009 = .034 $$
$$ s_{kt} = -.1 \times .05 + .7 \times .30 + .2 \times .15 = -.005 + .21 + .03 = .235 $$

Hence in this example, our expectation is that during $t$ the growth of both $k$ and
$i$ will be accelerated by the symbiotic interdependencies they entertain with other
technology domains; however, this acceleration should be almost seven times as
fast for $k$ as it is for $i$.
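The arithmetic of this worked example can be checked with a few lines of Python. This is only a sketch of the weighted sum described in the text: the function name `symbiotic_impact` and the source-domain labels (`j1` to `j7`) are placeholders chosen here, and the shares and growth rates are the hypothetical values given in the example.

```python
def symbiotic_impact(weights, growth_rates):
    """Weighted sum of source-domain growth rates for one focal domain."""
    return sum(weight * growth_rates[source] for source, weight in weights.items())


# Focal domain i: shares of recombined knowledge and growth of each source.
weights_i = {"j1": 0.30, "j2": 0.20, "j3": 0.40, "j4": 0.10}
growth_i = {"j1": 0.03, "j2": 0.07, "j3": 0.05, "j4": -0.09}

# Focal domain k (source labels are placeholders for three other domains).
weights_k = {"j5": 0.10, "j6": 0.70, "j7": 0.20}
growth_k = {"j5": -0.05, "j6": 0.30, "j7": 0.15}

s_i = symbiotic_impact(weights_i, growth_i)
s_k = symbiotic_impact(weights_k, growth_k)

print(round(s_i, 3), round(s_k, 3))  # 0.034 0.235
print(round(s_k / s_i, 1))           # 6.9
```

The ratio of roughly 6.9 corresponds to the "almost seven times" stated in the text.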


Modeling Competitive Interdependencies

With regard to competitive interdependencies, our argument is that a focal technology
domain is subjected to competitive pressure by other technology domains
insofar as its recombinant niche overlaps with the recombinant niches of other
technology domains. Accordingly, we define the dyadic niche overlap $\alpha_{ikt}$ of a focal
domain $i$ by another domain $k$ as the proportion of $i$’s recombinations that $i$ and $k$
carry out from the same source technology domains within the same time interval:


$$ \alpha_{ikt} = \frac{\sum_{j} \min\left(|{}_{j}H_{it}|,\; |{}_{j}H_{kt}|\right)}{\sum_{j} |{}_{j}H_{it}| + \sum_{r} |{}_{r}H_{it}|} \qquad (2) $$

where $i \neq k$ and $r \neq j$, $t$ is a time interval, and $H$ stands for the set of recombinations
carried out by the domain in the right-hand subscript, from ideas
belonging to the domain in the left-hand subscript.
Figure 2 presents an example. During interval $t$, domain $i$ recombined 100
times ideas belonging to domain $j_1$, 50 times ideas from $j_2$, and 60 times ideas
from $j_3$. Meanwhile, domain $k$ recombined 20 times ideas from $j_1$, 90 times ideas
from $j_2$, and 75 times ideas from $j_4$. Thus, $\alpha_{ikt} = (20+50)/210 = .33$.

This network-analytic measure of dyadic niche overlap is continuous; it is
normalized within the interval [0, 1]; and it takes into account units’ size differences
and is therefore asymmetric. Thus, it meets the three conditions necessary
for a correct operationalization of dyadic niche overlap (Sohn 2001). The overall
overlap that $i$’s niche is subjected to, $A_{it}$, can be simply calculated as the sum of
its dyadic niche overlaps,

$$ A_{it} = \sum_{k \neq i} \alpha_{ikt} \qquad (3) $$

This overall measure is similar to Podolny et al.’s (1996:666) niche crowding
measure, but for the α-terms we do not just have to deal with “plain” niche overlaps,
because in our case niches consist of pair-wise disjoint subsets. To relate a
domain’s competitive interdependencies to its growth rate, we then simply take
the difference in the overall niche overlap of a technology domain between subsequent
time periods.
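The overlap measures can be sketched in Python using the counts from the Figure 2 example. Two points are assumptions made here, not statements from the text: the per-source rule (take the smaller of the two domains' counts for each shared source) is inferred from the worked result (20+50)/210, and the third source of domain k is treated as a domain that i does not draw on (labeled `j4`). The function names are hypothetical.

```python
def dyadic_overlap(focal, other):
    """Overlap of the focal domain's niche by another domain's niche.

    ASSUMPTION: for every source both domains draw on, the smaller of the two
    recombination counts is taken; the sum is divided by the focal domain's
    total number of recombinations, which makes the measure asymmetric.
    """
    shared = sum(min(count, other[source])
                 for source, count in focal.items() if source in other)
    return shared / sum(focal.values())


def overall_overlap(focal_name, niches):
    """Sum of the dyadic overlaps exerted on the focal domain by all other domains."""
    return sum(dyadic_overlap(niches[focal_name], niche)
               for name, niche in niches.items() if name != focal_name)


# Recombination counts per source domain (source labels are placeholders).
niches = {
    "i": {"j1": 100, "j2": 50, "j3": 60},
    "k": {"j1": 20, "j2": 90, "j4": 75},
}

alpha_ik = dyadic_overlap(niches["i"], niches["k"])  # (20 + 50) / 210
alpha_ki = dyadic_overlap(niches["k"], niches["i"])  # (20 + 50) / 185
print(round(alpha_ik, 2), round(alpha_ki, 2))        # 0.33 0.38
print(round(overall_overlap("i", niches), 2))        # 0.33
```

Because the denominator is the focal domain's own total, the overlap of i by k differs from the overlap of k by i, which is the asymmetry the text refers to.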


Empirical Data


To test our hypotheses, we use the same dataset used by Carnabuci and
Bruggeman in their study (2009), which guarantees comparability of the results
and facilitates knowledge accumulation. The data, which were collected by
the National Bureau of Economic Research and are freely and publicly accessible,
comprise all patents (over two million) granted by the United States Patent and
Trademark Office between 1975 and 1999, as well as all citations (over 16 million)
among them. Extensive descriptive statistics and methodological discussions
concerning these data are available in Hall et al. (2001). Following Carnabuci
and Bruggeman, we use these data to indicate (1) the network nodes, i.e., the
technology domains; (2) their directed and valued ties, i.e., the knowledge recombination
patterns within and among technology domains; and (3) the growth rate
of each technology domain.


Network Nodes 


The USPTO has developed an articulated system of classification (the United 
States Patent Classification, henceforth USPC), so that every patent can be identi- 
fied by the technical content of its subject matter. In 1999, the last observation 
year, there were 418 domains of technological knowledge, or primary classes, and 
over 120,000 subclasses. Following Carnabuci and Bruggeman (2009), we used 
the USPC primary classes to carry out a homomorphic mapping of the acyclic 
network of individual patents and patent citations onto a cyclic network of tech- 
nology domains and cross-domain citations. The choice of indicating technology 
domains through USPC primary classes is generally regarded as methodologically
appropriate, particularly because primary classes partition patented knowledge
into non-overlapping technology domains, they are more reliable and robust
than other partitions, and retrospective patent reassignments are extremely rare
(Henderson et al. 2005). Accordingly, USPC primary patent classes have been
widely used to indicate domains of technological knowledge, providing indirect
validation for this measure (see, among many others, Jaffe et al. 1993). In addition,
the choice of USPC primary classes to study technological progress is theoretically
grounded. Technological progress is an inherently public process, which manifests
itself whenever new knowledge accumulates within the boundaries of what comes
to be institutionalized as distinct technology domains. USPC primary classes
provide a straightforward unit of analysis to study these domains and the accumulation
process they undergo because, compared to higher or lower classifications,
at this level of observation the boundaries of technology domains are both more
neatly circumscribed from a strictly technical viewpoint and more salient from a
socio-cognitive and institutional one (for an extensive discussion of these issues,
see Carnabuci and Bruggeman 2009).

Figure 2. Niche Overlap of Two Hypothetical Technology Domains, i and k


Network Ties 


To model the valued and directed ties between technology domains, i.e., the 
cross-domain knowledge recombination patterns, we relate the USPC patent 
classes by patent citations. Patent citations can be obtained from the so-called 
prior art, a list of all previously patented inventions a focal patent draws from. By 
signalling which bits of disclosed knowledge a patented invention draws from 
(Jaffe et al. 1993), patent citations are indicative of the knowledge recombination
underlying the newly generated invention. Since Griliches’ seminal work (1979), 
dozens of econometric studies have exploited this property of patent citations to 
investigate knowledge recombination patterns (e.g., Jaffe 1986). In addition to 
indirect evidence, two validation studies (Jaffe et al. 1998, 2000) concluded that 
patent citations are a valid measure of knowledge recombination.


Domains’ Growth Rate 


The more a patented invention consists of knowledge that is useful for the gen- 
eration of new inventions, the more subsequent patents will cite it. Therefore, a 
widely established method to measure how much a patent contributed to the 
advancement of technological knowledge is counting the number of citations it 
received, also called forward citations (e.g., Griliches 1990). Indicators of knowl- 
edge output based on forward citations have received empirical validation from 
all studies known to us. Besides indirect evidence, their validity was confirmed by 
three validation studies (Albert et al. 1991; Jaffe et al. 2000; Albert et al. 2002). To 
measure the knowledge output of a technology domain within a given interval, we
count the total number of patents generated in that technology domain, weighted
by the number of citations each of these patents received, within that time interval.
Then, to calculate the growth rate of a technology domain (our dependent variable),
we take the percentage growth in that domain’s knowledge output
over subsequent time windows. This brings us to the problem of choosing the
length of time intervals, which we discuss next. 
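A minimal sketch of the growth-rate calculation follows. The text does not spell out the exact weighting formula, so the scheme used here (each patent counts once, plus once for every forward citation it received) is an assumption, as are the function names and the made-up citation counts.

```python
def knowledge_output(forward_citations):
    """Citation-weighted patent count for one domain in one time window.

    ASSUMPTION: each patent contributes 1 plus its number of forward
    citations; the source only says patents are weighted by citations.
    """
    return sum(1 + citations for citations in forward_citations)


def growth_rate(previous_output, current_output):
    """Growth in knowledge output between subsequent windows, as a fraction."""
    return (current_output - previous_output) / previous_output


# Hypothetical forward-citation counts, one entry per patent in the domain.
window_1 = [0, 2, 5]     # 3 patents + 7 citations  -> output 10
window_2 = [1, 4, 4, 3]  # 4 patents + 12 citations -> output 16

rate = growth_rate(knowledge_output(window_1), knowledge_output(window_2))
print(rate)  # 0.6
```

Under this assumed weighting, the hypothetical domain's knowledge output grows by 60 percent from one window to the next.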


Network Evolution 


Based on the abovementioned modeling specifications, we constructed our knowl- 
edge recombination network. To capture the evolution of this network, we mod- 
eled each network cross-section based on all citations between patents granted 
within the same time window. For the length of the time window, we chose five 
years, following a common practice (e.g., Podolny et al. 1996). While the choice 
of five years is somewhat arbitrary, Carnabuci and Bruggeman (2009) devised a 
number of tests that show that it is an appropriate way to model the evolution 
of our knowledge recombination network. Further, to model the network cross- 
sections from entirely non-overlapping data, we reduced the number of intervals 
to five non-overlapping networks. As a consequence, the evolving network of 
patent citations between technology domains in the years from 1975 to 1999 is 
modeled as a time series of five subsequent network cross-cuts. 
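The windowing step can be sketched as follows. The function names are hypothetical, and representing each citation as a pair of grant years is a simplification made here for illustration; the only facts taken from the text are the 1975 to 1999 observation period, the five-year window length, and the restriction to citations between patents granted within the same window.

```python
START_YEAR, END_YEAR, WINDOW_LENGTH = 1975, 1999, 5


def window_index(grant_year):
    """Zero-based index of the five-year window that contains grant_year."""
    if not START_YEAR <= grant_year <= END_YEAR:
        raise ValueError(f"grant year {grant_year} is outside the observation period")
    return (grant_year - START_YEAR) // WINDOW_LENGTH


def same_window_citations(citations):
    """Keep (citing_year, cited_year) pairs whose patents fall in the same window."""
    return [(citing, cited) for citing, cited in citations
            if window_index(citing) == window_index(cited)]


windows = [(START_YEAR + n * WINDOW_LENGTH, START_YEAR + (n + 1) * WINDOW_LENGTH - 1)
           for n in range((END_YEAR - START_YEAR + 1) // WINDOW_LENGTH)]
print(windows)
# [(1975, 1979), (1980, 1984), (1985, 1989), (1990, 1994), (1995, 1999)]

print(same_window_citations([(1978, 1976), (1983, 1979), (1999, 1995)]))
# [(1978, 1976), (1999, 1995)]
```

The middle pair is dropped because the citing and cited patents fall in different windows.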


The Empirical Measurement of Domains’ Symbiotic Interdependencies


While the computation of domains’ niche overlap is straightforward, two qualifications
must be pointed out concerning the empirical measurement of domains’
symbiotic interdependencies. First, the growth rate of each source domain $j$ is
(marginally) affected by the citations $j$ receives from $i$. To eliminate this problem,
we subtracted all the citations $i$ makes to $j$ prior to calculating $y_{jt}$ in Equation 1.
Second, as we anticipated, a decision had to be made concerning which time lag
should be used between $y_{jt}$ and $w_{ijt}$. That is, to measure the relative importance of
each technology domain $j$ as a source of knowledge recombination for $i$ during $t$,
when exactly should we calculate the proportion of patent citations $i$ makes to $j$?
As an upper bound, one could choose to measure $y_{jt}$ and $w_{ijt}$ contemporaneously,
i.e., within the same time interval. But to avoid potential problems of endogeneity,
one could also argue that $w_{ijt}$ should be lagged with respect to $y_{jt}$. Because sorting
out this issue on purely theoretical grounds appears impossible, we measured $w_{ijt}$
with six different lags, ranging from zero to five years before $y_{jt}$. We were glad to
find that the results of our estimates of interest remained qualitatively unchanged.
Indeed, both the effect size and the precision of our estimates improved with
longer lags, which we interpret as a strong indication that the results do not reflect
endogenous dynamics. In the statistical models reported here, $w_{ijt}$ was lagged by
five years with respect to $y_{jt}$.


Analysis 
Statistical Models 


To facilitate knowledge accumulation and guarantee comparability across studies, 
our sample and statistical models reproduce exactly Carnabuci and Bruggeman’s 
(2009). Accordingly, we accounted for possible sources of unobserved heterogeneity
using both fixed-effects and random-effects estimation (Hsiao 2003). We
used a network disturbance model to remove possible interdependencies in
the error term (Leenders 2002). We included the full set of control variables used by
Carnabuci and Bruggeman in their study. Namely, we included a set of time dum- 
mies that control for possible time-varying factors that could affect the growth of 
all technology domains in a similar way (e.g., macro-economic fluctuations, or 
changes in USPTO practices). We included a set of dummies to capture aggregate 
effects associated with the macro technological areas to which our technology do- 
mains belong (i.e., chemicals, mechanical, computer and communication, electric 
and electronic, pharmaceutical and drugs, and other). These dummies correspond 
to Hall et al.’s (2001) highest level classification of the USPTO patent classes, 
which has been developed by the authors as an integral part of the NBER Patent 
and Patent Citations Data Set. As Hall et al. (2001) showed, the validity of these 
six macro technological areas is reflected in a rich variety of patent and patent 
citations statistics, including aggregate tendencies to receive and make patent 
citations, the number of claims made per patent, and the degree of “generality” of 
the inventions patented. By now, Hall et al.’s classification is accepted as a standard 
proxy for broad technological areas in patent-based studies. 

Because the vast majority of technological inventions are made by firms, we 
controlled for the (log of the) number of firms active in each technology domain 
during each time interval. It has been argued that the share of backward patent
citations firms make to their own patents indicates their ability to appropriate the 
benefits of their own inventions, which may have an impact on firms’ knowledge- 
related investments and strategies (Hall et al. 2001). To ensure that these dynamics 
do not alter our estimates of interest, we computed for each technology domain 
during each time interval the average ratio of firm self-citations. Carnabuci and 
Bruggeman (2009) showed that technology domains tend to grow faster as they 
specialize in an increasingly homogeneous set of input ideas. However, the more 
specialized they become the slower their future growth rate; and the beneficial 
effect of increasing specialization decreases with the degree of specialization 
reached by a technology domain. In our study, we control for all three effects. As 
Carnabuci and Bruggeman point out, their measure of specialization does not 
take into account whether the degree of specialization of a technology domain 
results from a set of similarly specialized inventions or, rather, from a combina- 
tion of highly specialized and highly brokering inventions. To account for these 
patent-level differences, the authors calculate for each technology domain during 
each time interval the coefficient of variation in patents’ “originality,” where the 
originality index is taken from Hall et al. (2001). This same variable is included 
as a control in our models. Lastly, we added a variable controlling for the total
R&D dollars spent in each technology domain during each time interval. This
control is important because technology domains are likely to grow faster if they 
experience more active government interventions or if they receive more R&D 
investments from firms. Because our control includes both private and public 
R&D investments, we account for these possible effects. 

To make sure our statistical models are estimated on exactly the same sample as 
Carnabuci and Bruggeman, we removed the same 33 observations they removed 
from their study. Six of these are outliers. Twenty-three are technology domains 
that received no citations at all during a given time interval, which had to be 
removed because the concept of recombinant niche is meaningless in those cases. 
The last four dropped observations pertain to technology domain “miscellaneous.”
As a result of these choices, our sample comprises 1,639 observations out of 1,672,
like Carnabuci and Bruggeman’s.


Results 


Before discussing the results of our econometric models, let us briefly look at 
the evolution of our explanatory and response variables. Interestingly, our data 
show that the extent of both competitive and symbiotic interdependencies among 
technology domains has grown substantially over the observation period, with an 
average yearly increase of roughly 1.5 percent and 3 percent, respectively. This gen- 
eralized increase in ecological interdependencies seems consistent with the view, 
common among historians and theorists of technical change, that technological 
knowledge has become increasingly general and abstract over the years (Arora 
and Gambardella 1994), resulting in more pervasive knowledge recombinations 
across once far removed technology domains (Carnabuci, forthcoming). As the 
network of knowledge recombinations became more and more widespread, so did 
the ecological interdependencies that result from those recombinations. 

When one breaks down the data by macro area of technology, however, it also 
becomes evident that two areas, “computer and communications” and “drugs and 
medicals,” behaved somewhat differently from the rest of the technological land- 
scape. In particular, these are the only two areas wherein competition has been 
consistently declining over the observation period, and where symbiotic 
interdependencies grew most rapidly. This is interesting because, as shown by Figure 5 and 
consistent with received wisdom, “computer and communications” and “drugs and 
medicals” underwent spectacular progress during the observation period. These 
aggregate trends thus provide prima facie evidence in favour of our hypotheses. 

[Note to the regression table: Random-effects models based on Maximum Likelihood estimation. T-ratios in parentheses.] 

Table 1 reports means, standard deviations, and correlations among the variables 
used in our statistical models. There appear to be no especially high correlations 
among the independent variables, as is confirmed by unproblematic 
multicollinearity statistics (the average VIF on the pooled data is 2.24). Notice 
that competition and symbiosis have a near-zero first-order correlation, suggesting 
that the competitive pressure to which technology domains are subjected 
is independent of their symbiotic pressure, and vice versa. Thus, a technology 
domain may at the same time be subjected to a high degree of competition and 
symbiosis or, by the same token, to a lack of both. This is in line with our theory 
and empirical constructs, which assume that symbiosis and competition are not 
endpoints of a conceptual continuum but, rather, orthogonal concepts capturing 
distinct ecological forces. 
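
The multicollinearity check mentioned above can be illustrated with a minimal sketch. It computes a variance inflation factor (VIF) for each predictor as 1 / (1 - R²), where R² comes from regressing that predictor on all the others. The function name and the simulated data are assumptions made for illustration only; the article does not report the software or the code used for its diagnostics.

```python
import numpy as np


def variance_inflation_factors(X):
    """Return one VIF per column of X (rows are observations)."""
    X = np.asarray(X, dtype=float)
    n, k = X.shape
    vifs = []
    for j in range(k):
        target = X[:, j]
        # Regress column j on an intercept plus all remaining columns.
        others = np.delete(X, j, axis=1)
        design = np.column_stack([np.ones(n), others])
        coef, *_ = np.linalg.lstsq(design, target, rcond=None)
        residuals = target - design @ coef
        ss_res = residuals @ residuals
        ss_tot = ((target - target.mean()) ** 2).sum()
        r_squared = 1.0 - ss_res / ss_tot
        vifs.append(1.0 / (1.0 - r_squared))
    return np.array(vifs)


# Hypothetical data: two independent predictors and one near-collinear one.
rng = np.random.default_rng(42)
x1 = rng.normal(size=500)
x2 = rng.normal(size=500)
x3 = x1 + x2 + 0.1 * rng.normal(size=500)

vif_independent = variance_inflation_factors(np.column_stack([x1, x2]))
vif_collinear = variance_inflation_factors(np.column_stack([x1, x2, x3]))
print(vif_independent.mean(), vif_collinear.mean())
```

In this simulated example the independent predictors yield VIFs close to 1, while adding the near-collinear column inflates them sharply. An average VIF of 2.24, as reported for the pooled data, sits much nearer the first case.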

Table 2 displays the results of our statistical tests. Model 1 and Model 3 are 
baseline models based on fixed-effects and random-effects estimations, respectively. 
These models are identical to Carnabuci and Bruggeman’s full models (2009). Model 
2 and Model 4 add our two variables of interest (symbiosis and competition) to, 
respectively, Model 1 and Model 3. As noted, our response variable is a domain’s growth 
rate, that is, the domain’s percentage growth between subsequent time intervals. 
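
As a minimal sketch of the response variable, the percentage growth between subsequent time intervals can be computed as follows. The function name and the example sizes are hypothetical, and how the size of a domain is measured in each interval is defined elsewhere in the article, not here.

```python
def percentage_growth(sizes):
    """Percentage growth between subsequent intervals.

    sizes[t] is the (positive) size of one domain in interval t;
    the result has one entry fewer than the input.
    """
    return [100.0 * (curr - prev) / prev for prev, curr in zip(sizes, sizes[1:])]


# Hypothetical domain sizes over four time intervals.
growth = percentage_growth([200, 220, 275, 264])
print(growth)
```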

Consistent with received wisdom, models 1 and 3 show that the growth rates 
of technology domains tend to increase over time (albeit non-linearly). Indeed, 
economists have used this observation to substantiate their claim that unlike 
any other resource, the process of knowledge recombination yields increasing 
returns to scale (Mokyr 2002). When the effects of symbiosis and competition are 
controlled for, however, the trend changes quite dramatically. Based on analyses 
not reported in the paper, we observed that this change in the estimates of the 
time dummies is entirely due to the introduction in the model of the variable 
“symbiosis.” We regard this finding as very interesting, as it suggests that the root 
cause of increasing returns to scale in the knowledge recombination process lies 
in the symbiotic interactions occurring among technology domains. 

As we anticipated, models 3 and 4 indicate that the technological areas of 
“computers and communications” and “drugs and medicals” have been growing 
fastest. By contrast, the “chemical” sector has been growing significantly more 
slowly than the reference category “others” over the observation period. Although 
the results are less stark in the case of “electric and electronic” and “mechanical,” 
these sectors also appear to have lagged behind. The number of firms technologically 
active in a domain has no effect on domains’ growth rates, except in Model 
2. Similarly, there seems to be at most a marginal effect associated with the co- 
presence of highly specialized and highly “original” inventions in a technology 
domain. The three variables at the core of Carnabuci and Bruggeman’s (2009) 
study, conversely, display the effects that were predicted by the authors. Namely, 
technology domains tend to grow significantly faster during periods in which 
their degree of specialization increases. However, the positive effect of increasing 
specialization is reversed for high levels of specialization and indeed, the higher 
the degree of specialization reached by a technology domain, the lower on aver- 
age its future growth rate. Importantly, these results remain unchanged when the 
effects of symbiosis and competition among technology domains are controlled 
for, strengthening Carnabuci and Bruggeman’s original test. 

Figure 3. Average Degree of Competitive Interdependence Among Technology Domains 

Figure 4. Average Degree of Symbiotic Interdependence Among Technology Domains 

Figure 5. Average Growth Rate 

The last two rows of Model 2 and Model 4 report the estimates of our variables 
of interest. As predicted by Hypothesis 1, there exist symbiotic interdependencies 
among technology domains, so that the growth rate of any technology domain 
depends to a relevant extent on the growth rates of the technology domains it 
draws knowledge from. This effect is highly significant from a statistical point of 
view, with a t-ratio as high as 18.69 in the fixed-effects model and 23.92 in the 
random-effects one. In addition to a high degree of statistical significance, our 
estimates indicate that the role of symbiotic interdependencies among technology 
domains has a remarkable effect size. Based on our fixed-effects estimates, a one 
standard deviation difference in the amount of symbiotic pressure received by a 
focal technology domain can be expected to engender as much as a 29 percent 
increase in the domain’s growth rate. This means that holding everything else 
constant, the growth rate of a technology domain embedded in a high-growth position 
in the knowledge recombination network (top decile) is on average more than 
three times as large as the average growth rate of one embedded in a low-growth 
position (bottom decile). 

Our Hypothesis 2 predicted that technology domains are in competition with 
each other to the extent that they tap knowledge inputs from overlapping recom- 
binant niches. Based both on the fixed effects (Model 2) and the random effects 
(Model 4), the results of our analysis offer statistical support to this prediction. 
Hence, the more the overall niche overlap of a technology domain increases (de- 
creases) over time, the more the domain’s growth rate tends to decrease (increase). 
Namely, a one standard deviation increase in the overall niche overlap of a technology 
domain results on average in a 4 percent decrease in its growth rate, according 
to our fixed-effects estimate. 


Discussion and Conclusions 


In this article, we offered and documented a novel explanation for an important 
and widely acknowledged empirical phenomenon, i.e., that technology domains 
grow at different rates. Challenging the commonly held assumption that technol- 
ogy domains grow independently of one another, we argued and showed that 
technology domains are ecologically interdependent. We identified two kinds of 
ecological pressure that significantly impact the growth of technological knowl- 
edge across technology domains. First, we showed that because knowledge grows 
by recombination, and recombinant patterns cut across the boundaries of technology 
domains, symbiotic interdependencies are pervasive among technology 
domains. Accordingly, the rate of growth of one technology domain tends to 
be accelerated by the advances made in other technology domains, to the extent 
that the latter are a habitual source of knowledge recombination for the former. 
Second, we showed that technology domains are in competition with each other 
insofar as they tap knowledge from overlapping recombinant niches. As a result, 
the rate at which a given technology domain advances varies inversely with the 
competitive pressure it receives from overlapping technology domains. Unlike 
conventional accounts of technological progress, this article showed that, because 
of competition and mutualism, technological progress is an inherently ecological 
process wherein the growth rate of each technology domain depends on dynamics 
occurring in other technology domains. 

We think that the article makes four main contributions. First, it extends the 
line of inquiry initiated by Carnabuci and Bruggeman’s study of the growth of 
technology domains (2009). Employing a characteristically sociological toolkit to 
analyze the growth of public technological knowledge, Carnabuci and Bruggeman 
proposed that the process of technological knowledge growth is best conceived 
as an evolving network of knowledge recombination patterns among technology 
domains. Through this perspective, the authors were able to show that the growth 
of technology domains depends on intertwined dynamics of knowledge special- 
ization and knowledge brokerage. In the present article, we built on Carnabuci 
and Bruggeman’s intuition and model to study how technology domains “grow 
together” (Girifalco 1991), thereby offering a distinctly ecological interpretation 
of their network conception of technological progress. As we are confident that 
the space of research opportunities opened up by the network perspective on 
technological progress is still ample, we regard the possibility of further knowledge 
accumulation along this line of inquiry as both promising and desirable. Indeed, 
both the mathematical formalization provided by network methods and the avail- 
ability of the NBER public data set ought to render the pursuit of knowledge 
accumulation easier in this case than in many other areas of sociological research. 

Second, thus far most scholars have been persuaded that because knowledge is 
a non-rival resource, the knowledge recombination process is unboundedly self- 
reinforcing; hence, the rate of knowledge generation is increasing in scale. In a pair of 
influential articles, for example, Weitzman (1996, 1998) argued that given sufficient 
R&D investments, the stock of technological knowledge will increase combinatorially. 
We qualified Weitzman’s work by explicating a mechanism, competition 
among technology domains, that acts as an endogenous constraint in the knowledge 
recombination process. Indeed, Weitzman (1998:354, italics added) alluded 
to the possible existence of competition in knowledge recombination by remarking 
that “[i]f there were a finite number of potential ideas, then one researcher might 
create negative externalities for other researchers through a process akin to overfish- 
ing in a crowded pond.” Our study substantiated this speculative remark, showing 
that the negative externalities of competition are an inherent and non-negligible 
characteristic of the knowledge recombination process. Besides being an important 
theoretical qualification per se, accounting for the role of knowledge competition 
suggests that the productivity of knowledge recombination may decline even in the 
presence of increasing R&D resources, a conclusion that helps to make sense of “... 
the well-known ‘puzzle’ of the large fall in the ratio of US patents to US research 
inputs in the post-war period” (Caballero and Jaffe 1993:50). 

To be sure, we agree with Weitzman and the proponents of so-called New 
Growth Theory (Jones 2005) that the process of knowledge recombination can 
and often does generate symbiotic externalities and increasing returns to scale. 
However, rather than assuming a priori an unconstrained combinatorial knowl- 
edge growth across the board, our proposed ecological perspective indicates that 
the extent of symbiotic pressure varies among technology domains depending on 
their position in the knowledge recombination network. In this respect, a third 
contribution of our study is that it provides a useful middle point between two 
well-established, yet starkly contrasting, visions of technological progress. On the 
one hand, as noted by Harberger (1998), economists have proposed a “yeast” vi- 
sion of technological progress, wherein the positive externalities of technological 
advances are implicitly assumed to spread out homogeneously throughout the 
whole technological landscape. Indeed, the very concept of technology domain is 
irrelevant according to this vision, and technological progress may be aptly modelled 
à la Weitzman. On the other hand, the evolutionary view of technological 
change (Dosi 1982, Nelson and Winter 1982) has maintained that each techno- 
logical domain has highly path-dependent and idiosyncratic dynamics, implying 
that the growth trajectories of technology domains are largely independent of 
each other. Our ecological perspective indicates that technological knowledge 
grows neither as “yeast,” nor as a collection of independent technology domains. 
Rather, the growth of technology domains is driven by an evolving network of 
symbiotic interdependencies whose structure is determined by the actual pattern 
of knowledge recombination taking place among them. 

A fourth contribution of this article is that it offers a novel perspective on the 
role of technology policy.⁸ Reflecting the traditional view that technology domains 
grow independently of one another, scholars have noted that in order to favour the 
development of specific technology domains, it is imperative to increase the R&D 
investments therein. Implied in this argument is the precept that policy makers 
should aim to counteract “market failures” by funnelling R&D resources towards 
socially desirable technologies, either directly through public R&D spending or 
by providing a suitable incentive structure to private investors (Tassey 1997). A 
notable example occurred in the nineties, when the U.S. government proposed 
a multi-billion budget for government-funded R&D and tax credits to foster 
the progress of renewable-energy technologies. While it is obvious that R&D 
spending influences the rate and direction of technological progress, our analyses 
showed that the ecological dynamics of competition and symbiosis exert a strong 
additional effect. Rather than merely increasing the level of R&D investment in 
a given technology domain, we therefore suggest that policy makers should use 
their resources to take better advantage of the growth dynamics occurring in other 
technology domains. Namely, our results indicate that to boost the progress of a 
given technology domain, R&D resources should be allocated in such a way that 
the competitive interdependencies to which it is subjected are minimized, while 
its symbiotic ones are maximized. For example, government interventions may 
favour cross-domain projects that involve fast-growing technology domains, 
or that reduce dependence on sluggish and crowded ones. 

In concluding, we would like to acknowledge that our study suffers from 
at least two noteworthy limitations. First, our analyses are based exclusively on 
patent-based indicators. Whether and to what extent competitive and symbiotic 
interactions occur in non-patented technological knowledge is a question that 
ought to be addressed. Second, we took a markedly macroscopic perspective. By 
so doing, we abstracted away from knowledge growth mechanisms taking place at 
lower levels of analysis, particularly the business firm. To be sure, extant research 
indicates that the ecological dynamics of knowledge competition (Podolny and 
Stuart 1995) and symbiosis (Operti and Carnabuci 2008) do affect firms’ ability 
to generate new technologies. However, we do not yet know how these ecological 
dynamics affect the growth rate of technology domains. For example, there is evi- 
dence showing that the more a firm competes for knowledge with other firms, the 
less it tends to grow (Podolny et al. 1996); but how the growth rate of technology 
domains is affected by different degrees of inter-firm knowledge competition is 
an open question and, we believe, a potentially fruitful avenue of future research. 


Notes 


1. The present article extends Carnabuci and Bruggeman’s (2009) network- 
based conception of technological progress by examining the role of ecological 
interdependencies among technology domains. Unlike previous analytical 
approaches, a distinguishing aspect of Carnabuci and Bruggeman’s network model 
of technological progress is that it assumes that technology domains are dependent 
on each other as sources of knowledge recombination. In their article, however, 
Carnabuci and Bruggeman did not explore, either theoretically or empirically, the 
ecological implications of this assumption. Rather, they examined how the growth 
of technology domains varies as they become more or less specialized. By contrast, 
we build on Carnabuci and Bruggeman’s intuition and network model to derive 
and test an ecological theory of technological progress. Namely, we argue that net 
of the specialization effects found by Carnabuci and Bruggeman and other relevant 
domain-specific factors, the growth of each technology domain depends on dynamics 
occurring in other technology domains. 


2. Symbiotic interdependence is here defined as a relationship between two entities, in our 
case technology domains, wherein at least one entity derives some kind of fit benefit 
from the other, while the fit of the latter is not reduced. Two aspects of this definition 
deserve to be mentioned. First, symbiotic interdependencies can, but do not need to 
be, symmetric. Thus, the degree of symbiotic dependence of technology domain i on 
technology domain j may be different from the degree of symbiotic dependence of 
j on i. And indeed, i may be symbiotically dependent on j even if j is not symbiotically 
dependent on i. Notice that this definition is consistent with the etymology of the 
compound term interdependence (i.e., dependence between), which denotes a dependence 
relationship between distinct entities, but not necessarily a symmetric one. Second, our 
definition of symbiotic interdependence excludes predatory and parasitic relations, 
wherein the fit benefit accruing to one entity corresponds to a fit deficit in the other 
(e.g., the fox benefits from eating hares, but this reduces the fit of hares). 


3. For the sake of simplicity, in this example we assumed that the knowledge 
recombination patterns on which w_ij is calculated are contemporaneous to y_j (i.e., 
we assumed that there should be no time lag between w_ij and y_j). As we will discuss 
later in the article, however, this choice is not obvious, as one might argue that w_ij 
should be lagged with respect to y_j. Because the choice of any specific time lag would 
be somewhat arbitrary, for the empirical test of our argument we will use several 
operationalizations of w_ij, with time lags ranging from zero to five years. 


4. For an illustration of asymmetry, the niche overlap of i on k in Figure 1 is a_ik = (20 
+ 50) / 185 = .378. As can be seen, because the total number of recombinations by k 
is smaller than i’s, the overlap of i on k is larger than the overlap of k on i. 
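
The arithmetic in this note can be reproduced with a short sketch. The shared counts (20 and 50) and the total of 185 come from the note itself; the total of 300 used for the reverse direction is a hypothetical value, since Figure 1 is not reproduced here, and the function name is illustrative. The sketch also assumes that the shared recombinations are the same in both directions, as the note's reasoning about totals suggests.

```python
def niche_overlap(shared_counts, total_recombinations):
    """Share of a domain's recombinations that fall in the overlapping niche."""
    if total_recombinations <= 0:
        raise ValueError("total_recombinations must be positive")
    return sum(shared_counts) / total_recombinations


shared = [20, 50]                  # overlapping recombinations, from the note
a_ik = niche_overlap(shared, 185)  # total taken from the note
a_ki = niche_overlap(shared, 300)  # 300 is a hypothetical, larger total

print(f"a_ik = {a_ik:.3f}")  # a_ik = 0.378
print(a_ik > a_ki)           # True: the smaller total gives the larger overlap
```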


5. Alcácer and Gittelman (2006) have shown that because many patent citations are 
inserted by the USPTO patent examiners after the patent has been applied for, if 
one uses patent citations to infer which prior inventions an inventor consciously 
built on to generate his/her own invention, one is likely to incur both type I and 
type II errors. However if one is interested, as we are, in mapping how knowledge is 
recombined across the technological landscape, the additional citations inserted by 
patent examiners provide a valuable integration (Sorenson and Singh 2007). 


6. Network disturbance models use information on the network structure of the data 
to account for a possible lack of independence in the error terms of the regression 
equation. This kind of model assumes that there are patches of interrelated cases 
that fit the prediction of the statistical model better than one would expect by chance, 
patches for which the model predictions are systematically underestimated and, finally, 
patches for which the model predictions are systematically overestimated (Dow et 
al. 1984). Let y be an (n x 1) vector of values of a network autocorrelated variable 
for the n nodes of a network, let X denote an (n x k) matrix of values for the n nodes 
on k covariates and let W be an (n x n) “weight” matrix based on the network data. 
Following Carnabuci and Bruggeman (2009), we modelled the weight matrix W by 
row-normalizing our domain-to-domain patent citation adjacency matrix. Within 
this framework, a network disturbance model accounts for the effects of network 
autocorrelation by modeling parameter ρ in the following statistical equation: 

y = Xβ + ε,  ε = ρWε + ν,  ν ~ N(0, σ²I) 
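
To make the structure of this model concrete, the following sketch row-normalizes a small adjacency matrix and simulates data from the disturbance equation by solving (I - ρW)ε = ν. It is an illustration under stated assumptions, not the estimation procedure used in the article: the citation counts, the covariates, the coefficients and the value of ρ are all hypothetical, and the function names are illustrative.

```python
import numpy as np


def row_normalize(adjacency):
    """Divide each row by its sum; rows that sum to zero stay all zero."""
    adjacency = np.asarray(adjacency, dtype=float)
    row_sums = adjacency.sum(axis=1, keepdims=True)
    return np.divide(adjacency, row_sums,
                     out=np.zeros_like(adjacency), where=row_sums > 0)


def simulate_network_disturbance(X, beta, W, rho, sigma, rng):
    """Draw y = X beta + eps, where eps = rho * W eps + nu and nu ~ N(0, sigma^2 I)."""
    n = X.shape[0]
    nu = rng.normal(0.0, sigma, size=n)
    # eps = rho * W eps + nu  is equivalent to  (I - rho * W) eps = nu.
    eps = np.linalg.solve(np.eye(n) - rho * W, nu)
    y = X @ beta + eps
    return y, eps, nu


rng = np.random.default_rng(0)

# Hypothetical domain-to-domain citation counts for three domains.
citations = np.array([[0, 3, 1],
                      [2, 0, 2],
                      [5, 5, 0]])
W = row_normalize(citations)

X = np.column_stack([np.ones(3), rng.normal(size=3)])  # intercept + one covariate
beta = np.array([1.0, 0.5])                            # hypothetical coefficients
rho = 0.4                                              # hypothetical autocorrelation

y, eps, nu = simulate_network_disturbance(X, beta, W, rho, sigma=1.0, rng=rng)
print(W.sum(axis=1))  # each row of the weight matrix sums to one
```

With a row-normalized W and |ρ| < 1, the matrix I - ρW is invertible, so the disturbances are well defined.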


7. Although publicly available, these R&D data contain many missing values. For 
that reason, Carnabuci and Bruggeman (2009) ran a separate batch of models with 
this additional control, which they did not report in the paper. We did the same. 
Notwithstanding the great reduction in sample size, our results remained qualitatively 
unchanged with regard to our variables of interest, when the R&D control was 
included in the model. 


8. We are grateful to an anonymous reviewer for pointing out that our theory could be 
related to technology policy making. 
The Deviant Organization and the Bad Apple CEO: 
Ideology and Accountability in Media Coverage of 
Corporate Scandals 


Mike Owen Benediktsson, Princeton University 


What role do the media play in the identification and construction of white-collar 
crimes? Few studies have examined media coverage of corporate deviance. This study 
investigates news coverage of six large-scale accounting scandals that broke in 2001 
and 2002. Using a variety of empirical methods to analyze the 51 largest U.S. news- 
papers, the study tests several explanations for tendencies to run more or less coverage 
of the scandals in question. The study then examines the substantive focus of coverage. 
First, the results suggest that scandal coverage was influenced by the political ideology 
of newspapers, as opposed to economic interests or social structural ties between firms. 
Second, the analysis shows that the adjudication of individual crimes and 
the punishment of individual offenders received the bulk of media attention. 


In 2002, a wave of accounting scandals shook the American business commu- 
nity. In print and broadcast media, the conduct of large corporations and highly 
paid executives came under a glaring public spotlight. Eventually, accounting 
malfeasance faded from national headlines, but the moment begs to be revisited 
analytically. The scandals provoked broad speculation concerning the nature of 
corporate crime as pundits debated the merits of a “bad apple theory” emphasizing 
individual culpability rather than endemic corruption (Washington Post 2004). 
For contemporary observers of the crisis, there was more at stake in this debate than 
the scope of actual wrongdoing: the scandals highlighted the role of the news media 
in identifying and defining cases of corporate deviance. President George W. Bush, 
while addressing a Wall Street audience, argued that “[t]he business pages of American 
newspapers should not read like a scandal sheet,” and suggested that news reports 
neglected the overall health of the economy, focusing disproportionately on the work 
of a few isolated individuals (The New York Times 2002). In contrast, an Op-Ed in 
The New York Times blamed the scandals on a “virus” of “infectious greed” which 
had long been ignored by news media steeped in the same “blue-chip money culture” 
as the offenders (Andersen 2002). At play in these comments are two very different 
views of the media’s treatment of corporate deviance. In the first formulation, the 
media appear as external “watchdogs” who may err on the side of overzealousness and 
sensationalism in their corporate crime reporting. In the second model, the media’s 
insider status with regard to corporate culture renders them insufficiently watchful. 

Journalists help to construct public understandings of the economy and of 
crime. For this reason, their relationship to corporate elites and their representations 
of deviance are of scholarly concern. News organizations are private businesses 
staffed by upper- and middle-class professionals with opinions on politics 
and economics (Gans 1979; Lichter and Rothman 1986). Increasingly, these 
organizations are owned by large corporations (Bagdikian 2000). Realizing the po- 
tential for any or all of these factors to influence news coverage, a multidisciplinary 
body of research has investigated economic coverage. Do the characteristics of 
media corporations affect the way their newspapers cover other corporations? And 
do other aspects of news outlets, for example their geographic location or political 
culture, affect their treatment of local business elites? 

The present study analyzes the likelihood that any given newspaper will cover 
a scandal and seeks insights into the forms of deviance (for example, individual or 
organizational) that are more likely to make it onto the printed page. Building on 
these parallel lines of inquiry, we analyze the coverage of six accounting scandals 
that broke in 2001 through 2003. The objectives of the analysis are twofold: first, 
to test possible explanations for the quantity of scandal coverage produced by any 
given newspaper; and second, to provide a descriptive account of the news stories 
that were published. Specifically: 


1. During a moment of reputational crisis for corporate 
elites, what are the organizational and ideological factors 
that lead any given newspaper to produce more or less 
coverage of a given white-collar crime? In other words: Who 
covered the scandals? 


2. To what degree did news reports emphasize the 
acts of individual criminals vs. the organizational deviance 
of large corporations? And to what degree did news reports 
emphasize the investigation vs. the adjudication of crimes? 
In other words: What did they cover? 


The series of accounting scandals that began to surface in late 2001 and early 
2002 offers an excellent context in which to seek answers to these questions. Largely 
absent from existing research has been an approach that compares the coverage 
produced by media organizations during a time of corporate crisis. Gilens and 
Hertzman (2000:371) offer some guidance here, arguing that researchers should 
select a news story that: “(1) [H]as received substantial coverage, (2) has clear 
implications for the financial interests of corporate media owners, and (3) has 
different implications for the interests of different corporate owners.” 
Large-scale corporate accounting scandals fulfill all three criteria. The scandals 
sparked a broad, multi-faceted crisis for business elites. As news events, they pro- 
voked hundreds of thousands of articles in U.S. newspapers. Although an isolated 
incident of corporate malfeasance might have hurt investor confidence in the short 
run, the rash of large-scale accounting scandals threatened not simply the short-term 
profitability of the stock market, but the very reputation of corporate elites. In the 
years after the first scandals broke, polls showed a persistent decline in the public 
trust and respect given executives at large corporations, who were surpassed in their 
perceived untrustworthiness only by politicians and car dealers (Blake 2003; Swanson 
2003). As profound shocks to investor confidence, the scandals were devastating 
to advertisers and the stock market as a whole, and thus had clear implications for 
media corporations (Browning 2002). Finally, the accounting scandals did not affect 
all corporations equally. Because the scandals involved misreporting of profits to 
investors and regulatory agencies, large, publicly traded corporations bore the brunt 
of both federal regulatory attempts (Schroeder 2002) and loss of public confidence 
(Blake 2003; Swanson 2003), as well as other financial repercussions of the economic 
downturn (Weisul 2002). If ever there were a moment in which the interests of 
corporate media could be expected to visibly shape coverage, this would be it. 


Making Crime Salient: Who Covered the Scandals? 


A variety of empirical and theoretical arguments in media scholarship converge on 
the claim that newspapers owned by large, publicly traded corporations produce 
favorable coverage of business elites. One argument holds that newspapers owned 
by large corporations are embedded in a web of economic interests that impinge 
upon journalistic objectivity through the exertion of managerial pressure (Herman 
and Chomsky 1988; Price 2003). Empirically, the effects of ownership on media 
content have been examined in several case studies focusing on cross-promotion 
of commercial products (Weis and Burke 1986; Williams 2002a). Other studies 
have examined coverage of attempts to regulate the media, a scenario that may 
pose a clear conflict of interest for large news organizations (Gilens and Hertzman 
2000; Pratte and Whiting 1986; Snider and Page 1997). The majority of these case 
studies have found that the production of media content can indeed be overtly 
subjected to the economic interests of its producers, a potential that increases in 
significance as media companies become larger and their interests more widely 
dispersed (Bagdikian 2000). On the other hand, an intriguing study of popular 
music blacklisting by media conglomerates found no evidence that large corpora- 
tions were willing to restrict content out of self-interest, a finding that conflicts 
with the perspective outlined above (Rossman 2004). 

Media sociologists have also found that the profit motives of media companies 
may filter down into newsrooms through deeply entrenched professional norms 
(Soloski 1989) or systems of “social control” (Breed 1955). Other researchers have 
argued that a pro-business ideological stance is broadly institutionalized in conventions 
of journalistic practice (Croteau 1998; Gans 1979; Reese 1990). In its 
most extreme form, this line of argument holds that reporters working for large, 
publicly traded media companies are rendered uncritical by the culture of the 
community they inhabit, exhibiting a brand of journalistic “professionalism” that 
exempts the entire business community from investigative scrutiny (McChesney 
2004). The cross-sectional analysis utilized in this study is not (nor should it claim 
to be) capable of parsing the micro-level journalistic processes through which par- 
ent companies’ interests might conceivably influence news coverage. It provides, 
however, a necessary complement to case studies and newsroom ethnographies, 
testing patterns consistent with their claims. Building on this body of research, 
the first two hypotheses are clear: 


Hypothesis 1: Newspapers owned by large parent 
companies will be less likely to provide extensive coverage 
of prominent accounting scandals. 


Hypothesis 2: Newspapers owned by publicly traded parent 
companies will be less likely to provide extensive coverage 
of prominent accounting scandals. 


A second perspective holds that shared class interests are reflected in social struc- 
tural ties between media professionals and corporate elites. Journalists and media 
professionals, it is claimed, share membership in formal and informal organiza- 
tions, live in the same neighborhoods, and know the same people (Dreier 1982; 
Dreier and Weinberg 1979). These social ties are by no means limited to observ- 
able institutional links. Still, interlocking directorates, describing connections 
between companies through shared board members, have largely been considered 
an adequate proxy for the ties that could connect two corporations. An important 
characteristic of interlocking directorates is that, as Mizruchi (1996:273) observes, 
they “need not be the result of conscious decisions by a corporation’s management 
to link the two firms in question.” Instead, they serve a wide range of purposes for 
the corporations and the individuals involved, including “collusion, cooptation 
and monitoring, legitimacy, career advancement and social cohesion.” 

To the extent that interlocking directorates are imperfect proxies for relevant 
social ties between corporations and media professionals, prior research suggests 
that geographical proximity captures additional dimensions of interest (Mizruchi 
1989; Mizruchi and Koenig 1991). As Palmer and his colleagues (1986:786) put 
it: “In the United States, geographic locale is an important basis upon which the 
capitalist class is internally differentiated.” Business elites residing in the same 
region have shared interests, as well as shared friends, associates and organiza- 
tional affiliations. Furthermore, as Molotch (1976) has observed, media outlets 
have a rational incentive to protect local businesses: the commercial viability of 
metropolitan newspapers is wedded to the economic growth of the city and the 
corporations that sustain it. 

Among newspaper companies, any tendency to protect local firms and their 
executives should be all the more salient, considering a professional mandate to do 
the opposite. An economic disruption on the scale of a major accounting scandal 
at a local corporation, whether or not layoffs accompany it, can be presumed to 
have broad local consequences, and as such, should generate substantial news 
coverage from a local newspaper. Moreover, local events are more highly valued 
by journalists, more readily responded to, and easier to cover—factors which have 
led social conflict researchers and media scholars to acknowledge geographical 
proximity as a principal component of media “sensitivity” to potential news events 
(Snyder and Kelly 1977; Shoemaker and Reese 1996). Using interlocking direc- 
torates and geographical proximity as empirical proxies for social ties and shared 
affiliations between journalists and corporate elites, a third and fourth hypothesis 
can be derived: 


Hypothesis 3: Newspapers owned by a parent company 
that shares an interlock with a company experiencing an 
accounting scandal will be less likely to provide extensive 
coverage of that scandal. 


Hypothesis 4: Newspapers geographically proximate to 
the corporate headquarters of a company experiencing an 
accounting scandal will be less likely to provide extensive 
coverage of that scandal. 


A third perspective suggests that news coverage of business bears the mark of 
journalists’ political ideology. In recent years, arguing that the media's political 
allegiances lie on one “wing” or another has become a cottage industry of sorts, 
surfacing repeatedly in political rhetoric and trade books (e.g., Coulter 2002; 
Alterman 2004). The academic standard bearers of the “liberal media” perspective, 
S. Robert Lichter and Stanley Rothman, have argued that editors and reporters 
represent a privileged, highly-educated cross-section of society that holds hostile 
views toward both corporate and political elites, and that these views are echoed 
in antagonistic reporting on the actions of large corporations (Lichter et al. 1986; 
Rothman and Black 2001). Surveys for the most part corroborate Lichter et al.'s 
account of journalists’ left-of-center political leanings (Weaver 1996; Weaver and 
Wilhoit 1986; Hess 1992). On the other hand, any conservative bias on the part of 
media managers or owners may have a more fundamental effect upon coverage if 
it plays into personnel decisions concerning liberal journalists (Klinenberg 2007). 

Attempts to systematically measure ideological bias have proven difficult, yield- 
ing mixed results (Niven 2003; Groseclose and Milyo 2005). One way to sidestep 
this measurement problem is to focus on correlations between a newspaper's 
political endorsements and patterns of presumably objective news coverage. The 
endorsements of an editorial staff offer an objective indicator of a newspaper's 
espoused political ideology. Any relationship between political endorsements and 
news coverage should be deemed especially noteworthy considering the common 
practice of claiming a rigid separation between the objective decisions of the news- 
room and the subjective stances taken on the editorial page. The sacrifice implicit 
in this methodological move is that it takes us away from “journalistic ideology” 
writ large, refocusing the lens on the political culture of individual newspapers. 
Reciprocating the findings of Lichter et al. in order to stay consistent with the 
other hypotheses, a fifth hypothesis can be laid out: 




Hypothesis 5: Newspapers that endorsed the Republican 
candidate in recent presidential campaigns will be less likely 
to cover incidents of prominent accounting scandals. 


Defining Malfeasance: What Did the Newspapers Cover? 


Hypotheses 6 and 7 concern the substantive focus of coverage rather than the 
question of who produces it. A consistent finding of white collar crime research 
is that legal institutions tend to be blind to the severity of crimes committed by 
organizational actors. Malfeasance by governmental agencies or corporations 
(“juridical individuals”) encounters lenience in the courtroom, while prosecu- 
tors reserve their harshest treatment for “natural individuals,” elite or otherwise 
(Coleman 1987; Shapiro 1990; Griffin 2002). A number of studies have pursued 
this tendency in the media, investigating the construction of individualistic vs. 
structural accounts of white-collar crime. A consistent finding of these studies is 
that newspapers focus on individual acts of wrongdoing rather than a pattern of 
organizational negligence, even when an organization is arguably more at fault 
than any one individual (Goff 2001; Lynch et al. 2000; Evans and Lundman 
1983; Burns and Orrick 2002; Wright et al. 1995). Media scholars have made 
parallel observations, arguing that journalists construct ordinary street crimes 
as discrete episodes rather than a larger “theme” (Iyengar 1991), emphasizing 
infractions by deviant individuals and highlighting the legitimacy of the process 
through which they are brought to justice (Ericson et al. 1991; Fishman 1980; 
Graber 1980; Tuchman 1978). Regarding this latter point, however, studies 
of white-collar crime coverage have observed a different tendency, in which 
newspapers were far less likely to run stories on the adjudicative repercussions 
of corporate crimes (e.g., legal penalization, environmental cleanup initiatives) 
than the initial allegations (Burns and Orrick 2002; Wright et al. 1995; Molotch 
and Lester 1975). From the literature on media coverage of white collar crime, 
we can extract two hypotheses regarding the substantive focus of accounting 
scandal stories: 


Hypothesis 6: The majority of coverage will focus on aspects 
of individual rather than organizational wrongdoing within 
any given scandal. 


Hypothesis 7: The majority of coverage will focus on the 
investigation of crimes, rather than their adjudication and 
penalization, within any given scandal. 


Data and Analysis 


In order to examine the “gatekeeping” process through which journalists deem 
certain events newsworthy and deserving of coverage (Zelizer 2004), the present 
study uses a counting technique to measure the amount of news coverage devoted 
to accounting scandals at six large companies: Adelphia, Bristol-Myers Squibb, 
HealthSouth, Rite Aid, Waste Management and Xerox. The cases do not represent 
a random sample of accounting scandals. Instead, they were purposefully chosen 
to offer variation on a number of dimensions, including the amount of overall 
coverage generated, the legal and regulatory consequences, industry and location 
of corporate headquarters. Practicality was also a consideration in sample selection. 
More prominent scandals, at the Enron and WorldCom corporations for example, 
generated far too many news stories to code and analyze in a multi-scandal re- 
search design. Other scandals were less prominent, failing to generate an advisable 
quantity of coverage for cross-sectional analysis across a sample of newspapers. The 
six cases examined here represent a sample of mid-level scandals compared to other 
concurrent cases, while still offering considerable variation in amount of coverage. 
Table 1 shows selected relevant characteristics of the six scandals, complementing 
the narrative summaries presented in Appendix A. 

For each of the above scandals, all news articles published in the 51 largest 
circulating U.S. newspapers during a seven-year period (from Jan. 1, 1999 to 
Jan. 1, 2006) were drawn from a sampling frame provided by three separate news 
databases: Factiva, NewsBank and ProQuest. Because the intent was to analyze 
the selection process through which apparently objective information enters the 
public sphere, editorials and opinion pieces were excluded from the analysis, iso- 
lating the amount of news coverage devoted to the scandals by any given newspaper. 
The resulting sample of articles (N = 1,938) represents all news articles at least 100 
words in length devoted to any one of the above scandals and appearing during 
the aforementioned time span. The time span was chosen to include both the 
earliest public indication of wrongdoing and the resolution of all major lawsuits 
pertaining to any of the scandals. 


Independent Variables 


To identify independent variables, data were gathered on the characteristics of 
the newspapers and media parent companies represented in the sample. Parent 
company size was measured using companies’ self-declared revenue, and was 
drawn from Standard & Poor's Directory of Corporations and Executives (2002). 


[Table 1: Selected Characteristics of Accounting Scandals. Column heading: Corporate Penalties.] 


The location of corporate headquarters and the 
board information used to identify interlocking 
directorates were drawn from this same source, 
as well as annual reports (2002) and SEC filings 
on public record (2002). Publicly owned par- 
ent companies were identified using Hoover's 
Company Profiles and SEC filings. A variable 
measuring the editorial staff's espoused political 
ideology was created by averaging a dummy vari- 
able for each newspaper's presidential endorse- 
ments in the 2000 and 2004 general elections. 
Newspapers that endorsed the Republican can- 
didate in 2000 and the Democratic candidate 
in 2004 received a score of .5 on this indicator. 
In the case of three newspapers, USA Today, the 
Los Angeles Times and the Orange County Register, 
the editorial staffs have a policy of not endorsing 
presidential candidates. In the case of the Los 
Angeles Times, however, it was possible to impute 
a value based on the newspaper's endorsements 
for lower offices (e.g., governor or senator) dur- 
ing the period of analysis. The Orange County 
Register does not formally endorse candidates 
because the editorial board has an explicit com- 
mitment dating back to the newspaper’s found- 
ing to honor free-market principles, which are 
thought to trump political allegiances. This 
ideological commitment, however, clearly 
positions the newspaper close to a politically 
conservative viewpoint on economic issues, so 
the author was confident in imputing a score of 1. In 
contrast, the editorial board of USA Today has 
an overt commitment to political neutrality 
dating back to the paper’s founding (Barringer 
2000). Contemporary editors acknowledge 
that the policy is a function of necessity: USA 
Today's national focus is manifested in a more 
ideologically and demographically diverse editorial 
board than may be found in regional newspapers, 
resulting in a lack of political consensus (Porter 2004). For these reasons, 
a score of .5 was imputed on this indicator. 
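
The construction of the endorsement indicator can be illustrated with a short sketch. It is a minimal illustration under stated assumptions, not the procedure actually used for the study: the function name `endorsement_score` and the "R"/"D" encoding are hypothetical, and the imputed values simply restate those reported in the text.

```python
from statistics import mean

def endorsement_score(endorsements):
    """Average a Republican-endorsement dummy (1 = Republican, 0 = Democratic)
    over the elections supplied, e.g. {2000: "R", 2004: "D"}."""
    dummies = [1 if party == "R" else 0 for party in endorsements.values()]
    return mean(dummies)

# Hypothetical newspapers illustrating the three scores that can be observed.
solid_republican = endorsement_score({2000: "R", 2004: "R"})  # both Republican
split_endorser = endorsement_score({2000: "R", 2004: "D"})    # one of each
solid_democratic = endorsement_score({2000: "D", 2004: "D"})  # both Democratic

# Scores imputed in the text for papers without presidential endorsements.
# The value imputed for the Los Angeles Times is not stated, so it is omitted.
imputed = {"Orange County Register": 1.0, "USA Today": 0.5}
```

Because the indicator is an average of two dummies, it can take only the values 0, .5 and 1, which is why a one-unit difference corresponds to the contrast between solidly Democratic and solidly Republican newspapers.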


Control Variables 


In addition to the seven theoretically derived hypotheses, a null hypothesis can 
be derived from reasonable assumptions of a market-oriented approach by jour- 
nalists (McManus 1994; Hamilton 2004). On the assumption that newspapers 
serving relatively high-income readerships in large, urban markets would be more 
likely to provide business coverage (see Hong et al. 2004), the analysis includes 
control variables for the population of the city in which a newspaper publishes 
and the mean income of the newspaper's readership. The population of the city in 
which the newspaper publishes was drawn from data published by the Population 
Estimates Program of the U.S. Census Bureau for July 1, 2006. For USA Today, 
which is published outside of Washington DC, but circulates nationally, the 
sample mean was imputed, which may result in some bias if the newspaper's 
readership is concentrated in relatively large or relatively small cities. The average 
income levels of readers were gathered from the Audit Bureau of Circulation and 
the Newspaper Association of America, both non-profit organizations that collect 
survey data for the use of potential advertisers. Finally, a variable was created to 
measure each newspaper's “news hole,” the amount of physical space available for 
any given story at any given time (Jones et al. 1959). News holes are understood to 
vary by newspaper, depending on a newspaper's format, printing and distribution 
costs. When newspaper data is being gathered electronically, a control variable can 
be created by averaging the total number of articles returned by the database for 
any daily issue of a given newspaper. If newspapers differ in the amount of content 
they make available to databases, this technique may result in a less-than-perfect 
measure of the news hole in any given print edition. Nevertheless, it is expected 
to capture broad differences in the resources and formats of individual newspapers. 
The variable was created by averaging the number of articles appearing in the 
newspaper on three separate days: two of these days straddled the analysis period 
and the third fell squarely in the middle; the three days were chosen to fall on 
different days of the week and during different times of year. 
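
The news hole measure described above amounts to a simple average. The sketch below assumes hypothetical article counts and dates for a single newspaper; the function name `news_hole` is introduced here for illustration only.

```python
from statistics import mean

def news_hole(article_counts_by_day):
    """Average the number of articles a database returns for a newspaper
    across the sampled issue dates."""
    return mean(article_counts_by_day.values())

# Hypothetical counts for one newspaper on three sampled days
# (two near the edges of the analysis period, one in the middle).
sample_counts = {"1999-01-04": 210, "2002-07-17": 240, "2005-12-30": 180}
hole = news_hole(sample_counts)

# Table 3 reports the news hole in units of 10 articles.
hole_in_tens = hole / 10
```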


Analysis 


First, an event history dataset was constructed that includes the publication date 
for each article as well as all independent variables. A Cox proportional hazard 
model for recurring events was fit, specifying the newspaper as the subject id 
and using robust standard errors to account for the interdependence of repeated 
“failures” (articles published) by any given newspaper. The Cox model is an 
appropriate tool for this analysis because it does not force the analyst to specify a 
shape for the hazard curve over time, and thus controls for the erratic temporal 
sequence of events in the accounting scandals. When using a proportional hazard 
[...] time, produced mixed results. For this reason, the hypotheses were tested 
using two other methods: by conducting an event history analysis of each scandal 
individually and by collapsing the event history dataset in order to use Poisson 
and negative binomial regression models to analyze counts of articles published 
by each newspaper on all six scandals. The latter method was replicated for the 
entire period of analysis as well as piecewise by week, month and year. The results 
of all three methods were remarkably consistent, exhibiting coefficients that were 
comparable (when made commensurate) and highlighting the same variables 
when conventional tests of significance were applied. 
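
The step of collapsing the event history dataset into article counts can be sketched as follows. The records, newspaper names and the `collapse_counts` helper are hypothetical; the sketch shows only the aggregation, not the Poisson or negative binomial models fit to the resulting counts.

```python
from collections import Counter
from datetime import date

# Hypothetical event-history records: one row per published article.
articles = [
    ("Paper A", "Adelphia", date(2002, 7, 25)),
    ("Paper A", "Adelphia", date(2002, 7, 26)),
    ("Paper A", "Xerox", date(2002, 6, 29)),
    ("Paper B", "Adelphia", date(2004, 7, 9)),
]

def collapse_counts(rows, period="all"):
    """Collapse article-level records into counts per newspaper,
    optionally split by calendar year or month."""
    counts = Counter()
    for paper, _scandal, published in rows:
        if period == "year":
            key = (paper, published.year)
        elif period == "month":
            key = (paper, published.year, published.month)
        else:
            key = (paper,)
        counts[key] += 1
    return counts

total_counts = collapse_counts(articles)                  # whole analysis period
yearly_counts = collapse_counts(articles, period="year")  # piecewise by year
```

In a piecewise count model, newspaper-periods with no articles would also need explicit zero rows; the sketch leaves that step out.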

The second stage of analysis describes the focus of the overall coverage of the 
scandals, seeking to answer the question: “What did they cover?” Articles in the 
sample were coded based on whether the primary news item represented an al- 
legation of individual or organizational wrongdoing, isolating, in other words, 
which kind of actor (individual or organizational) constituted the object of the 
allegation, lawsuit, legal settlement, etc., that directly prompted the news story. 
To be clear, this meant coding the event, or “news item” that induced the coverage, 
rather than the gist or tone of coverage. A news article prompted by a settlement 
agreement between a corporation and the S.E.C., for example, would have scored 
a zero (organizational actor). A news article precipitated by the testimony of an 
employee in the trial of a corporate officer would have scored a one (individual 
actor). Articles were also coded based on whether the primary news item focused 
on activities related to the investigation or the adjudication of a given crime. For 
example, both of the examples above would have been coded as adjudicatory 
events. An article responding to the announcement of an S.E.C. probe on the 
other hand would have been coded as investigation-related. In the overwhelming 
majority of stories, it was clear whether the focus of the article was the organiza- 
tion in question or an individual who was currently or formerly employed by the 
organization. In five of the six scandals, the investigation/adjudication variable 
divided the data cleanly and chronologically, indicating an early investigation 
stage and a subsequent adjudication stage that generally began with either the 
first major indictment of a corporate officer or the announcement of the first 
financial settlement in an ongoing SEC investigation. The relatively small subset 
of articles (61) that could not be confidently categorized according to this schema 
was excluded from this stage of the analysis. 
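
The two-way coding scheme can be expressed as a small lookup. This is a hedged sketch: the event labels and the `code_article` helper are hypothetical and do not reproduce the actual codebook, and the actor code assigned to the S.E.C. probe example is an assumption, since the text specifies only its stage.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class NewsItemCode:
    individual_actor: int  # 1 = individual actor, 0 = organizational actor
    stage: str             # "investigation" or "adjudication"

# Hypothetical lookup built from the three examples given in the text.
CODEBOOK = {
    "sec_settlement_with_corporation": NewsItemCode(0, "adjudication"),
    "employee_testimony_at_officer_trial": NewsItemCode(1, "adjudication"),
    # Actor code assumed organizational here; the text gives only the stage.
    "sec_probe_of_corporation_announced": NewsItemCode(0, "investigation"),
}

def code_article(prompting_event):
    """Return the code for the event that prompted a story, or None when the
    event cannot be confidently categorized (such articles were excluded)."""
    return CODEBOOK.get(prompting_event)
```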


Results 
Who Covered the Scandals? 


The results of the first stage of analysis are presented in Table 3. Unsurprisingly, 
large urban newspapers with the resources to publish a large number of stories on 
a daily basis were more likely to cover any given corporate scandal. However, a 
market-model interpretation of coverage fails to be completely supported: high- 
income readership is insignificant as a predictor. Similarly, the analysis does not 
offer support for the hypothesis that newspapers owned by large, publicly owned 
parent companies were less likely to cover the accounting scandals. Proximity to 
scandal headquarters has a significant, positive effect on the amount of coverage, 
supporting the intuitive null hypothesis that newspapers cover events affecting the 
local economy and workforce. On the other hand, interlocking directorates are 
insignificant as a predictor, suggesting that direct ties between scandal companies 
and newspaper parent companies did not affect coverage in this case. Finally, by 
far the most notable finding presented here is that editorial political endorsements 
proved to be a strong determinant of scandal coverage, suggesting that the political 
ideology of a newspaper's editorial staff was associated with decisions to cover or 
not to cover large-scale corporate scandals. Interpretation of the hazard ratio sug- 
gests that solidly Republican newspapers were approximately half as likely as their 
solidly Democratic counterparts to run an article on the scandals at any given time. 
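
The arithmetic behind this interpretation can be made explicit. The sketch below uses an assumed round hazard ratio of 0.5, chosen only to match the phrase “approximately half as likely”; it is not the estimate reported in Table 3, and the function name `relative_hazard` is hypothetical.

```python
import math

def relative_hazard(hazard_ratio_per_unit, delta):
    """Relative risk implied by a Cox hazard ratio for a change of `delta`
    units in the covariate: exp(beta * delta), with beta = ln(hazard ratio)."""
    beta = math.log(hazard_ratio_per_unit)
    return math.exp(beta * delta)

ILLUSTRATIVE_HR = 0.5  # assumed round value, not the estimate from Table 3

# Endorsement score of 1 (solidly Republican) vs. 0 (solidly Democratic).
solid_rep_vs_solid_dem = relative_hazard(ILLUSTRATIVE_HR, 1.0)

# Endorsement score of .5 (split endorsements) vs. 0.
split_vs_solid_dem = relative_hazard(ILLUSTRATIVE_HR, 0.5)
```

Under this assumed value, a newspaper with split endorsements would fall between the two extremes, at roughly 0.71 times the hazard of a solidly Democratic newspaper.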


What Did They Cover? 


Coding the news stories for content revealed that coverage overwhelmingly 
emphasized the adjudication of individual crimes. Corrupt corporate officers and 
deviant organizations received a relatively similar amount of coverage during the 
investigation stage, as 10 percent (194) of the articles focused on the investigation 
of organizational wrongdoing vs. 15 percent (295) devoted to probes and allega- 
tions targeting individuals. In the adjudication stage, however, the bulk of cover- 
age dramatically shifted toward individual wrongdoing. As a result, 67 percent 
(1,286) of the total sample of articles covered the adjudication and penalization 
of individual crimes. The overall focus on individual offenders is not surprising, 
given prior studies of white collar crime. However, in their emphasis on the lat- 
ter stages of an unfolding crime story, the news stories dramatically contradict a 
consistent finding of earlier research. In Figure 1, coverage is further broken down 
by category and scandal. Examining the distribution between scandals, it is clear 
that the media’s preference for individual wrongdoing was most pronounced in 
the case of the three most high profile scandals: Adelphia, HealthSouth and Rite 
Aid. This pattern is intriguing, suggesting a tentative refinement of the results sum- 
marized above: the media appear to key in on the endgame of particular corporate 
scandals, devoting most of their energy to following the trial and sentencing of 
individual offenders in these cases. 

At this point, it might be useful to integrate the findings regarding the newspa- 
pers more likely to cover scandals and the events they were more likely to report, 
asking, “who covered which types of events?” Attempts to answer this question using 
the data suggested that the newspapers in the sample were extremely similar in their 
judgments of relative news value. Two dependent variables were created: one for the 
ratio of stories on individual wrongdoing to stories on organizational wrongdoing 
published by any given newspaper; and one for the ratio of stories on investigation 
to stories on adjudication. When these dependent variables were regressed on the 
independent variables in the analysis using stepwise OLS regression to probe for 
differences in preference for types of news event, standard tests of significance (p < 
.05) revealed only two fairly weak findings: large newspapers and those proximate 
to scandal companies were slightly more likely to favor adjudicatory events over 
investigatory events than were small or geographically distant newspapers. 


Table 3: Effect of Newspaper Attributes on Risk of Publishing an Accounting 
Scandal Article 

Independent Variable                        Model 1      Model 2      Model 3      Model 4
News Hole (10s articles)                    [illegible]  [illegible]  [illegible]  [illegible]
                                            (9.98)       (8.99)       (4.53)       (4.76)
City Population (100,000s)                  [illegible]  [illegible]  [illegible]  [illegible]
                                            (4.04)       (3.47)       (3.24)       (2.78)
Reader Average Income (10,000s)             .88          [illegible]  [illegible]  [illegible]
                                            (-.60)       [illegible]  [illegible]  [illegible]
Publicly Traded Parent Company                           1.01         .87          1.06
                                                         (.03)        (-.40)       (.17)
Parent Company Size (Billions of Dollars)                [illegible]  [illegible]  .95
                                                         [illegible]  [illegible]  [illegible]
Republican Endorsements                                               [illegible]  [illegible]
                                                                      (-2.48)      (-2.47)
Interlocking Directorate                                                           .62
                                                                                   [illegible]
Proximity to Scandal Headquarters                                                  [illegible]
                                                                                   (2.61)

*p<.05 **p<.01 ***p<.001 (one-tailed tests for hypothesized effects, otherwise 
two-tailed) 

Notes: N = 1,983 news articles published in the 51 largest U.S. newspapers; Coefficients 
represent Cox Proportionate Hazard Ratios; Z-ratios in parentheses; Efron method used 
to handle tied failures. Hazard ratios represent the relative risk of publishing an article 
on any given day: a hazard ratio above 1 indicates that newspapers registering one unit 
higher on the independent variable were more likely to run a story; a hazard ratio below 1 
indicates that newspapers registering one unit higher were less likely to run a story. 
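
The two ratio-based dependent variables used in this supplementary regression can be computed per newspaper as in the sketch below. The story counts are hypothetical, and returning `None` for a zero denominator is an assumption made here for illustration; the text does not say how such cases were handled.

```python
def coverage_ratios(n_individual, n_organizational, n_investigation, n_adjudication):
    """Return (individual:organizational, investigation:adjudication) story
    ratios for one newspaper; None where the denominator is zero."""
    def ratio(numerator, denominator):
        return numerator / denominator if denominator else None
    return (
        ratio(n_individual, n_organizational),
        ratio(n_investigation, n_adjudication),
    )

# Hypothetical story counts for a single newspaper.
actor_ratio, stage_ratio = coverage_ratios(30, 10, 8, 32)
```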

The sample of newspapers was separated into two sub-samples: “solidly” 
Democratic newspapers (newspapers that endorsed the Democratic presidential 
candidates in 2000 and 2004); and “solidly” Republican newspapers (newspapers 
that endorsed the Republican candidate in 2000 and 2004). Figures 2, 3 and 
4 present the raw hazard curves for the three individual scandals that received 
the most coverage. Examination of the hazard curves corroborates the findings 
presented in tables 3 and 4 while adding substantive detail: the news events that 
produced the most coverage are those that involved individual wrongdoing, and 
among these, adjudicatory events such as indictments, arraignments, trial 
proceedings and sentencing hearings were particularly generative. With regard to 
political ideology, any differential appeal of different types of news events would 
appear as overlapping hazard curves (signaling a violation of the Cox model’s 
assumption of proportionate risks). On the contrary, the shape of the curves is 
quite similar for Democratic and Republican newspapers in each case, indicating 
that newspapers separated by political ideology attributed news value to the same 
types of events, even if they differed drastically in the overall likelihood of coverage. 

These results suggest that a degree of consensus may exist in editorial gate- 
keeping decisions concerning corporate scandals, an insight that shifts the focus 
away from differences in journalistic approach and toward the specific events that 
occur as a given scandal unfolds. When individual corporate officers come under 
a high level of scrutiny, it is common practice for organizational wrongdoers to 
minimize negative publicity by pruning the bad apples and avoiding any further 
news-making events, quietly settling any lawsuits aimed at the corporation (Day 
2002). As can be seen in figures 2, 3 and 4, in the case of Adelphia, HealthSouth 
and Rite Aid—the three scandals that generated the bulk of the coverage—all of the 
executives who were eventually indicted either stepped down or were dismissed 
early in the developing scandal, a separation that may have raised their salience 
as individual culprits. In contrast, the corporations involved in the three scandals 
that produced the least coverage (Waste Management, Xerox and Bristol-Myers 
Squibb) kept any corporate officers who were under suspicion within the fold, 
settling any lawsuits naming individual offenders. 

A brief comparison of the Adelphia and Waste Management scandals is instruc- 
tive. Although six Waste Management officers were eventually indicted, none of 
them went to trial, and coverage of the scandal, as can be seen in Figure 1, was 
muted, roughly divided between individual and organizational wrongdoing. There 
is a clear explanation for the limited coverage. Waste Management, well aware of 
the scrutiny that would ensue if tales of individual corruption surfaced in legal 
proceedings, settled the former executives’ fraud charges with the SEC, paying 
$17.1 million on behalf of one former executive to avoid a trial. 

In the case of Adelphia, scrutiny mounted more rapidly, forcing a different 
response. The corporation chose to cooperate with the prosecution of former 
CEOs, disclosing information that supported the prosecution's case against 
founder John Rigas. At this point, accountability in the scandal became 
zero-sum. Rigas's guilt was equivalent to Adelphia’s innocence, and any malfeasance 
that could be traced directly to the bad apple was an argument against a blight 
affecting the tree. Possibly in return for cooperation, Adelphia was spared criminal 
charges, declaring in a press release that the prosecution of Rigas “will help further 
distance Adelphia from the wrongful conduct of the Rigas family.” (Williams 
2002b) As President Ron Cooper later remarked, “[W]e're communicating that 
it’s not the company on trial, it’s the former executives who are no longer part of 
the company.” (McCarthy 2004) 


Figure 1. Accounting Scandal News Stories, by Category of News Item, 1999-2006 


In both of these cases, the corporations attempted to limit public awareness of 
organizational malfeasance. Waste Management successfully avoided trial coverage 
while Adelphia, seeing a trial as unavoidable, made a sacrificial offering of individual 
culprits to regulators and the media. With these considerations in mind, the 
journalistic preoccupation with individual white-collar criminals involves not just the 
“labelers” and the “labeled,” but a third party, the corporation. The focus of news 
coverage on the adjudication of individual wrongdoing, then, is perhaps best seen 
as the result of collaboration between the media and the corporation in question. 


Discussion 


To some degree, the present study is an exercise in falsification, notable for the 
support it does not offer for a number of prominent theories. It does not appear, 
based on the data, that large, publicly owned media companies suppress cover- 
age of corporate crime during such moments of widespread crisis. Similarly, it 
does not seem that measurable social ties between scandal companies and media 
companies affect newspaper coverage, or that newspapers protect local elites by 
limiting reporting on scandals. 

Two of the study’s findings are especially surprising with regard to prior research. 
First, the evidence marshaled here supports the contention that editors and report- 
ers do not always set aside their politics when covering business. This insight has 
theoretical weight, contradicting arguments in media sociology that attribute pri- 
macy to the economic interests of media owners, pro-business journalistic norms or 
“responsible capitalist” ideological orientations hermetically sealed against political 
sentiment. Second, the evidence strongly conflicts with earlier research on white 
collar crime, which has consistently found that the media devote more coverage to 
the investigation of a breaking scandal than to the adjudication of crimes. 


Political Bias and Accountability Contagion 


Why were politically liberal newspapers more likely to cover the scandals and 
conservative newspapers less likely? Lichter and Rothman’s work suggests a liberal 
anti-authoritarianism among media professionals, but a perusal of editorials 
published on the scandals in question supports a different interpretation. The scale 
of the crisis brought into play editors’ fundamental assumptions regarding the 
functioning of the economy, and these assumptions varied along lines of political 
ideology. Editorials published by several Democratic editorial staffs pushed for 
broader governmental regulation, highlighting the possibility of an endemically 
flawed system or culture binding corporate and political elites (Bookman 2002; 
Buffalo News 2002b). In contrast, a number of conservative newspapers formu- 
lated a neoliberal response to the scandals, emphasizing the misdeeds of individual 
offenders and pointing to the “self-correcting” capacities of markets (Columbus 
Dispatch 2002; Orange County Register 2002). Some conservative newspapers 
similarly tied this view of the economy to the prospect for governmental reform, 
cautioning against a “drastic expansion of the federal regulatory system” (New 
York Post 2002). If newspaper editors believed that economic health and transparency 
would be restored by the market (or had already been restored, as some 
argued), this may have suggested a less pressing requirement for the media to play 
a watchdog role. In contrast, a belief in the need for regulatory attempts would 
have called for continuing journalistic scrutiny of the crimes. 

Given the timing of the crisis, it is quite possible that relations between political 
and economic actors further politicized scandal coverage. George W. Bush's connec- 
tions to Enron and CEO Kenneth Lay were well documented in major American 
newspapers, and had a tangible effect on public concern over the administration’s 
ties to large corporations (CBS 2002). Ethical questions also surrounded SEC 
Chairman Harvey Pitt, who was alleged to have conducted illegal meetings with a 
Xerox CEO when the company was under investigation (Buffalo News 2002a). If 
Republican officials came under scrutiny for their ties to corporate offenders, con- 
servative commentators saw political opportunism in the Democrats’ vociferous 
response, a criticism based in part on the revelation that campaign contributions 
by several of the largest offenders either favored Democrats or were evenly split 
between the parties (Detroit Free Press 2002).8 These charges and countercharges 
broadened the scope of the “elite crisis” represented by the scandals, implicating 
political as well as economic elites and highlighting the variety of ties that bind 
corporate and political actors, which include friendships, past business associations, 
campaign contributions and membership on the same advisory boards. This contagion 
of accountability may have made political allegiances more salient in editorial 
decision making, adding partisan fuel to the fire of public outrage. 

Thus, a broad crisis of confidence in economic elites is likely to be politicized 
in two ways: first, by triggering ideological commitments to differing views of 
economic health and accountability; and second, by spreading to political actors, 
a phenomenon that, given a densely interwoven network of elites that crosses insti- 
tutional boundaries (political, economic, religious, juridical, etc.), is not likely to 
have been specific to the scandals of 2002. In both cases, the scope of the crisis was 
instrumental in the blurring of lines between economic and political realms. It is 
unlikely that an isolated scandal would have led to a broad questioning of market 
assumptions or called into doubt the appropriate relations between political and 
corporate actors. Future research might add to the theoretical contribution im- 
plicit in these insights, identifying and investigating other moments in which an 
elite crisis has proved contagious, spreading to actors in other institutional fields 
and changing the factors deemed relevant to a typically bounded area of discourse. 


Faceless Corporations and Courtroom Dramas 


Why, in contrast to prior research, did newspapers devote more coverage to the 
adjudication of corporate crimes? As is the case with the findings regarding politi- 
cal ideology, this surprising result is partially explained by the scope of the crisis. 
In 2002, as public attention to corporate malfeasance mounted, so did calls for 
public accountability. In this context, rather than using organizational resources 
to defend individual offenders (for example, by settling lawsuits out of court), 
many corporate boards severed their connections to malfeasant executives, forc- 
ing them to stand trial alone. In doing so, they redirected public attention away 
from organizational wrongdoing and toward individual wrongdoing, offering the 
media fodder for extensive trial coverage focusing on the exploits of individuals. 

By offering up corrupt CEOs when a high level of scrutiny becomes unavoid- 
able, deviant corporations play to the media’s preference for individual offenders. 
A trial of an individual white collar criminal may provide months of riveting 
legal drama, punctuated by the testimony of key witnesses and new disclosures of 
wrongdoing; a corporate press release announcing a settlement agreement provides 
less grist for the journalistic mill. By extension, accounting scandals in which the 
primary offender is, as one newspaper put it, “some faceless corporation” (Chicago 
Tribune 2002) will fail to offer a sufficiently intriguing narrative to newspaper 
editors. At a fairly early point in these scandals, both liberal and conservative 
newspapers were left to cover the lurid exploits of individual criminals, which, as 
we have seen, they did extensively. 

This finding suggests an imperative for research on white-collar crime to look 
beyond the executive boardroom, the jury box, or the pages of the business section 
for the processes that shape societal treatment of organizational deviance. The con- 
tent of media coverage appears to reflect not just independent judgments of news 
value, but the concerted attempts of corporate actors to shape public knowledge 
and opinion of corporate crimes. Public understandings of white-collar crime 
may in many cases be a product of “elective affinities” between corporate interests 
and media calculations of newsworthiness. Concretely, this insight suggests that 
future research might fruitfully examine coverage of cases in which the ability of 
organizations to deflect blame is inhibited, for example when accusations center 
upon long-held corporate practices or organizational cultures. Examples might be 
found in the private sector, such as the tobacco industry’s policy toward scientific 
evidence of smoking risk, but they might equally be found in the public sector 
or in religious organizations such as the Catholic Church, which has come under 
fire for allegedly ignoring cases of sexual abuse by clergy. Examination of media 
coverage of such cases would provide further evidence of the media’s framing of 
different forms of organizational deviance, paving the way for a fuller understand- 
ing of how the selection mechanisms employed by media gatekeepers and the 
damage control strategies undertaken by scrutinized organizations interact to 
produce public understandings of organizational wrongdoing. 


Notes 


1. See Appendix B for the list of newspapers included in the sample. Although the Wall 
Street Journal is one of the largest circulating newspapers in the United States, it is 
excluded from the present analysis. It was assumed that the sheer quantity of business 
coverage produced by the Wall Street Journal and the newspaper's unique position as 
a source of information for the business community would make the newspaper an 
outlier in cross-sectional research. 


2. Dummy variables for Factiva and Newsbank were included in the analysis in order 
to test for sampling error. Neither of these variables proved statistically significant for 
any stage of the analysis, so they were excluded from the final analysis presented here. 


3. Articles devoted to accounting scandals in general were excluded in order to accurately 
specify scandal-specific independent variables such as the existence of an interlocking 
directorate or geographical proximity. “News digest” articles which include short 
blurbs on a number of different news events were excluded for the same reason, 
and in order to remain consistent across newspapers in cases where extremely short 
individual news articles substitute for news digests, all articles shorter than 100 words 
were excluded. 


4. The author also tried accounting for interdependence of observations by including a 
control variable for the number of accounting scandal stories previously published by 
each newspaper. The results were consistent with those presented here. Because the 
robust standard errors are believed to offer a more conservative approach, the results 
of the analysis that utilized them are presented. 


5. Although visual inspection of Martingale and Schoenfeld residuals, as well as raw 
hazard curves, suggested that the proportionality assumptions of the Cox model were 
likely to have been met, a formal test of the significance of interaction terms between 
several covariates and log(time) raised concerns that the effects of several variables 
were varying over time in ways that could not be accounted for by merely including 
interaction terms for time. Because the results of the Cox analysis were corroborated 
by a piecewise analysis sensitive to temporal variation, the author feels confident in 
presenting the Cox results in spite of this test. 


6. Plotting these sub-samples has the effect of omitting newspapers that switched the 
party of their endorsement during the period of the analysis, confining the analysis 
to newspapers whose political position is relatively unambiguous. 


7. As previously mentioned, editorials were excluded from the quantitative analysis in 
order to isolate patterns of purportedly objective news coverage. However, drawing 
upon editorials for the purposes of this discussion is justified by the fact that (as the 
quantitative analysis shows) editorial political endorsements predict scandal coverage. 
Because newspapers’ political culture was not confined to the editorial page, we may 
look to the editorial page for interpretive (and admittedly speculative) insights as to 
how ideology may have influenced coverage. 


8. The author verified that the political contributions of WorldCom were evenly split 
between the parties using a search of corporate campaign contribution records on 
OpenSecrets.org. 
Appendix A. Narrative Descriptions of Scandals in Sample 


Adelphia 

In the Adelphia scandal, which broke in March 2002, one of the nation’s largest 
cable television operators was abruptly hit by public allegations of “self-dealing,” 
an illegal process in which representatives of a corporation make personal use of 
corporate funds. Prosecutors would eventually allege that the 77-year-old founder 
of the company, John Rigas, and his two sons, Michael and Timothy, had issued 
themselves $1.6 billion in stock (Starkman 2005). Adelphia paid a total of $715 
million to settle inquiries brought by regulatory agencies, while John and Timothy 
Rigas received 15- and 20-year jail sentences, respectively. 


Bristol-Myers Squibb 

In April 2002, Bristol-Myers Squibb was forced to revise sales and earnings fore- 
casts in order to compensate for having inflated apparent revenue by intentionally 
bloating distributors’ inventories, a practice known as “channel-stuffing.” In 2004 
and 2005, the company paid approximately $909 million to settle government 
probes and shareholder lawsuits related to the scandal. Although CEO Peter 
Dolan came under intense scrutiny from investors and government regulators, he 
was never linked with the company’s accounting malfeasance. 


HealthSouth 

In September 2002, the SEC initiated an investigation of possible insider trading 
by Richard Scrushy, the founder of HealthSouth, a health care services provider 
based in Alabama. In November 2003, Scrushy was indicted, but was later acquit- 
ted at the conclusion of a 2005 jury trial, in spite of the testimony of 15 subor- 
dinates who cooperated with the prosecution in exchange for reduced sentences 
and fines. By early 2006, the HealthSouth corporation had paid between $450 
and $500 million in separate fines and settlements. 


Rite Aid 

Although the first indication of accounting impropriety at Rite Aid came in early 
1999, a full-fledged scandal only broke in 2002, as governmental investigations 
intensified after the revelation of scandals at Enron, WorldCom, and Adelphia. 
As a result of these investigations, six Rite Aid executives were eventually charged. 
A yearlong trial and sentencing process began in mid-2003, culminating with a 
9-year jail sentence for former CEO Martin Grass. 


Waste Management 

In May 1999, Waste Management's board of directors unexpectedly withheld pay- 
ment from several ex-executives who had overseen major growth and transforma- 
tion at the company during the 1990s. In August, the Houston-based corporation 
admitted to an SEC probe and fired its CEO. For two years the company floundered, 
restating earnings and losing profit amid sporadic disclosures of accounting 
irregularities. Finally, in 2001 and 2002, the company settled the SEC probe for 
$457 million and six former executives were accused of fraud. In August 2005, the 
corporation, along with the executives who had been charged, paid $30.8 million 
to settle the fraud case. 


Xerox 

In 2001, the SEC began an investigation of Xerox’s American operations, focusing 
on the company’s procedures for reporting income from the leasing of copiers 
and other office equipment. In April 2002, Xerox announced that it had paid a 
$10 million civil fine to settle the investigation. Federal regulators, however, soon 
expanded their inquiry to include six current and former company executives 
who allegedly conspired between 1997 and 2000 to misstate company profit and 
inflate stock prices. In June 2003, the six executives paid a total of $22 million 
to settle the government’s lawsuit. 
Appendix B. 

Newspaper                               2000    2004
Arizona Republic                        Bush    Bush
Atlanta Journal-Constitution            Gore    Kerry
Baltimore Sun                           Gore    Kerry
Boston Globe                            Gore    Kerry
Boston Herald                           Bush    Bush
Buffalo News                            Gore    Kerry
Charlotte Observer                      Gore    Kerry
Chicago Tribune                         Bush    Bush
Cincinnati Enquirer/Post                Bush    Bush
Columbus Dispatch                       Bush    Bush
Daily Oklahoman                         Bush    Bush
Denver Post                             Gore    Bush
Denver Rocky Mountain News              Bush    Bush
Des Moines Register                     Gore    Kerry
Detroit News/Free Press                 Bush    Kerry
Fort Worth Star-Telegram                Bush    Bush
Hartford Courant                        Bush    Bush
Houston Chronicle                       Bush    Bush
Indianapolis Star                       Bush    Bush
Kansas City Star                        Gore    Kerry
Little Rock Arkansas Democrat-Gazette   Bush    Bush
Los Angeles Times                       None    None
Louisville Courier-Journal              Gore    Kerry
Miami Herald                            Gore    Kerry
Milwaukee Journal-Sentinel              None    Kerry
Minneapolis Star Tribune                Gore    Kerry
New York Daily News                     Bush    Bush
New York Post                           Bush    Bush
Newark Star-Ledger                      Gore    Kerry
Omaha World-Herald                      Bush    Bush
Orange County Register                  None    None
Orlando Sentinel                        Bush    Kerry
Philadelphia Inquirer                   Gore    Kerry
Pittsburgh Post-Gazette                 Gore    Kerry
Plain Dealer                            Bush    None
Portland Oregonian                      Bush    Kerry
Sacramento Bee                          Gore    Kerry
San Antonio Express-News                Bush    Bush
San Diego Union Tribune                 Bush    Bush
San Francisco Chronicle                 Gore    Kerry
San Jose Mercury News                   Gore    Kerry
Seattle Post-Intelligencer              Gore    Kerry
Seattle Times                           Bush    Kerry
South Florida Sun-Sentinel              Gore    Kerry
St Louis Post-Dispatch                  Gore    Kerry
St Paul Pioneer Press                   Bush    Bush
St Petersburg Times                     Gore    Kerry
Tampa Tribune & Times                   Bush    None
The New York Times                      Gore    Kerry
USA Today                               None    None
Washington Post                         Gore    Kerry


Socio-demographic Determinants of Economic 
Sociolinguistic Integration 


Edward Crenshaw, Ohio State University 
Kristopher Robison, Northern Illinois University 


This study establishes a socio-demographic theory of international development derived 
from selected classical and contemporary sociological theories. Four hypotheses 
are tested: (1) population growth’s effect on development depends on age-structure; 
(2) historic population density (used here as an indicator of preindustrial social complexity) 
boosts contemporary economic performance; (3) ethnic polarization impairs 
economic growth; and (4) a nation’s degree of sociolinguistic integration positively 
influences economic performance. Investigating annual changes in real gross domes- 
tic product per capita from 1970 to 2000, our pooled time-series analyses of 101 
developed and developing countries generally support these hypotheses net of com- 
mon alternative explanations, suggesting that the etiology of economic growth could 
benefit from the reintroduction of classic and contemporary sociological theories. 


Introduction 


Economic growth and attendant changes in social organization have been central 
themes of analysis since the earliest days of social science. As an independent vari- 
able, economic development has been linked to urbanization, fertility, mortality, 
democratization, income inequality and many other social phenomena, predictive 
power that justifies a sociological interest in the etiology of economic growth and 
social development. 

In this article we propose a demographically-driven, ecological-evolutionary 
model of economic growth. Following logic common to several ecological, evolu- 
tionary and functionalist accounts of social change, we investigate the influences 
of age-dependency, historical population density, ethnic/racial polarization, and 
sociolinguistic integration on economic growth over three decades (1970 to 2000), 
bringing these dimensions together in a theoretically and empirically integrated 
study for the first time. In the following analyses of all nations for which data exist, 
these ecological and evolutionary characteristics play important roles in economic 
development controlling for more frequently considered alternative explanations. 
Figure 1. The Socio-Demographic Engine of Change in Early Social Science 

Although population dynamics were the fulcrum of early sociological accounts 
of economic change (Spencer 1852; Durkheim 1893), this demographic focus in 
comparative studies of national development was lost for most of the post-WWII 
era. Despite a series of empirical studies during the 1990s that linked population 
change to development, there has been little investigation of the influence of other 
socio-demographic dimensions on economic change and no attempt at all to link 
empirical findings to sociological theory. While some studies have investigated 
the role of age structure on economic growth (Mankiw et al. 1992; Barlow 1994; 
Brander and Dowrick 1994; Kelley and Schmidt 1995; Crenshaw et al. 1997; 
Sachs and Warner 1997; Bloom and Williamson 1998), only a handful have 
considered either population density’s influence on economic growth (Simon and 
Gobin 1980; Kelley and Schmidt 1995; Crenshaw and Oakey 1998; Burkett et 
al. 1999; Acemoglu et al. 2002) or the role of ethnicity in development (Easterly 
and Levine 1997; Burkett et al. 1999; Putterman 2000). Therefore, while we are 
beginning to appreciate socio-demographic influences on development, a more 
comprehensive theory-driven account of population’s impact on economic growth 
would be useful. 

This tardiness in scrutinizing population's role in development is ironic given 
the longstanding pro-natalist tradition in Western thought. Many social com- 
mentators of antiquity and the medieval period (e.g., Cicero, Plato, Machiavelli) 
as well as many of the first social scientists considered increases in population an 
indicator of successful, expansionary social systems. Adam Smith (1776) noted 
that market size limits the complexity of the division of labor. Spencer (1852) 
elaborated this principle by explicitly linking increased population size and density 
to economic change via innovation, which Durkheim (1893) later attributed 
to competition. As individuals and groups find themselves struggling for scarce 
resources in densely populated environments, they tend to differentiate their 
economic activities (to reduce direct competition) and to innovate to increase 
surplus (see Figure 1). Essentially, organizational specialization and technological 
development are partial responses to population pressures, and while intensely 
competitive environments may harm human welfare in the short-term, the emer- 
gent property springing from such competition is a population used to hard work, 
long hours and self-sacrifice nested in a complex division of labor, attributes which 
may lead in time to superior economic performance at the macro level. 


Population Growth, Age-Structure and Economic Development 


The modern study of population growth’s influence on economic growth began 
with Malthus’ (1798) famous dictum that agricultural productivity increases ar- 
ithmetically while population increases geometrically. That is, in the absence of 
“preventive checks,” populations overshoot their abilities to secure subsistence and 
“positive checks” (i.e., mortality) restore the equilibrium. 
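
The contrast between arithmetic and geometric growth can be made concrete with a minimal Python sketch. This is an illustration only and not part of the analyses reported here: the function name `first_overshoot` and every starting value and rate are hypothetical. The function searches for the first period in which a geometrically growing population exceeds an arithmetically growing subsistence capacity.

```python
def first_overshoot(food0, food_step, pop0, pop_rate, max_periods=1000):
    """Return the first period in which population exceeds subsistence
    capacity, or None if that never happens within max_periods.

    All inputs are hypothetical illustration values, not estimates.
    """
    for t in range(max_periods + 1):
        food = food0 + food_step * t       # arithmetic growth: constant increment
        pop = pop0 * (1 + pop_rate) ** t   # geometric growth: constant rate
        if pop > food:
            return t
    return None


# Made-up example: capacity starts at 200 persons and rises by 10 per period;
# population starts at 100 and grows by 3 percent per period.
period = first_overshoot(200, 10, 100, 0.03)
print("first overshoot period:", period)
```

With any positive growth rate the geometric series eventually overtakes the arithmetic one, which is the logic behind the "positive checks"; with a zero growth rate and a population that starts below capacity the function returns `None`.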

Coale and Hoover (1958) elaborated on this theory by stressing family econom- 
ics and capital formation. According to their perspective, rapid population growth 
forces families to consume savings, adversely affecting capital formation and national 
savings rates. In addition, high youth dependency ratios force nations to invest 
scarce capital in a game of catch-up to provide education, jobs and infrastructure 
for a rapidly expanding labor force. Allocating capital to less-productive segments 
of the population (e.g., educational expenditures) forces nations to undercapitalize 
those already in the labor force (Bloom and Freeman 1988), resulting in sub-optimal 
investment patterns and subsequent under-performance in economic growth. 

On the other hand, many economists assert that abundant labor has been and re- 
mains a prerequisite for economic take-off, even going so far as to attribute much of 
the “Asian miracle” to its huge, inexpensive labor reserves (Williamson 1998). One 
argument asserts that as competition for jobs depresses wages, profitability increases 
for owners of capital and encourages greater domestic investment (Lewis 1954, 
1958). Labor force effects are therefore mediated by investment rates. Related argu- 
ments involve scale effects and demand effects. A growing labor force encourages 
scale effects such as expanding domestic markets, growing complexity in divisions 
of labor, greater volumes and superior mixes of diffused information, technology 
and skills, and declining per capita costs associated with public infrastructure due 
to heavy use (Simon 1981). On the demand side, an increase in population may 
lead to a rise in consumer demand for durable goods and real estate (Easterlin 1967). 

Regardless, older studies linking population growth and economic growth 
failed to differentiate labor force and fertility effects, leading to weak and equivo- 
cal findings (Kuznets 1967; Easterlin 1967; Simon and Gobin 1980; Firebaugh 
1983; McNicoll 1984; Chesnais 1987). This dearth of robust statistical evidence 
linking population and economic growth was due to the inappropriate specifica- 
tion of models (Crenshaw et al. 1997). While various combinations of birth and 
death rates may produce the identical rate of population growth, age structures 
may differ substantially. To correct this, population growth must be disaggregated 
into its productive (i.e., competitive and adaptive) and non-productive segments 
to determine the true relationships between demographic change and economic 
change. Indeed, cross-national evidence increasingly points out that youth de- 
pendency (i.e., the ratio of children to productive adults) does retard economic 
growth, but that adult population growth (i.e., labor force growth) has the opposite 
influence (Bloom and Freeman 1988; Barlow 1994; Brander and Dowrick 
1994; Crenshaw et al. 1997). 
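
The point that identical rates of population growth can conceal different age structures can be sketched in a few lines of Python. The example is a hedged illustration under stated assumptions: the helper names `growth_rate` and `decompose` and the two "countries" are invented for this purpose and do not come from the data analyzed here. Youth dependency is computed, as defined above, as the ratio of children to productive adults.

```python
def growth_rate(start, end):
    """Proportional change between two period counts."""
    return (end - start) / start


def decompose(children0, adults0, children1, adults1):
    """Split total population growth into child and adult components.

    Inputs are hypothetical head counts at the start (0) and end (1)
    of a period.
    """
    total0 = children0 + adults0
    total1 = children1 + adults1
    return {
        "total_growth": growth_rate(total0, total1),
        "child_growth": growth_rate(children0, children1),
        "adult_growth": growth_rate(adults0, adults1),
        "youth_dependency": children1 / adults1,  # children per productive adult
    }


# Two invented countries, each growing from 100 to 110 persons.
country_a = decompose(40, 60, 50, 60)  # all growth among children
country_b = decompose(40, 60, 40, 70)  # all growth among adults

for name, result in (("A", country_a), ("B", country_b)):
    print(name, {k: round(v, 3) for k, v in result.items()})
```

Both hypothetical countries show the same aggregate growth, yet country A ends the period with a higher youth dependency ratio and no labor force growth, while country B shows the reverse. A model that enters only total population growth cannot distinguish the two cases.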

Thus, some types of population growth contribute to competition and adaptabil- 
ity more directly than others, and so not all forms of population growth contribute 
equally to social change. Clearly, the increase of productive adults should be the 
operative variable in economic progress. As the major actors in any economy, adults 
compete for scarce resources, differentiate themselves to avoid head-on competition, 
apply diffused information to production, and engage in major consumer activity. 
Population growth among children and the elderly, however, has far fewer beneficial 
multipliers and, even when children and retired persons are common in the labor 
force, the contributions of these age segments to ecological processes such as dif- 
ferentiation and diffusion are minor relative to adult contributions. 


Population Density, Proto-Modernity and Economic Development 


Early ideas about the latent functions of intense social competition suggest that 
socio-demographic dynamics may also have long-run influences on social devel- 
opment. Contemporary theoretical statements concerning the long-run effects of 
population growth by Boserup (1965, 1981), Lenski (1966, 2005) and Simon 
(1981) are compatible with early population-driven models of social change. Some 
early sociological theories posited that the wellspring of organizational and techno- 
logical innovation and subsequent social development is competitive specialization. 
Historical differences in climates, disease regimes, geography and social environ- 
ment culminate in differential demographic inheritances among contemporary 
countries, differences that allow us to place contemporary nations along a con- 
tinuum that runs from pre-modernity to modernity. Some developing countries are 
proto-modern societies, or those preindustrial societies where historical population 
pressures forced populations to experience social evolution in terms of advanced 
agrarianism, institutional development, and socio-spatial complexities/efficien- 
cies well before the modern era. Essentially, historic population density serves as a 
convenient marker of the starting line in the contemporary race for economic growth. 

This perspective is bolstered by noting that every industrialized country in the 
world has a history of advanced agrarianism, either as a consequence of long-term 
social evolution or, in the case of settler colonies (e.g., the United States), direct 
transplantation of advanced agrarianism to new territories. As Lenski and Nolan 
(1984) point out, this is no accident given that plow agriculture produces the 
economic surplus that facilitates a large, densely-settled population, the creation 
of written language, the need for money to replace barter, and other hallmarks of 
modernity. The “developed world” is comprised of these kinds of nations. Other 
nations in this advanced agrarian group, whose technological evolution may have 
been impaired by a variety of factors (e.g., the parasitic Mongol regimes of China 
and India) might now be expected to catch up at an accelerated rate. 
Specifically, proto-modern societies exhibit social and spatial properties directly 
related to their demographic inheritances that predispose them to rapid devel- 
opment upon exposure to modern technology and external capital. First, proto- 
modern societies tend to produce higher levels of agricultural surplus. Population 
density is inversely related to farm size in traditional agriculture, consistent with 
the need to subdivide land holdings generation after generation. Over time, many 
advanced agrarian systems revert to small-holding societies, and this dependence 
on land fosters an intensification in land use as more and more people derive 
subsistence from less and less arable land (Boserup 1965). There is thus a ten- 
dency for small farms to use land much more productively (Berry and Cline 1979). 
Contemporary evidence suggests that higher levels of aggregate surplus, when com- 
bined with intense competition for land, most often result in renewed agricultural 
innovation, greater occupational specialization and a concomitant increase in non- 
farm employment in rural areas (Clark 1967; World Bank 1978; Boserup 1990:86). 

Second, historically-dense societies are generally less burdened by severe economic 
inequalities and other social/demographic fault lines. Unlike formerly horticultural 
societies, where low population densities and correspondingly low levels of political/ 
economic development allowed colonial conquest and subsequent inequalities in 
land and capital (i.e., Africa and Latin America), proto-modern societies have generally 
experienced shortages of land and capital but an abundance of labor. As labor 
became the only flexible source of wealth, the problem of hoarding labor power 
forced elites to distribute subsistence more evenly. Moreover, elites subdivided their 
lands among their children over the generations, resulting in more even distribu- 
tions of income or subsistence (Nolan and Lenski 1985; Crenshaw 1992, 1993). 
As Diamond and Bellwood (2003) note, when farming populations spread across 
the world’s continents, their technologies gave them the wherewithal to displace, 
assimilate or wipe out non-agrarian competitors. Today, therefore, we would expect 
proto-modern societies to be more ethnically and linguistically homogeneous, and 
to have been more resistant to settlement by other agrarian groups (i.e., colonialism). 
According to Lenski (2005), we see the repetition of this pattern in the modern 
world, with industrial and post-industrial populations exerting tremendous selection 
pressures on human language and (material and non-material) culture worldwide. 

Third, proto-modern societies are more spatially articulated, leading to im- 
provements in scale effects and the early penetration of markets into rural areas. 
Such societies usually exhibit articulated networks of villages, towns and cities 
prior to industrialization (Boserup 1981). Dense, permanent settlements encour- 
age complex trade and communications networks, attributes that promote early 
commercial development in rural and semi-rural districts as well as the spread of 
literacy and education. Moreover, dense settlement usually supports infrastructure 
such as roads, canals and ports (Glover and Simon 1975; Boserup 1981). Thus, 
proto-modern societies have enjoyed a higher level of spatial articulation due to 
more advanced infrastructure and, in turn, the advanced political development 
that resulted from activities such as irrigation and road building (Chanda and 
Putterman 2007). These complex settlement patterns and linking infrastructure 
also encourage the spread of commercial agriculture and cottage industries — im- 
portant precursors to industrialization (Kriedte et al. 1981). 

Finally, proto-modern societies are the cradles of modern social psychology. 
Private property, contracts, wage labor and inheritance law evolved well before 
industrialism in response to the breakdown of communal land tenure and produc- 
tion systems under the onslaught of population pressures (Popkin 1979; Boserup 
1990). Moreover, the “possessive individualism” of the West (MacPherson 1962) is 
in fact not unique to the West; as Simmel notes (1971[1908]), true individualism 
becomes possible only with the increasing structural-functional differentiation of 
populations. Growing population density forces many individuals to specialize 
to reduce competition, and this specialization necessarily increases the number 
of unique social transactions, accelerates the process of role segmentation, and 
gradually weakens the primordial group ties within a population. In short, the 
density-induced transition from “mechanical” to “organic” solidarity (Durkheim
1893) begins prior to industrialism.

Our sociological approach anticipates a positive coefficient between demo- 
graphic inheritance (i.e., historic population density) and contemporary economic 
growth per capita. On the other hand, Acemoglu et al. (2002) argue that the 
history of European colonialism and selective investment bred institutional con- 
figurations (i.e., governmental constraints on property rights) that “reversed” the 
fortunes of the old agrarian empires. Their proof consists of a negative correlation 
between estimates of population density in A.D. 1500 and the level of economic
development in 1995 (GDP/c). While we might concede that such a temporary
historical effect is possible, the historical record linking high density and high
civilization (e.g., Egypt, China, India, Japan, southern Europe) suggests that
contemporary growth among such societies should be relatively rapid, perhaps an
accelerated “recovery” (Chanda and Putterman 2007). We contend that whatever the
advantages or disadvantages of colonialism, the traditional advantages related to 
population and social complexity should be ascendant in the post-colonial world. 


The Ecology of Ethnic Competition 


Although ethnic, racial and linguistic diversity can be a social benefit, history 
clearly demonstrates that ethnic/racial competition can also challenge social 
cohesion rather than reinforce it. Recent conflicts such as those in Yugoslavia, 
India, Indonesia, Rwanda and Iraq demonstrate that ethnic, racial, linguistic or 
religious cleavages are extremely potent social forces that shape national histories. 
Nevertheless, many have noted the usefulness of ethnicity as a social tool; ethnic, 
racial or linguistic markers promote efficient intra-group relations. Recent
sociological research highlights the economic benefits of ethnicity-based social
capital (e.g., Aldrich and Waldinger 1990; Waldinger 1994), generally because the
use of ascribed statuses as primary axes of organization can provide the phenotypic
markers, tight social interdependencies, and common cultural/social warp and 
woof that make for effective social organization and social control (Hechter 1987). 

As Portes (1998) points out, however, the downside of ethnic solidarity is 
out-grouping. While ethnic competition models are common in the social sciences, 
social scientists rarely explain exactly why phenotypic or linguistic differences 
should so often lead to competition and conflict. An ecological perspective helps 
to fill this gap by noting that, while ethnicity provides a fairly efficient organiza- 
tional basis for many large unions, the ability of such unions to forge beneficial ties 
across groups is sharply circumscribed precisely because ascribed status is so effec- 
tive for the in-group. The human ecological notion of symbiotic and commensal 
unions explains why (Hawley 1968). Symbiotic unions are exchange relationships 
between parties with complementary differences. Mutually dependent manufac- 
turers in a production chain or the gender/age division of labor within families 
are good examples of symbiotic unions. Commensal unions, on the other hand, 
are based on the similarity of members or units, such as labor unions or profes- 
sional associations. Commensal unions are therefore made up of like members 
who cooperate in drawing sustenance from a common resource base. Hawley 
(1968:332) states that, “The symbiotic union enhances the efficiency of produc- 
tion or creative effort; the commensal union, since its parts are homogeneous, 
can only react and is suited, therefore, only to protective or conservative actions.” 
Although commensal unions work to conserve the positions and powers of their 
members, they still play a large role in social change. A predominance of commen- 
sal unions attempting to fill similar niches leads to competitive pressures which 
set the stage for organizational and technological innovation and social change 
(complementary differentiation). Where complementary specialization is relatively 
easy, commensalism will yield to symbiosis and competition to cooperation and 
exchange (i.e., structural-functional differentiation). Where such complementary 
change is impossible, difficult or profitless, competition intensifies. 

In the language of classical theory, sociocultural diversity sometimes interferes 
with social differentiation and interdependency (see also Blau 1977). The very
thing that makes human “races” or “ethnicities” strong axes for social organization 
and solidarity necessarily devalues weaker ties with out-groups. This is because, 
unlike biological specialization among the social insects, ethnic or racial differ- 
ences are relatively superficial and may be partially or wholly socially constructed. 
Racial or ethnic differences serve as convenient markers to promote beneficial 
externalities (i.e., social capital) within in-groups but, unlike bees, they create 
no natural division of labor facilitating economic and social cooperation across 
groups. Thus, social organization dependent on race or ethnicity boosts competi- 
tion between groups while interfering with the conversion of this competition into 
cooperative specialization and productive social change. The great irony is that the 
strength and efficacy of ethnic-based social organization makes complementary
adaptation between groups difficult and sometimes impossible. Unfortunately,
given that in-group solidarity is generally heightened by external threats (Simmel 
1955), sharp ethnic, racial and/or linguistic polarization may serve as a cohesive 
factor by protecting group identities from the corrosive influences of modernity. 
For these reasons, interracial or inter-ethnic commensalism may damage inter- 
group relations in what becomes a self-perpetuating cycle. 

Suspicions or hostility between ascribed groups may also impair cultural dif- 
fusion processes critical to modernization. As Rogers (1983) notes, the principle 
of homophily, or the degree to which interacting actors resemble one another, is 
important in any process of social change. Extending this principle to economies, 
the more closely economic actors are matched in terms of cultural or social back- 
ground the greater their likelihood of successfully exchanging goods and infor- 
mation. Therefore, in a racially- or ethnically-segmented society transmission of 
values, technologies and information may be hampered, lines of communication 
may be truncated, and national markets may be slow to form. 

Easterly and Levine (1997) also posit that ethnic diversity harms economic per- 
formance via political processes, encouraging a diverse interest structure that can 
lead to conflicts in public policy and implementation (see also Alesina et al. 1999). 
If a society is sufficiently polarized in this way, political deadlock and corruption, 
civil conflict, and non-optimal public spending may lead to delays in creating 
necessary political and economic infrastructure (e.g., roads, central banks) and 
institutional complexities (e.g., property law, contract law). To restate Easterly and 
Levine's account in the language of our theoretical framework, collective goods 
in a racially- or ethnically-diverse setting may be viewed through the prism of the
“inclusive fitness” of competing groups (Hamilton 1964; Nielsen 1994; Williams
1994), thereby damaging the consensus on institutional change. 

Nevertheless, ethnicity is not necessarily destiny. Some studies have found 
no strong correlation between social diversity and economic growth (e.g., Lian 
and Oneal 1997). Also, if a nation can manage to instill a sense of “being in this
together” among diverse elements of a population, it is possible that ethnic/racial 
differences may fade over time. As linguistic barriers crumble and a common his- 
tory and sense of nationhood prevail, ethnicity may play a more neutral or even a 
positive role in development. As Anderson and Paskeviciute (2006) note, ethnic diversity
and linguistic diversity are not the same; many countries, such as the United States 
and Brazil, have ethnically/racially diverse populations but a national language 
that dominates the society. Because linguistic homogeneity promotes increased 
communication, diffusion, cross-cutting economic and political ties, and inter- 
marriage between distinctive groups, some empirical evidence suggests that lin- 
guistic homogeneity is positively associated with economic development (Nettle 
2000). Therefore, we propose that “sociolinguistic integration” (i.e., the linguistic 
unification of an otherwise ethnically- or racially-diverse national territory) will 
have a positive influence on a nation’s rate of economic growth.


Theoretical Synopsis


Although a growing literature linking population to economic development now 
exists, such studies have yet to establish socio-demographic dynamics as central to 
theoretical accounts of economic and social change. Population growth’s influence on
economic growth depends on the relative growth of population segments. Although 
the multiplication of working adults should enhance economic performance and 
social complexity, the proliferation of non-working children and elders could not 
produce the same effect and might in fact retard economic growth (see Figure 2).

Figure 2. Proposed Sociodemographic Model of Economic Development

Historical population density is a salient manifestation of the degree of
structural-functional differentiation and interdependency (i.e., social complexity) a society
has achieved through the competition-avoidance involving specialization and 
innovation. It is a convenient marker of a nation’s starting point in the race for 
contemporary economic growth. The ethnic/racial/linguistic composition of the
population should matter for economic growth because social organization based 
on ascribed status may not lend itself to adaptive differentiation and interde- 
pendency. Alternatively, if a dominant language can unite an otherwise diverse 
population then the pernicious influences of diversity might be partially overcome. 


Design and Variables 


We apply pooled time-series cross-section analysis to an annualized (unbalanced) 
panel of data covering much of the globe (i.e., 101 nations) for the years
1970-2000.¹ The main advantage of this method is the larger sample size acquired by
combining a cross-section and time-series design into a country-year database. 
This methodology also allows us to analyze subtle changes over time in the
dependent variable. A pooled analysis will also permit observation of variation over
both time and space simultaneously, with the additional advantage of allowing
robustness checks in the form of temporal and regional fixed effects.

The major disadvantage is that the error structure is complicated by the inclu- 
sion of cases that can have non-random variation over space and time. Pooling 
data with an improper model specification may also lead to the conclusion 
that the error terms are heteroscedastic and autocorrelated when in fact they 
are not. Attempting to correct these potential problems, we follow Beck and 
Katz (1995, 1998) and use an ordinary least squares model with panel-corrected
standard errors and a lagged dependent variable. Moreover, given that standard 
country-dummies (i.e., fixed effects) eliminate all cross-sectional variation in the 
sample, we are constrained to using regional and temporal fixed effects (i.e., the 
inclusion of region dummies and decadal growth averages). 

The generic model can be specified as follows: 


$$Y_{i,t} = \alpha + \rho Y_{i,t-1} + \beta X_{i,t-1} + \varepsilon_{i,t}$$

where $Y_{i,t}$ is the dependent variable for country i at time t, and $Y_{i,t-1}$ is the same
variable lagged one year. $X_{i,t-1}$ is a vector of important covariates each lagged one
year to better capture causality and to rule out reciprocal causation.
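
As a minimal sketch of how the lagged terms in the generic model could be assembled from an unbalanced country-year panel, the following Python fragment pairs each observation with the same country's values from the preceding year. The function name, the data layout and the numbers are illustrative assumptions, not the procedure or data used in this study.

```python
from typing import Dict, List, Tuple

# Hypothetical layout: (country, year) -> (y, x), where y is the dependent
# variable and x is a single covariate.
Panel = Dict[Tuple[str, int], Tuple[float, float]]


def build_lagged_rows(panel: Panel) -> List[dict]:
    """Pair each country-year with that country's values from year t-1.

    A row is kept only when the same country was observed in the prior year,
    so gaps in an unbalanced panel never borrow a lag from another country
    or from a non-adjacent year.
    """
    rows = []
    for (country, year), (y, _x) in sorted(panel.items()):
        prior = panel.get((country, year - 1))
        if prior is None:
            continue  # no observation for t-1: drop this row
        y_lag, x_lag = prior
        rows.append({"country": country, "year": year,
                     "y": y, "y_lag": y_lag, "x_lag": x_lag})
    return rows


# Illustrative values only (not taken from the study's data).
example = {
    ("A", 1970): (10.0, 0.80),
    ("A", 1971): (12.0, 0.79),
    ("A", 1973): (15.0, 0.77),  # 1972 is missing for country A
    ("B", 1971): (5.0, 0.95),
    ("B", 1972): (4.0, 0.94),
}
rows = build_lagged_rows(example)
```

Only two rows survive in this example (A in 1971 and B in 1972), because every other observation lacks a usable prior-year value.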


Dependent Variable 


Our dependent variable is an annual change score of real gross domestic product 
per capita (Heston et al. 2002). Real gross domestic product per capita is the 
best indicator of national wealth currently available given its standardization in 
1996 international dollars, its adjustment for actual buying power, and its
exclusion of factor income from abroad. We opted to predict annual change-scores
over annual rates because the change-score is better behaved in our sample (CV 
= 2.7 compared to CV = 3.9 for the annualized rate of economic growth) and 
substantially decreases the “spikiness” of the dependent variable. We also enter a 
lagged dependent variable (the change-score for the year prior), which is essential 
to compensate for autocorrelation (Beck and Katz 1995). 
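
The contrast between change-scores and annual rates can be sketched as follows. The helper names and the series are hypothetical, and the use of the population standard deviation in the coefficient of variation is an assumption; the text does not say which variant was computed.

```python
from statistics import mean, pstdev


def change_scores(levels):
    """Absolute year-to-year change in RGDP per capita (dollars)."""
    return [curr - prev for prev, curr in zip(levels, levels[1:])]


def growth_rates(levels):
    """Year-to-year percentage growth in RGDP per capita."""
    return [100.0 * (curr - prev) / prev for prev, curr in zip(levels, levels[1:])]


def coefficient_of_variation(values):
    """Standard deviation divided by the absolute mean (population SD assumed)."""
    return pstdev(values) / abs(mean(values))


# Hypothetical RGDP/c series for one country (illustration only).
levels = [1000.0, 1050.0, 1030.0, 1100.0]
cv_change = coefficient_of_variation(change_scores(levels))
cv_rate = coefficient_of_variation(growth_rates(levels))
```

A lower coefficient of variation indicates a dependent variable with less relative dispersion, which is the sense in which the change-score is described as better behaved.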

We incorporate a time-series of secondary school enrollments to control for 
human capital (World Bank 2002). Given the important role modernization 
theorists assign to schooling, some going so far as to term mass education as the 
“handmaiden of industrialism” (Kerr et al. 1960), it is not surprising that human 
capital indicators (typically measured as school enrollment ratios) have become 
standard features of cross-national analyses of economic growth (e.g., Barro 1991;
Firebaugh and Beck 1994).


Focal Predictors 


Our models incorporate four focal variables: (1) the age-dependency ratio; (2) an
estimate of a country’s population density circa A.D. 1500; (3) the ratio of the
second largest ethnic group to the largest group; and (4) the difference obtained
by subtracting linguistic fractionalization from ethnic fractionalization. The first
indicator simply measures how burdened working populations are by those who do not
work (the population younger than 15 or older than 65 divided by the population
aged 15-65); such dependency ratios are fairly common in cross-national assessments
of economic growth. Population and age-structure variables have been adapted from
the World Bank (World Bank 2002).²
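
A minimal sketch of the age-dependency ratio as described above is given below. The function and argument names are hypothetical, and the result is left as a plain ratio; whether the published indicator is rescaled (for example, multiplied by 100) is not stated here and is left as an open assumption.

```python
def age_dependency_ratio(under_15: float, working_age: float, over_65: float) -> float:
    """Dependents (young plus old) per working-age person."""
    if working_age <= 0:
        raise ValueError("working-age population must be positive")
    return (under_15 + over_65) / working_age


# Illustrative population counts in millions (not real data).
ratio = age_dependency_ratio(under_15=30.0, working_age=60.0, over_65=10.0)
```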

The proto-modern effect is represented by an estimate of population density in 
A.D. 1500, adapted from McEvedy and Jones (1978), who supply demographic 
estimates of historical populations for contemporary nation-state equivalents.³
We chose persons per square mile in A.D. 1500 because it maximizes our samples 
and, more importantly, precludes the run-up in population densities since the 
Industrial Revolution began. Thus, in keeping with the theory's focus on preindus- 
trial social complexity, our use of population density in A.D. 1500 prevents any 
confusion between modern and pre-modern social complexity. 

Our ethnic dualism ratio (operationalized as the ratio of the second-largest ethnic 
group's percentage of the population over the majority group’s percentage, circa 
1980) comes from the World Christian Encyclopedia, a survey of world religion that 
incorporates official statistics. Following Blau’s (1977) argument that extreme di- 
versity increases intergroup affiliation while the existence of only a few large groups 
decreases social interaction and entrenches in-group solidarities, we attempt to tap 
the most salient ethnic divisions in each population with this indicator. 
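
The ethnic dualism ratio can be sketched in a few lines. The function name and the example shares are hypothetical, and the output is an unscaled ratio; any rescaling applied in the actual data set is not specified in the text.

```python
def ethnic_dualism_ratio(group_shares):
    """Share of the second-largest group divided by the share of the largest."""
    if len(group_shares) < 2:
        raise ValueError("at least two groups are required")
    largest, second = sorted(group_shares, reverse=True)[:2]
    if largest <= 0:
        raise ValueError("largest group share must be positive")
    return second / largest


# Illustrative population percentages (not real data).
dualism = ethnic_dualism_ratio([60.0, 30.0, 10.0])
```

Values near 1 indicate two groups of nearly equal size, while values near 0 indicate a single dominant group.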

Although strong ethnic pluralism may damage a nation’s ability to unify its 
economic and political landscape, demography isn’t necessarily destiny. If
linguistic fractionalization is substantially lower than ethnic fractionalization, it
suggests a process of cultural assimilation. A common language for a growing
proportion of a nation’s citizenry eases political and economic exclusion, decreases 
internal colonialism/regional ghettoization, and lowers barriers to information 
and technology. Using variables from Fearon (2003), we calculate this difference; 
the greater the difference between ethnic and cultural/linguistic fractionalization, 
the greater a nation’s sociolinguistic integration. We predict such consolidation 
partially mitigates other negative influences. 
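
The sociolinguistic integration measure is a simple difference, sketched below. The fractionalization helper uses the common one-minus-sum-of-squared-shares index as an assumption for illustration; the published Fearon (2003) measures may be constructed differently (for instance, by weighting linguistic distance), and all names and shares here are hypothetical.

```python
def fractionalization(shares):
    """One minus the sum of squared group shares (shares are proportions)."""
    return 1.0 - sum(s * s for s in shares)


def sociolinguistic_integration(ethnic_frac: float, linguistic_frac: float) -> float:
    """Ethnic fractionalization minus linguistic fractionalization."""
    return ethnic_frac - linguistic_frac


# Illustrative shares: three ethnic groups, but one widely shared language.
ethnic = fractionalization([0.5, 0.3, 0.2])
linguistic = fractionalization([0.9, 0.1])
integration = sociolinguistic_integration(ethnic, linguistic)
```

A larger positive difference corresponds to a population that is ethnically diverse but linguistically unified.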


Variables Suggested by Alternative Theories (Controls) 


An obvious barrier to economic expansion is civil war, a generally ruinous event 
that ravages a national economy. Ethnic competition and rapid population growth 
have been linked to a higher incidence of internal warfare (Goldstone 1991), so to 
guarantee that our focal effects are not spurious due to internal violence we create 
an annualized dummy variable coded 1 if a given nation experienced a civil war in 
that year (PRIO 2004). We define civil war as any conflict between state actors and 
non-state actors that results in at least 1,000 battle-related deaths in a given year. 

On the other hand, political-economists generally emphasize external causes of
economic growth such as colonialism, foreign trade and investment. One of the
more innovative approaches to studying the effects of past imperial policies on 
developing nations belongs to Acemoglu et al. (2002). They suggest that where 
prior social development (measured by population density in 1500) precluded actual 
large-scale European settlements, the colonial powers relied on extractive institu- 
tions that “reversed” the fortunes of these colonies, essentially impoverishing them 
by the middle of the 20th century. As our study period cannot address the influences
of colonialism prior to 1970, our goal here is simply to demonstrate that the broad 
parameters of social evolution that influence economic development can be found 
net of any lingering institutional influences of colonialism. We accomplish this by 
introducing three control variables: (1) a dummy variable for European settler
colonies (e.g., USA, Canada) (SETTLER COLONIES), (2) a dummy variable coded 1
if a country was ever colonized (COLONIAL STATUS), and (3) an interaction term
between COLONIAL STATUS and POPULATION DENSITY IN A.D. 1500.⁴
This interaction term applies Acemoglu et al.’s theory to economic growth since 
1970 (i.e., the influence of colonial institutions on modern economic growth varies 
by the degree of preindustrial social development, or historic population density). 
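
The three colonial controls can be sketched as follows. The function name, the dictionary keys and the example values are hypothetical; only the coding logic (two dummies and their product with historical density) follows the description above.

```python
def colonial_controls(ever_colonized: bool, settler_colony: bool,
                      density_1500: float) -> dict:
    """Build the two dummies and the colony-by-density interaction term."""
    colonial_status = 1 if ever_colonized else 0
    return {
        "SETTLER_COLONIES": 1 if settler_colony else 0,
        "COLONIAL_STATUS": colonial_status,
        # The interaction is zero for never-colonized countries.
        "COLONY_X_DENSITY_1500": colonial_status * density_1500,
    }


# Illustrative cases (densities are invented for the example).
former_colony = colonial_controls(True, False, 12.5)
never_colonized = colonial_controls(False, False, 40.0)
```

Under the reversal argument, the coefficient on the interaction term would be expected to be negative and statistically significant.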
The influence of trade on economic growth is a point of contention between 
dependency/world-systems theorists and modernization/neo-classical economic 
theorists. Dependency theorists suggest exports as a proportion of GDP is an impor- 
tant control variable (Bornschier and Chase-Dunn 1985), although many consider 
it “dependent development” or “distorted development” (Evans 1979). Some suggest
that such dependent development does not provide the same economic multipliers 
as indigenous industrialization, so as export dominance grows we would expect to 
see anemic economic growth. On the other hand, neo-classical economists view ex- 
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port activities as particularly beneficial to economic growth because they are thought 
to expand access to external markets, convert idle resources into foreign exchange, 
and prevent stagnation due to internal market saturation. Combining these ideas, 
we control for merchandise exports as a percentage of GDP (World Bank 2002). 

As for domestic and foreign investments, modernization theory views both do- 
mestic and direct private foreign investment as assets in both developed and develop- 
ing economies, while critical political-economic theories typically consider foreign 
investment as a prime culprit in many Third World social ills. For this reason, we 
model a time-series of both direct private foreign investment as a percentage of 
GDP and gross domestic investment as a percentage of GDP (World Bank 2002). 
Because some theories posit that the influence of demographic and other macroso- 
cial structural characteristics on economic growth is mediated by investment, our last 
model, which looks for our focal effects net of investment, constitutes an extremely 
conservative test of the proposed demographic hypotheses. 

We were not able to employ country fixed effects even though doing so has 
become fairly common in cross-national pooled time-series analysis. Given that 
three of our four focal variables are cross-sectional in nature (a fact dictated by data 
availability and/or theoretical concerns), and that country dummies eliminate 
cross-sectional variation, we have opted to follow Beck and Katz's (2001) advice to 
avoid the use of country fixed effects. In lieu of country- and temporal-dummies, 
all reported models include regional controls (essentially, regionally-fixed effects) 
as well as decadal dummies for average economic growth among our sample of na- 
tions for the 1980s and 1990s (with the 1970s averages as the reference category). 


Results 


Table 1 reports the initial tests of our hypotheses. Model 1 demonstrates the 
important influences of our lagged dependent variable, the prior level of develop- 
ment and human capital formation (secondary school enrollments), consistent 
with many cross-national assessments of economic growth. Turning to our focal 
variables, Model 2 investigates the impact of age structure on economic develop- 
ment. The age dependency ratio is statistically significant, demonstrating that, 
on average over the study period, a 1 unit rise in the dependency ratio led to a 
decrease of approximately $1.01 in economic growth per annum. The R² statistics
are relatively low, averaging around .3 for these models. While no one has
explained the tendency of big-N studies to produce lower-than-expected R² statistics,
the tendency clearly exists (see Reisinger's 1997 meta-analysis for confirmation).
Short of using fixed effects (to eliminate cross-sectional variation), we found no
alternative combination of theoretically-relevant variables that would produce
higher R² values over the three decades of our study period.

Model 3 introduces ethnic dualism. All else constant, ethnic polarization
discourages rapid economic development; for every 1 percent increase of a nation’s
second largest ethnic group in relation to its majority group, average change
in RGDP/c declines by $.39 per annum. Model 4 confirms these effects and
introduces population density circa A.D. 1500. On average, for every additional
person per square mile half a millennium ago, national economies experienced an
average annual increase in RGDP per capita of $1.16 (which is to say, about $35
of RGDP/c over 30 years for every additional person per square mile in the year
1500). Thus, this indicator of preindustrial social complexity suggests that nations
do not begin the race for economic development on the same starting line, but in
fact came into the modern era with unequal cultural and geophysical endowments.
This supports the theories of Spencer and Durkheim (cf., Malthus) and
ecological-evolutionary theory's view of techno-ecological heritage effects (Crenshaw 1992).
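
The 30-year figure quoted for historical population density is a simple accumulation of the annual effect, which can be checked directly. The sketch assumes plain multiplication and ignores any feedback through the lagged dependent variable; the variable names are hypothetical.

```python
annual_effect = 1.16  # reported dollars of RGDP/c per year, per person per square mile
years = 30            # approximate length of the study period

# Simple accumulation of the annual effect over the study period.
cumulative_effect = annual_effect * years
rounded_effect = round(cumulative_effect)
```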

Models 5 and 6 demonstrate a somewhat countervailing influence of cultural 
assimilation, however. The introduction of assimilation decreases the magnitude of 
ethnic dualism (from b = -.24 in Model 4 to b = -.16 in Model 6), which indicates 
that nations with a high degree of national integration have partially overcome 
the economic disadvantages of ethnic polarization. And, indeed, sociolinguistic 
integration itself is an important determinant of economic growth; assimilation
first decreases economic dynamism, given the social disorganization attendant on
early assimilation pressures, but then turns positive at a relatively high level of
sociolinguistic assimilation (the turning point is around 33, which is in the higher
range of the variable). This establishes the plausibility of the structural/ecological
theory, even controlling for regional and temporal effects.
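
The turning point of a second-degree polynomial follows from setting its first derivative to zero, as sketched below. The coefficient values are hypothetical placeholders chosen only so that the result falls near the reported turning point of 33; they are not the estimates from Table 1.

```python
def turning_point(linear_coef: float, squared_coef: float) -> float:
    """Value of x at which b1*x + b2*x**2 changes direction: -b1 / (2*b2)."""
    if squared_coef == 0:
        raise ValueError("a quadratic term is required for a turning point")
    return -linear_coef / (2.0 * squared_coef)


# Hypothetical coefficients: a negative linear term and a positive squared
# term give an effect that first declines and then rises.
tp = turning_point(linear_coef=-3.3, squared_coef=0.05)
```

With a negative linear term and a positive squared term, the estimated effect declines below the turning point and increases above it, matching the U-shaped pattern described in the text.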

Table 2 reports the results of pitting our sociological account against more
commonly tested explanations. Model 1 demonstrates that, indeed, experiencing a civil
war in a given year leads to a sharp drop in RGDP per capita the following year (on
average, about $64 per person). However, this only negligibly influences our focal
effects. The coefficient and z-ratio for age-dependency both decline to a small
degree, but the effect is still significant at the .1 level. On the other hand, the
coefficients for historical population density, ethnic dualism and the second-degree
polynomial for sociolinguistic integration are relatively unchanged by the
introduction of internal warfare. Somewhat surprisingly, we find no statistically
significant effect for merchandise exporting in Model 2, although its inclusion
improves the effect of age dependency. Model 2, therefore, allows us to dismiss the
notion that the relationships between our focal variables and economic growth are
simply spurious due to mutual correlations with international trade.

Models 3 through 5 investigate the role of colonial heritage on contemporary
economic growth. In Model 3 we see that settler colonies have had an advantage over
other nations; on average, the economies of such colonies have grown $67 more per
annum. On the other hand, restricting the reference category to never-colonized
nations, Model 4 demonstrates that settler colonies have experienced no statistically
significant advantage in economic performance over historically-independent
nations. Surprisingly, former (non-settler) colonies have enjoyed some economic
advantage over independent countries, on average about $70 more per annum,
ceteris paribus. While it is important to be cautious when working with dummy
variables (in that such group averages can be strongly biased by extreme cases), the
pros and cons of colonialism are not central to our arguments here. Rather, Model 
5 indicates that there is no apparent contemporary economic penalty associated 
with being both a former colony and a dense (i.e., pre-industrially complex)
society, contrary to Acemoglu et al. (2002) (cf., Chanda and Putterman 2007). Were
that the case, our interaction term Colony*Density 1500 should be negative and 
statistically significant. We conclude that whatever institutional benefits and/or 
infirmities may have been conferred on colonies by their metropole nations, our
statistical test cannot confirm the idea that more densely-settled (former) colonies
have fared worse than other places in more recent times. 

Model 6 reports our most conservative analysis, where we control for both 
domestic and foreign investment as a percentage of the economy. Although invest- 
ment dynamics have dominated the sociological literature on development, it is 
important to note that many development theories assume that structural influ- 
ences on economic performance, such as age-structure or ethnic polarization, are 
partially or completely mediated by investment rates. If this is the case, we would
expect to see such structural effects fade to statistical non-significance in the pres- 
ence of investment controls, so our focus here is not on how investment influences 
economic performance but rather on how controlling for investment influences 
our focal relationships. Model 6 demonstrates the dominance of gross domestic 
investment on economic performance; on average, a 1 percent increase in domes- 
tic investment over total GDP results in a $10 increase in RGDP/c per annum. 
Interestingly, net of domestic investment, foreign investment plays no generaliz- 
able role in economic performance. The coefficient is weak and statistically non- 
significant. More importantly, however, investment rates do mediate some of our 
focal effects. While both age dependency and historic population density remain 
virtually unchanged, apparently ethnic dualism’s effect on economic growth is 
wholly mediated by investment, and sociolinguistic integration is also weakened 
by controlling investment levels (although it remains significant at the 5 percent 
level). As suggested by Easterly and Levine (1997), ethnic polarization leads to 
political stalemate, state failure and slow business expansion due to uncertainty 
and high risk. In nations where sharp ethnic rivalries remain unmitigated by a 
strong sense of nationhood, it stands to reason that economic performance might 
languish as important (generally state-provided) requisites fail to materialize. 


Discussion 


This study demonstrates that a sociological account of economic growth is plausible 
and empirically sustainable even when considering competing models. Although 
we have confirmed that age-dependency retards economic growth in the short-term, 
the effect of historical population density makes it relatively clear that, in keeping 
with classical sociological theory, the long-run consequences of population growth 
for societal advancement have been positive rather than negative. Historical popu- 
lation density clearly exerts a robust, positive effect on contemporary economic 
growth rates net of war, colonial status, export regime and investment patterns. 
By most interpretations of Malthusian economic theory, high population density 
should be associated with adversity, poor labor absorption, lagging capital forma- 
tion and poor macroeconomic performance, and perhaps it has been under some 
historically-unique circumstances. Nevertheless, our analyses demonstrate quite the 
opposite is generally true. As a form of societal investment, long-run population 
increase forces adaptive differentiation and interdependency on societies,
complexities that lead to higher surpluses and socio-spatial characteristics that predispose
these societies to accelerated integration into the modern world. 

Another element of our model answers the perplexing question of why a reduc- 
tion in fertility might be beneficial in the short-term but disastrous over time. Rapid 
growth in the dependent population may retard economic growth, but it probably 
does not actually impoverish a society (i.e., it will not zero-out growth, but only slow 
it down). If a society can hold its own economically during its baby booms, then 
it stands a good chance of ratcheting up its economic activity when these children 
finally enter the labor force — a demographic windfall and ratchet effect (Crenshaw et al. 
1997). The discussion of population growth’s influence on economic growth should 
therefore revolve around age-specific population growth and attained population 
size/density. Such dynamics will dramatically increase in importance as East Asia, 
North America and Europe exhaust their adult population growth potential and 
face swelling ranks of the elderly, the flip-side of the typical Third World situation. 

From a strictly sociological point of view, the robust relationships between eth- 
nic dualism, sociolinguistic integration/assimilation and economic growth may 
hold greater interest. While postmodern views of race and ethnic relations run the 
gamut from condemnation of race as a social construct to glorification of racial 
consciousness and identity, gesellschaft simply may not be as good as gemeinschaft 
in encouraging social cohesion and cooperation. It stands to reason that the course 
of rapid social change might be easier for those countries that retain vestiges of 
primordial solidarities (i.e., common cultural understanding, a sense of common 
destiny, and/or phenotypic homophily). If social trust, cooperation, mutual un- 
derstanding and “fellow-feeling” are important to economic change, it is logical 
to view ethnic and linguistic variation as a potential liability rather than as an asset 
in a nation’s quest for economic success. 

Our results also suggest that a unifying or dominant language can partially 
bridge the problems associated with heterogeneity. Obviously, the removal of 
language barriers will generally improve a nation’s business climate and market 
efficiencies. However, we find that this benefit accrues only to nations with an 
overwhelmingly dominant language.

These results also shed light on modernization theory and, to a lesser extent, 
neoclassical economics. Modernization theory has accumulated decades of anoma- 
lies it has difficulty explaining. Why have Pacific Rim countries made such great 
economic strides while many of the more developed parts of Latin America have 
languished? How can countries such as Taiwan combine rapid economic growth 
with low levels of income inequality? In our view, these anomalies may spring from 
improperly specifying the “starting line” and from overlooking socio-demographic 
dynamics. Rostow (1960) notes that a society's initial economic conditions govern 
its future industrialization, but most modernization theorists have poorly specified 
these initial conditions. Understanding a country’s demographic and technologi-
cal history allows us to do so, at least in a rough fashion.

This research anchors socio-demographic effects in classical and contemporary
macro-sociological theory. Over the years, studies of economic growth and demo- 
graphic change have become disconnected from earlier traditions in social science. 
While focusing on the “proximate determinants” of economic growth such as in- 
vestment or technological innovation may be helpful for public policy, pursuing 
such explanations leads to neglect of more fundamental dynamics that give rise to 
investment and innovation — how social environments shape human motivations. 
Rooting demographic-ecological variables in abstract evolutionary theory helps to 
reset the focus on the master processes of differentiation and interdependency which 
are, at their most basic level, humanity's adaptation to environmental challenges. 


Notes 


Country data used in the sample: Albania, Algeria, Angola, Argentina, Australia, 
Austria, Bangladesh, Barbados, Belgium, Bolivia, Botswana, Brazil, Belize, Bulgaria, 
Burundi, Cambodia, Cameroon, Canada, Cape Verde, Central African Republic, Sri 
Lanka, Chad, Chile, China, Colombia, Comoros, Republic of the Congo, Congo 
Democratic Republic, Costa Rica, Cuba, Cyprus, Benin, Denmark, Dominican 
Republic, Ecuador, El Salvador, Ethiopia, Fiji, Finland, France, the Gambia, Ghana, 
Greece, Guatemala, Guinea, Guyana, Haiti, Honduras, Hungary, Iceland, India, 
Indonesia, Iran, Ireland, Israel, Cote d'Ivoire, Jamaica, Japan, Jordan, Kenya, 
Republic of Korea, Lebanon, Lesotho, Madagascar, Malawi, Malaysia, Mali, 
Mauritania, Mexico, Morocco, Mozambique, Namibia, Nepal, Netherlands, New 
Zealand, Nicaragua, Niger, Nigeria, Pakistan, Papua New Guinea, Paraguay, Peru, 
Philippines, Poland, Portugal, Guinea-Bissau, Romania, Rwanda, Senegal, Sierra 
Leone, Singapore, Vietnam, South Africa, Zimbabwe, Spain, Sweden, Syria, Thailand, 
Togo, Trinidad & Tobago, Tunisia, Turkey, Uganda, Egypt, United Kingdom,
Tanzania, United States, Burkina Faso, Venezuela and Zambia.


Given that no nation conducts a registration system on an annual basis, yearly 
population and age-structure values are estimates. Personal communication with 
the World Bank determined that these estimates are generated using fertility and 
mortality data applied to a baseline population figure frequently derived from 
household surveys, suggesting that the estimates are well-grounded in extant 
empirical information. More importantly, GDP or other non-demographic variables 
(that might compromise our multivariate analyses) have not been used in the creation 
of these estimates (Bos 2005). 


Estimates of historic populations are inherently problematic. While some have
severely criticized the intricate and often opaque methodologies used to create such
estimates (e.g., Durand 1977), most scholars concede that current estimates are
probably as close as we can come to the “truth” given the dearth of historical data
(Caldwell and Schindlmayr 2002). Regardless, we were able to roughly replicate our
findings using population density measured in 1965. The major difference is that
significance levels are weaker (.1 or less), which is to be expected given the temporal 
lags needed between density and development suggested by ecological theory. 


Settler Colonies include Argentina, Australia, Canada, Chile, Costa Rica, Ireland, New 
Zealand, South Africa and the United States. All colonies are colonies of Western 
Europe and include: Argentina, Australia, Canada, Chile, Costa Rica, Ireland, New 
Zealand, South Africa, United States of America, Bangladesh, Cambodia, Sri Lanka,
India, Indonesia, South Korea, Malaysia, Philippines, Singapore, Vietnam, Algeria,
Angola, Bolivia, Botswana, Brazil, Burundi, Cameroon, Central African Republic,
Chad, Colombia, Congo, Zaire, Cuba, Cyprus, Benin, Dominican Republic, Ecuador,
El Salvador, Fiji, The Gambia, Ghana, Greece, Guatemala, Guinea, Guyana, Haiti,
Honduras, Hungary, Israel, Cote d’Ivoire, Jamaica, Jordan, Kenya, Lebanon, Lesotho,
Madagascar, Malawi, Mali, Mauritania, Mexico, Morocco, Namibia, Nicaragua, Niger, 
Nigeria, Pakistan, Paraguay, Peru, Poland, Guinea Bissau, Rwanda, Senegal, Sierra 
Leone, Zimbabwe, Syria, Togo, Trinidad & Tobago, Tunisia, Uganda, Egypt, Tanzania, 
Burkina Faso/Upper Volta and Zambia. The countries in the reference category 
(colonizers or non-colonies) are: China People’s Republic, Japan, Nepal, Thailand, 
Albania, Austria, Belgium, Bulgaria, Denmark, Finland, France, Iran, Netherlands, 
Portugal, Romania, Spain, Sweden, Turkey and the United Kingdom.


This article examines spatial relationships between environmental hazards (i.e., pork
feed lots, brick kilns, final assembly plants and a rail line) and markers of social mar-
ginality in Ciudad Juarez, Mexico. Juarez represents an opportunity for researchers
to test for patterns of injustice in a recently urbanizing metropolis of the Global 
South. We use spatial-econometric modeling to predict the four unique hazard 
variables and a composite hazard variable using socio-demographic variables at the 
neighborhood level. Lower class and higher percentages of children and migrants 
were statistically significant predictors of composite hazard density. These results 
align with previous studies in the North. However, disaggregating these results by 
hazard type reveals important and counterintuitive differences in groups at risk 
based on the market-orientation of the hazard (i.e., domestic vs. transnational) and 
its location within the urban structure. 


Ciudad Juarez is a growing and industrializing city on Mexico's northern border. 
Located just across the Rio Grande/Bravo from El Paso, Texas, it hardly resembles 
its American sister, despite being a relatively well-off Mexican city. Juarez is home 
to a variety of hazards, including those produced by manufacturing, brick making, 
chemical production and industrial hog farming. For researchers, Juarez represents 
an opportunity to test for general patterns of environmental injustice in an urbaniz- 
ing metropolis in the Global South. Environmental injustice occurs when environmen-
tal hazards are disproportionately located in socially marginalized (e.g., poor, racial/
ethnic minority) neighborhoods (Bolin et al. 2002; Pellow 2007). Environmental 
injustice researchers studying technological hazards in the Global North have—for 
the most part—found that socially marginalized persons (e.g., the poor) live closer 
to technological hazards (e.g., factories). This article tests those findings across 
a range of hazards. The term Global South is employed here, following Pellow 
(2007), as a social rather than strictly geographic term meant to refer to politically 
and economically marginalized communities or nations such as those found in the
“developing” or “third” world. Global North refers to more privileged communities,
such as those in the “developed” or “first” world (Pellow 2007). 


Literature 


The majority of environmental justice research and nearly all of the quantitative 
studies have been conducted in developed countries of the Global North. In the 
developing world, investigations usually focus on social movements for change 
(Carruthers 2007; Widener 2007). The first quantitative/spatial environmental 
justice studies were conducted in the United States, and the methods have since 
spread to Canada, New Zealand and countries of the European Union. In the 
United States, the majority of studies have found that poor and racial/ethnic 
minority people are disproportionately affected by environmental hazards (Sicotte 
and Swanson 2007; Downey 2006; Pastor et al. 2002); a few have not (Anderton 
et al. 1994; Brown et al. 1997). A variety of hazards have been explored in these 
studies including industrial facilities regulated by the Environmental Protection 
Agency’s Toxic Release Inventory, toxic waste treatment, storage and disposal 
facilities, large generators of hazardous waste, airport noise, heat, industrial-scale 
meat production, and criteria air pollutants (Bolin et al. 2002; Cutter et al. 2001; 
Sobotta et al. 2007; Harlan et al. 2006; Buzzelli and Jerrett 2004; Wilson et al. 
2002; Grineski et al. 2007). In addition to addressing a variety of hazards, U.S. 
researchers have found patterns of injustice related to myriad measures of social 
marginality: children (Perlin et al. 2001), the elderly (Bae et al. 2007), home rent- 
ers (Grineski et al. 2007), single mothers (Downey and Hawkins 2008), public 
housing residents (Cutter et al. 2001) and immigrants (Hunter 2000). While 
many U.S. studies have focused on race, outside of the United States, the major- 
ity of investigations have focused on class-based inequities (Pearce et al. 2006; 
Kingham et al. 2007; Gouldson 2006; Jerrett et al. 2001). 

The application of quantitative environmental justice research methods to 
countries of the Global North outside the United States has extended the field 
in many important ways, particularly in terms of testing the generalizability of 
linkages between social marginality and the hazardousness of place. Nonetheless, 
it is not known if similar patterns of relationships between technological hazards 
and marginalized people will be found in developing contexts. Certainly, qualita- 
tive accounts of severe environmental injustice have been documented in Latin 
America, such as the case of the 5,000 low-income Argentine residents living next 
to a Shell refinery (Auyero and Swistun 2009), but few spatial-quantitative stud- 
ies have been conducted. Carruthers (2008:5-6), the author of an edited volume 
on environmental justice in Latin America, claims that “the socioeconomic and 
distributional assertions of U.S. environmental justice will often not appear or 
hold the same way in other contexts.” He hypothesizes that lower-, middle- and
upper-class people are similarly burdened because hazards are widely distributed
within Latin American cities.

In the Global South, most research studies have focused on the transfer of risk
from the more powerful North to the less powerful South. Because of an unequal 
exchange relationship, the North has been intimately involved with industrializa- 
tion and placement of technological hazards in the South (Adeola 2000), includ- 
ing industrial development in Juarez (Frey 2003). Southern states are saddled with 
environmental burdens because they are marginalized within the global political 
economic order and represent a path of little resistance (Pellow 2007). In addition, 
they typically lack adequate development and/or enforcement of environmental 
policy and can be desperate for the short-term economic gains that accompany 
corporate-led industrialization, the toxic waste trade, and the transnational ap- 
propriation of domestic resources (Adeola 2000). 

Therefore it is not surprising that the majority of environmental justice research 
in the South has focused on social movements against this global transference of 
risk. Recently, the building of an oil pipeline by community groups in Ecuador 
(Widener 2007) and the struggles for environmental justice in the post-colonial 
context of India (Williams and Mawdsley 2006) have been chronicled by re- 
searchers. Along the U.S.-Mexico border, social movements have developed in 
response to the exploitation of Mexican resources (environmental and human) by 
the United States. In Baja California Norte, Carruthers (2007) examined how a 
California energy crisis influenced the development of energy infrastructure in the 
peninsula and how locales used environmental justice as a framework for resisting 
new power plants and terminals for liquefied natural gas. He also described how 
local residents mobilized for the clean-up of an abandoned maquiladora (i.e., final 
assembly plant) in Tijuana (Carruthers 2008). 

To our knowledge, only a few quantitative environmental justice studies have
been conducted in the Global South, among them a few along
the U.S.-Mexico border. In Tijuana, results suggest that maquiladoras are located
in more densely populated areas with higher concentrations of children; unfor-
tunately, the authors relied on qualitative inspection of mapped patterns as opposed
to statistical analysis of the spatial data (Kopinak and Barajas 2002). The study
also utilized a survey of nearly 800 maquiladora workers and found that newer 
(usually migrant) workers tended to live closer to the plant and thus be exposed 
to greater risk (Kopinak and Barajas 2002). In Juarez, results of a quantitative 
study suggested that maquiladoras are located in the more developed areas of the 
city that house a generally affluent populace, but within these areas, lower class 
neighborhoods are most proximate (Grineski and Collins 2008). 

Our study extends this nascent work on quantitative analysis of human-hazard 
relationships in the South. It expands the preliminary work of Kopinak and Barajas (2002)
and Grineski and Collins (2008) by including additional markers of social mar- 
ginality along with more traditional environmental justice variables, using spatial 
regression methods, and considering multiple hazard indicators differentially con-
nected to the transnational economy. Considering the existing environmental
justice literature, it is obvious that social marginality and environmental hazards
are related across a variety of contexts in the United States, Europe, Canada, New
Zealand and Australia. It is also evident that powerful countries of the Global 
North play a role in the technological risks present in the South. However, it is less 
clear whether patterns typically found in the North will be uncovered in the South. 

Due to their divergent trajectories of urbanization, the socio-spatial structures of 
cities in the Global South bear only faint resemblance to those in the North (Davis 
2006; Gottdiener and Hutchison 2006). Today, urbanization is occurring more 
rapidly and on a larger scale in the South than during the prior urban explosions in 
the North. In many cases, urbanization in the South is closely tied to industrializa- 
tion, which is driven by the international division of labor (DeOliveira and Roberts 
1996). While there is less quantitative literature on the subject, some qualitative 
accounts suggest that urbanization outside the North may influence fundamentally 
different social patterns of exposure to environmental hazards (Carruthers 2008). 

Informal development predominates in cities of the Global South, where the 
poor commonly inhabit the low-rise peripheries of cities and the affluent reside in 
formally-designed central city locations endowed with infrastructure (e.g., piped 
water, sewage treatment, paved roads and electricity). In the urban South, formal 
housing markets rarely supply more than 20 percent of the new housing stock 
(Davis 2006). During Mexico’s economic crises in the 1980s, the formal home 
market satisfied little more than a third of the demand for housing. Still today, 
only the middle and upper classes are able to purchase homes in the formal sector 
(Pezzoli 1995). In contrast, the lower classes, which receive a small fraction of 
the federal housing assistance available, are often forced to construct their own 
dwellings in peri-urban settings (Davis 2006). While Latin American elites may 
be tempted by the lure of the less-polluted urban fringes like their U.S. counter- 
parts, the exodus to the suburbs has not occurred because of insufficient state and 
market investment in infrastructure and the high density of squatters in these areas, 
which impose untenable costs (e.g., time and money for transportation) while 
diminishing the aura of exclusivity (DeOliveira and Roberts 1996). In sum, the 
divergence in urban socio-spatial structures between northern and southern cities 
may influence differences in populations at risk to hazards. 


Study Area


This study took place in Juarez (Chihuahua), Mexico’s fifth largest city. Given its
proximity to the United States, Juarez exhibits dissimilarities with southern and
central Mexican cities. However, it is similar to other Mexican border cities (e.g., 
Tijuana) and may indeed be similar to other industrial cities in China or Southeast 
Asia, given their similar position as export processing zones in the global economy. 
Juarez is home to nearly 1.5 million people, and is rapidly growing as migrants 
arrive in search of job opportunities in the manufacturing sector. Population more 
than doubled between 1980 and 2005, from 567,365 to 1.3 million (Consejo
Nacional de Población 2005). A lower-income segment of migrants can be termed
the floating population; members of this group migrate to Juarez with the intent
of pursuing immediate livelihood opportunities rather than making the city their
permanent home. While Juarez has a reputation for being a wealthy Mexican city, an
unpublished study conducted by El Colegio de la Frontera Norte in 2006 found
that the median household monthly income for Juarez was $9,890, while the 
statistic for Mexico was $11,460 (pesos converted to 2007 US $). It also has a host 
of environmental problems, including flooding, drought, groundwater depletion, 
water pollution, air pollution and toxic waste (Liverman et al. 1999). 

The United States has been involved in the industrialization of Mexico's north- 
ern border. When the U.S. Bracero Program (1942-1964) —which legalized the 
migration of Mexican workers during WWII to help with labor shortages—ended, 
several hundred thousand Mexican workers were relocated to Mexican border cit- 
ies. The Mexican government then created the Border Industrialization Program 
in order to alleviate unemployment and overcrowding in border cities (Frey 2003; 
Liverman and Vilas 2006). This program took off and today there are more than 
3,500 maquiladoras in the region. These mostly U.S.-owned transnational corpo- 
rations are allowed to import needed equipment and raw materials tax free. They 
now employ 25 percent of the Mexican manufacturing labor force (Frey 2003; 
Liverman and Vilas 2006). Because it reduced tariff barriers to trade, the passage of 
the North American Free Trade Agreement in 1994 enabled continued industrial 
growth along the Mexican side of the border, especially in Juarez (Liverman et al. 
1999). While maquiladoras are the central component of Juarez's economy, the
city also relies on employment in brick production (primarily for domestic use), 
retail and services, agriculture and domestic meat production. 


Data & Methods 


Data for this quantitative analysis came mainly from the municipal government, 
Instituto Municipal de Investigación y Planeación (IMIP) in Juarez, and from the Mexican
census, Instituto Nacional de Estadística y Geografía. The unit of analysis is the AGEB
or Área Geoestadística Básica, the finest spatial unit in the Mexican census, most
similar in size and population to census block groups in the United States. We 
refer to this unit as a neighborhood. 

At the AGEB level, we created four independent variables based on previous 
studies and what was available from the Mexican census. These four variables 
represent critical indicators of social marginality in the city. First, we consider 
the effect of social class. Social class, usually neighborhood income, has been
considered in numerous studies conducted in the North (see Szasz and Meuser
1997; Mohai and Saha 2006). Therefore, we created the neighborhood social class
variable by combining in one factor the following variables: mean neighborhood
income, percent making five times the minimum wage, percent with a car, per-
cent with a personal computer and mean level of education. This combined vari-
able is robust because it combines multiple facets of social class into one measure:
income, education and possession of goods. Second, we explore the relationship
between hazards and children. Children’s exposure to hazards has received recent
attention in environmental justice studies (Houston et al. 2006; Chakraborty
and Zandbergen 2007). Researchers have found that areas in the Global North
with higher percentages of children have an increased presence of environmental
hazards (Chaix et al. 2006; Mitchell and Dorling 2003; Downey and Hawkins
2008). The proportion children variable was calculated by dividing the number of
children (under 15 years of age) in each neighborhood by the total population
in each neighborhood. Third, we consider the relationship between proportion
of migrants in a neighborhood and proximity to hazards. The proportion migrant
variable is defined as the proportion of residents not residing in Juarez five years
prior to the 2000 census. This variable is important in the Juarez context be-
cause the city is currently undergoing population growth driven by in-migration.
The population in the city in the year 2000 was 10 times larger than it was in
1950 (Rubio and Guerra 2008), and many of the arrivals have been impover-
ished people from rural areas seeking work in the global marketplace. Durango
(Chihuahua’s southern neighbor), Coahuila (its eastern neighbor), Zacatecas
and Veracruz send the most migrants (see Table 1). Post-NAFTA, rural Mexico
has continued to suffer economically, which has accelerated migration to border
cities because of the employment opportunities there (Tiano 2006). Thus, a
variable capturing the floating population is especially relevant in a globalizing
city such as Juarez.


Table 1: Top Migrant Sending States to Juarez, Chihuahua

State                      Number    Percent
Durango                   120,074         28
Coahuila                   76,458         18
Veracruz                   45,225         11
Zacatecas                  42,971         10
United States              30,566          7
Distrito Federal           21,056          5
Other states in Mexico     84,341         20

Note: “Other States” are: Aguascalientes, Baja California Norte and Sur, Campeche,
Chiapas, Colima, Estado de México, Guanajuato, Guerrero, Hidalgo, Jalisco, Michoacán,
Morelos, Nayarit, Nuevo León, Oaxaca, Puebla, Querétaro, Quintana Roo, San Luis
Potosí, Sinaloa, Sonora, Tabasco, Tamaulipas, Tlaxcala and Yucatán.
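
The proportion children and proportion migrant variables are simple ratios of a group count to the neighborhood's total population. A minimal sketch of that calculation follows; the field names and the counts are hypothetical placeholders, not census values, and returning None for an unpopulated neighborhood is an assumption of this sketch rather than a step described in the text.

```python
def proportion(part, total):
    """Share of a neighborhood's residents in a group; None if nobody lives there."""
    if total <= 0:
        return None
    return part / total


# Hypothetical AGEB records with made-up counts (not census data).
agebs = [
    {"id": "A", "population": 2000, "under_15": 700, "not_resident_5y_ago": 300},
    {"id": "B", "population": 800, "under_15": 160, "not_resident_5y_ago": 40},
]

for ageb in agebs:
    # Children: residents under 15 years of age divided by total population.
    ageb["prop_children"] = proportion(ageb["under_15"], ageb["population"])
    # Migrants: residents who did not live in the city five years before the census.
    ageb["prop_migrant"] = proportion(ageb["not_resident_5y_ago"], ageb["population"])
```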

Fourth, we consider the level of formal development in the housing stock. 
This variable has not been used in past studies of the North but it is relevant here
given Juarez’s historical context and position in the global economy. Variation in
residential development and residential services exists across Juarez’s neighborhoods
as many residential areas have developed through decades of informal settlement
(Ward 1999). Home characteristics range from wood pallet walls and dirt floors
in neighborhoods with dirt roads (see Figure 1) to well-constructed cinderblock
homes on a paved grid of streets (see Figure 2). To operationalize formal develop-
ment, we combined the percent of homes with a strong roof, strong walls, floors,
public sewer lines, piped water indoors and hot water heaters to create a variable
called formal development (Cronbach's alpha = .910) using the same method as we
did to create the social class factor.


Figure 1. Residential Landscapes in Juarez: Less-developed Residential Area
near a Porcicultura

Figure 2. Residential Landscapes in Juarez: Developed Residential Area
abutting a Maquiladora
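
The reliability figure reported for formal development is a Cronbach's alpha. For readers unfamiliar with the statistic, the sketch below applies the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of the summed scale), to a small set of invented scores. It illustrates the statistic only; it does not reproduce the factor procedure or the data used in this study.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for k items.

    `items` is a list of k lists; each inner list holds one item's values
    (e.g., percent of homes with piped water) across the same neighborhoods.
    """
    k = len(items)
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Score of the summed scale for each neighborhood.
    totals = [sum(values) for values in zip(*items)]
    item_variance_sum = sum(variance(values) for values in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


# Invented values for three housing items across four neighborhoods.
housing_items = [
    [20.0, 45.0, 70.0, 95.0],
    [25.0, 40.0, 75.0, 90.0],
    [15.0, 50.0, 65.0, 99.0],
]
alpha = cronbach_alpha(housing_items)
```

Values close to 1 indicate that the items move together closely enough to be treated as a single scale.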

In terms of the dependent variables, we consider four unique indicators that
capture a range of industrial activities in Juarez including the domestic, export and 
transportation sectors and a fifth variable that is a composite measure combining 
the other four. First, we selected maquiladoras (n = 345), which are comparable to
TRI facilities, the most commonly employed data source in environmental justice 
research in the United States (e.g., Bolin et al. 2000; Mennis 2005). Like TRI fa- 
cilities, they are associated with negative health impacts and therefore theoretically 
place proximate residents at risk. Maquiladoras generate toxic chemical wastes (i.e., 
acids, heavy metals) that are sometimes improperly disposed of or spilled (Sanchez 
1990). Chemicals generated contribute to smog and contaminate aquifers that 
residents rely on for water (Heyman 2007). Electronics maquiladoras—the largest 
and fastest growing sector of maquiladora employment (Dallas Federal Reserve 
Bank 2006) —pose a serious threat to human health and the environment because 
of the toxins used (e.g., hydrocarbon solvents, organochlorides) (Sanchez 1987; 
Quintero and Romo 2001). Maquiladora data were acquired from IMIP for 2004. 
These data, much like TRI data, are self-reported by the maquiladoras. In this case, 
location was reported to the primary association of maquiladoras (i.e., Asociación
de Maquiladoras-Asociación Civil de Juárez) and then to municipio authorities, who
then visited each site to record its location using a global positioning system receiver. 

Second, we use brick kilns (n = 290), or ladrilleras. Ladrilleras have not been 
a focus of environmental justice studies before but they are a key source of en- 
vironmental risk in Juarez. Brick kilns represent important sources of chronic 
hazardous pollution that also pose risks of acute incendiary accidents (Romo et al. 
2004). Romo et al. (2004) estimate that there are 369 brick kiln burns per month 
in Juarez and approximately 391 kg of air contaminants are released during every 
burn. While the majority of bricks are sold in Mexico, some are exported to the 
United States. This dataset was created by university researchers at El Colegio de
la Frontera Norte (COLEF) Ciudad Juarez in conjunction with IMIP. COLEF researchers,
including the third author, obtained IMIP’s directory of all brick producers initially
created in 1997 and 1998. Then, in 2002 and 2003, COLEF visited each site to
verify the location of the facility, speak with the operator, and record the location 
using a GPS; they also identified additional sites, which were added to the database. 

Third, we utilize data on pork feed lots (n = 297), or porciculturas. Industrial- 
scale hog production has been studied from an environmental justice perspective 
in the Global North (Donham et al. 2007). Researchers working in the southeast- 
ern United States have found that poor and minority communities are at increased 
risk due to airborne effluent (Mirabelli et al. 2006), fecal waste pit flooding (Wing et
al. 2002) and presence of industrial hog operations (Wilson et al. 2002). Previous
research on feedlots in the United States has demonstrated that they pose chronic
and acute environmental and health hazards to proximate residents (Wing et al.
2002; Donham et al. 2007). In Juarez, pork is used for domestic purposes only.
This dataset was created by COLEF and IMIP. The municipal government created
a list of porciculturas by visiting areas where farms were known to be located. This
list was verified and augmented by the third author, who conducted field recon-
naissance throughout the city to identify additional porciculturas for a separate
project. Her additions were incorporated into the municipal database. The third
author’s field visits to all farms were conducted during 2003 through 2005, and
all locations were recorded using a GPS receiver.


Figure 3. Locations of Porciculturas, Ladrilleras, Maquiladoras and the Rail
Line in Juarez

We examine the rail line as the fourth hazard type. Rail lines are not usually 
considered in environmental justice studies (see only Kruize et al. 2007); however,
rail hazards are especially relevant in the Juarez context because one of the world’s
most dangerous chemical plants (i.e., Solvay) ships hydrofluoric acid and related
substances through Juarez into the United States using the rail line (Mackler
2000). Previous research has shown that railroads produce unhealthy levels of
noise pollution (Marks et al. 2008) and pose risks of acute hazardous accidents
(Oggero et al. 2006). Rail data were acquired through IMIP. Each of the four 
unique hazards is depicted in Figure 3. 

To represent the hazards, we used the hazard density index (HDI) method (Bolin
et al. 2002). To create the HDI scores using this method, we used ArcGIS 9.2 software
and an additional analysis tool (i.e., Polygon in Polygon Analysis) developed by
Beyer (2004). We will explain how it was done for ladrilleras, but the same method
was followed for each hazard. We began the process by drawing a 1-kilometer
buffer around each ladrillera. We then intersected the buffers and AGEB bound-
aries, and calculated the area of each portion of the intersected buffers. Next, we
summed the areas of all portions of buffers falling within each neighborhood;
thus neighborhoods intersected by more buffers have higher HDI scores. Lastly,
that score was divided by the area of each spatial unit to create a unique ladrillera
hazard density score for each neighborhood. The same process was repeated for
each of the hazard variables (see Figure 4); z-scores of these variables were used in
the analysis. We then summed the z-scores of the four hazards into one variable
called the composite hazard density. This variable allows us to provide a more
general perspective on social relationships to hazards in Juarez.
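
The arithmetic of the hazard density index can be sketched as follows. This is a minimal illustration, not the authors' ArcGIS workflow: it assumes the buffer-by-neighborhood intersection areas have already been computed in a GIS, and the function names, the toy areas and the use of the population standard deviation for the z-scores are all assumptions made for the example.

```python
from statistics import mean, pstdev


def hazard_density(buffer_piece_areas, unit_area):
    """HDI for one neighborhood: summed area of every buffer portion
    falling inside the unit, divided by the unit's own area."""
    return sum(buffer_piece_areas) / unit_area


def z_scores(values):
    """Standardize scores (using the population SD is an assumption)."""
    mu = mean(values)
    sigma = pstdev(values)
    return [(v - mu) / sigma for v in values]


def composite_density(z_by_hazard):
    """Sum z-scores across hazards, neighborhood by neighborhood."""
    return [sum(zs) for zs in zip(*z_by_hazard)]


# Hypothetical toy data: three neighborhoods, areas in square kilometers.
unit_areas = [1.0, 2.0, 0.5]
# Each inner list holds the areas of the buffer portions inside one unit.
ladrillera_pieces = [[0.5, 0.25], [], [0.5, 0.5, 0.25]]
rail_pieces = [[], [1.0], [0.25]]

ladrillera_hdi = [hazard_density(p, a)
                  for p, a in zip(ladrillera_pieces, unit_areas)]
rail_hdi = [hazard_density(p, a) for p, a in zip(rail_pieces, unit_areas)]

# Composite hazard density: sum of the per-hazard z-scores.
composite = composite_density([z_scores(ladrillera_hdi), z_scores(rail_hdi)])
```

Because every overlapping buffer portion is counted separately, a neighborhood crossed by several buffers can score above 1, which is what lets the index distinguish heavily burdened units from those touched by a single buffer.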

Because we do not know the distance at which health risks cease to be associ-
ated with each hazard type, we have assumed, based on our choice of buffer, that
people living within 1 kilometer of each hazard are at risk. While there are health
risks associated with proximity to the hazards, we do not know the exact spatial
dimensions of risk for each hazard, which would vary for chronic (air pollution)
vs. acute (industrial accident) risks. Our choice of a static buffer also neglects the
temporality of risk: ladrilleras, for example, conduct fewer burns during monsoon
season than when the weather is dry (Romo et al. 2004). Our method also neglects
workplace hazards faced by employees. However, this buffer size is a more con-
servative estimate of the at-risk population than is a larger buffer, such as 1 mile
(see Downey 2005; Mohai and Saha 2006 for a useful discussion about imposing
boundaries on risk).

Our analytical approach relies on two sets of multivariate analyses. Beforehand,
we deleted the 81 neighborhoods with fewer than 500 people from the dataset,
resulting in an N of 401. This was done to create more stable independent
variables (e.g., meaningful values). For example, 19 AGEBs had no population
and 10 AGEBs had 10 or fewer people. For the multivariate analyses, we begin by
running the first set of models in which class, children and migrant are used to
predict the five dependent variables. Then, we add formal residential develop-
ment in a second set of five models.

We ran spatial econometric models using the open source software GeoDa fol- 
lowing Pastor et al. (2005), Grineski and Collins (2008) and Chakraborty (2009). 
To begin model specification, we ran five OLS regression models, one for each 
dependent variable. We then tested residuals (for the first set of models using three 
independent variables) for spatial autocorrelation using the Univariate Moran’s I 
test following Chakraborty (2009). Spatial autocorrelation “refers to the tendency 
of variables to be influenced by their neighbors, a fact that will cause the errors 
in the regression analysis to not satisfy the independence conditions generally 
associated with ordinary least squares (OLS) regression.” (Pastor et al. 2005:134) 
We found that all five models in the first set had spatial autocorrelation in the
residuals; therefore, our data did not meet the assumptions of OLS regression
models.

Figure 4. Neighborhood HDI Scores: Porciculturas, Ladrilleras, Maquiladoras and
the Rail Line
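
The residual check described above rests on the Moran's I statistic, which can be sketched in a few lines. This is a minimal illustration of the statistic only: GeoDa additionally reports a permutation-based pseudo p-value, which is omitted here, and the function name, the toy residuals and the four-unit neighbor structure are hypothetical.

```python
def morans_i(values, weights):
    """Global Moran's I for a list of values and an n x n weights matrix.

    I = (n / S0) * sum_ij(w_ij * d_i * d_j) / sum_i(d_i ** 2),
    where d_i is the deviation from the mean and S0 is the sum of all weights.
    """
    n = len(values)
    mu = sum(values) / n
    dev = [v - mu for v in values]
    s0 = sum(sum(row) for row in weights)
    cross = sum(weights[i][j] * dev[i] * dev[j]
                for i in range(n) for j in range(n))
    return (n / s0) * cross / sum(d * d for d in dev)


# Hypothetical example: four neighborhoods along a line, adjacent ones linked.
w = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
clustered = morans_i([1.0, 2.0, 3.0, 4.0], w)      # similar values adjoin
alternating = morans_i([1.0, -1.0, 1.0, -1.0], w)  # unlike values adjoin
```

Values near zero indicate no spatial pattern in the residuals; clearly positive values, as in the first toy series, indicate that neighboring units have similar residuals, which is the situation that violates the OLS independence assumption.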

Because we could not use OLS, we used the Lagrange Multiplier diagnostic tests
to suggest whether a spatial lag or spatial error model was best for each model
(Anselin 2005). Spatial lag models assume that spatial autocorrelation is only
present in the dependent variable (Chakraborty 2009), while spatial error models
assume that the independent variables exhibit spatial dependence (Pastor et al.
2005). In our case, the tests suggested that a spatial error model was best for the
porcicultura, ladrillera, rail and composite models, and that a spatial lag model
was best for the maquiladora model.

The third specification diagnostic offered by GeoDa is the multicollinearity
condition number, which flags regression results that may be unstable owing
to multicollinearity (Anselin 2005; Chakraborty 2009). Anselin suggests that a
condition index of 30 is indicative of serious collinearity problems. For the first
set of models, the condition index score was 1.7 and for the second set of models,
it was 3.1, indicating that collinearity problems were not present in either of the
sets of models.

Spatial econometric models are estimated by means of the maximum likelihood
method, and they require sparse spatial weights (Anselin et al. 2006), which are
calculated based on a set of neighbor relationships (Pastor et al. 2005). Defining
neighbors is a critical, but exploratory, part of spatial econometric modeling in
environmental justice research. We began the process of defining our neighbor
relationships based on Pastor et al. (2005). Their model of defining weights in
environmental justice research using a distance-based approach has been followed
by Grineski and Collins (2008) and Chakraborty (2009). Pastor et al. (2005)
created a distance-based weights matrix with the maximum distance for a
neighbor being 3,000 meters between neighborhood geographic centers. They used
this technique, instead of contiguity weighting (i.e., rook or queen method), because it
is more appropriate for data in square grids (Pastor et al. 2005). As in Pastor et al.
(2005), a distance of 3,000 meters also made sense for our data, given the
neighborhood size (mean area: 538,377 square meters) and spatial extent of the
city (32,000 × 17,000 meters).
We then ran the first set of spatial econometric models for each of the five depen-
dent variables, and tested the residuals for spatial autocorrelation to determine if
the neighbor distance (i.e., 3,000 meters) accounted for the spatial autocorrelation.
It did for ladrillera and rail (see Table 2 for Moran’s I coefficients), but not for the
other three. We next tried a 3,500-meter neighbor distance, and this removed the
spatial autocorrelation for maquiladora, porcicultura and composite density (see
Table 2 for Moran’s I coefficients). We then repeated this process for the second set
of models, and the tests suggested the same specifications and neighbor distances
for each dependent variable (see Table 3 for Moran’s I coefficients).
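
The distance-based neighbor definition described above can be sketched as follows. This is an illustrative sketch rather than GeoDa's implementation: the function names and the centroid coordinates are hypothetical, and row-standardizing the weights is an assumption that the text does not confirm.

```python
from math import hypot


def distance_band_weights(centroids, threshold):
    """Binary weights: units i and j are neighbors when their geographic
    centers lie no more than `threshold` meters apart."""
    n = len(centroids)
    w = [[0.0] * n for _ in range(n)]
    for i, (xi, yi) in enumerate(centroids):
        for j, (xj, yj) in enumerate(centroids):
            if i != j and hypot(xi - xj, yi - yj) <= threshold:
                w[i][j] = 1.0
    return w


def row_standardize(w):
    """Scale each row to sum to 1 (an assumption; rows with no
    neighbors are left as zeros)."""
    out = []
    for row in w:
        total = sum(row)
        out.append([v / total if total else 0.0 for v in row])
    return out


# Hypothetical neighborhood centers, in meters.
centroids = [(0.0, 0.0), (3000.0, 0.0), (6200.0, 0.0)]

w_3000 = distance_band_weights(centroids, 3000)  # links units 0 and 1 only
w_3500 = distance_band_weights(centroids, 3500)  # also links units 1 and 2
w_3500_std = row_standardize(w_3500)
```

Widening the band from 3,000 to 3,500 meters adds neighbor links, which is why re-testing the residuals at the larger distance can remove autocorrelation that the narrower band left in place.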


Hypotheses 


Despite Carruthers’ (2008a) assertion that all people are similarly burdened by 
environmental hazards in Latin American cities, we hypothesize that marginal- 
ized groups (lower class, children and migrants) will be at increased risk from 
technological hazards in Juarez based on the meta-patterns that have emerged 
through myriad studies in a variety of Northern contexts. Based on previous 
studies in Juarez (Collins et al. 2009) and the general layout of Global South 
cities (Davis 2006; Arreola and Curtis 1993), we hypothesize that higher levels of 
formal residential development will predict higher densities of maquiladoras, but 
lower densities of porciculturas, ladrilleras and rail hazard. 


Results 
First Set of Multivariate Results 


The first set of multivariate results is shown in Table 2. The first set of five hazard
models fit relatively well (R² = .28 for porcicultura density, .20 for ladrillera density,
.33 for maquiladora density, .29 for rail density and .24 for composite density). In
all five models, a spatial autoregressive coefficient (i.e., Lambda in the spatial error
models and Rho in the spatial lag model) is included as a predictor and it is statisti-
cally significant. This coefficient accounts for the misleading effect that spatially
correlated error terms would have if we used a classic OLS estimate instead.
When predicting porcicultura density, higher percentages of children and mi-
grants were significant predictors (p < .05); lower social class was significant at p
< .10. Lower class, as well as higher percentages of children and migrants, were
significant predictors (p < .05) for ladrillera density. For maquiladora density,
social class was a significant and positive predictor (p < .05). The coefficients for

Table 2: First Set of Regression Results

Variable        Coefficient    SE             p

Porcicultura Density (spatial error model, 3,500 m)
Constant        .146           .301           .628
Social Class    [illegible]    [illegible]    .088
Children        .136*          .064           [illegible]
Migrant         [illegible]    .053           .014
Formal Dev      -              -              -
Lambda          .859*          .058           .000
Moran’s I       .005                          [illegible]

Ladrillera Density (spatial error model, 3,000 m)
Constant        -.084          .247           .734
Social Class    [illegible]*   .079           .049
Children        .254*          .070           .000
Migrant         .174*          .058           .003
Formal Dev      -              -              -
Lambda          .819*          .062           .000
Moran’s I       .001                          .300

Maquiladora Density (spatial lag model, 3,500 m)
Constant        -.004          .041           .922
Social Class    [illegible]    [illegible]    [illegible]
Children        .057           .045           .203
Migrant         .054           .044           .215
Formal Dev      -              -              -
Rho             [illegible]    .055           .000
Moran’s I       -.006                         [illegible]

Rail Density (spatial error model, 3,000 m)
Constant        -.082          .306           .788
Social Class    -.197*         .075           .009
Children        -.019          .067           .780
Migrant         .087           .055           .113
Formal Dev      -              -              -
Lambda          .863*          .051           .000
Moran’s I       .013                          .103

Composite Density (spatial error model, 3,500 m)
Constant        -.102          .411           .804
Social Class    -.247*         .076           .001
Children        [illegible]    [illegible]    .001
Migrant         .244*          .055           .000
Formal Dev      -              -              -
Lambda          .694*          .047           .000
Moran’s I       -.006                         .420

Note: *p < .05; Lambda and Rho are the spatial autoregressive coefficients (SAC)
included in the spatial econometric models. Lambda is the SAC used with spatial
error models whereas Rho is the SAC used with spatial lag models.
Table 3: Second Set of Regression Results

Variable        Coefficient    SE      p

Ladrillera Density (spatial error model, 3,000 m)
Constant        -.080          .244    .743
Social Class    -.166          .099    .094
Children        .204*          .070    .000
Migrant         [illegible]    .062    .004
Formal Dev      .018           .098    .851
Lambda          .817*          .063    .000
Moran’s I       [illegible]            .341

Rail Density (spatial error model, 3,000 m)
Constant        -.064          .295    .828
Social Class    -.247*         .094    .009
Children        -.017          .067    .799
Migrant         .105           .058    .073
Formal Dev      .081           .093    .382
Lambda          .857*          .052    .000
Moran’s I       [illegible]            .152

Note: *p < .05; Lambda and Rho are the spatial autoregressive coefficients (SAC)
included in the spatial econometric models. Lambda is the SAC used with spatial
error models whereas Rho is the SAC used with spatial lag models.
percentages of children and migrants were positive but not significant (p = .20, p =
.22, respectively). When predicting rail density, lower social class was the primary
significant predictor (p < .05); percentage of migrants was positive but not
significant (p = .11). Lower class, as well as higher percentages of children and
migrants, were significant predictors (p < .05) of composite hazard density.


Second Set of Multivariate Results


For the second set of models (see Table 3), the fit improved for porcicultura density
(R² = .33) and for maquiladora density (R² = .35). It was unchanged for ladrillera
(R² = .20), rail (R² = .29), and composite (R² = .24) density. As in the first set of
models, the spatial autoregressive coefficient was statistically significant in all five
full models. When predicting porcicultura density, the addition of formal residential
development caused social class to become positive and significant. Children re-
tained its positive relationship, but the significance dropped to p = .06, and migrant
became insignificant. Development itself was a negative and significant predictor
of porcicultura density. In the model predicting ladrillera density, the addition of
formal development (not significant) caused social class to drop in significance
from p < .05 to p < .10 while remaining negative. Children and migrant remained
significant (p < .05). For maquiladora density, the addition of development caused
social class to lose its significance as its effect was subsumed by formal residential
development, which was a positive and significant predictor. Migrant became a
significant and positive predictor (p < .05) while children approached significance
(p = .14). When predicting rail density, lower social class remained a significant
predictor. However, the addition of formal development caused the percentage
of migrants to become more significant (p = .07) even though development was
not a significant predictor. For composite density, formal development was not
significant and class, migrant and children retained their direction and significance.


Discussion 


The results from this study in the Global South generally support the basic work- 
ing hypothesis (developed through studies in the Global North) that socially 
marginalized groups experience greater residential technological hazard exposure 
with a few key exceptions. Contradictory findings are explicable based on benefits 
that marginalized groups may receive from locating near a specific hazard and the 
hazard’s relationship to the global economy. Thus, we will explain the patterns 
discovered by investigating these considerations. Independent variables will be 
discussed before dependent variables. 

Related to the social class findings, the composite model demonstrates that
class is a negative predictor of hazard (i.e., as social class increases, hazard density
decreases), which is similar to research findings from the North (Pearce et al. 2006;
Kingham et al. 2007; Gouldson 2006; Jerrett et al. 2001; Grineski et al. 2007;
Perlin et al. 2001). But within the different hazard models, we see that relation-
ships between class and hazard are sometimes contingent on other independent
variables, as was the case for porcicultura and maquiladora density.

All significant relationships found between children and hazards—across the 
two sets of analyses—were positive. A relationship between hazards and children 
was also found in Sweden, Great Britain and the United States (Chaix et al. 
2006; Mitchell and Dorling 2003; Downey and Hawkins 2008), and in Tijuana 
(Kopinak and Barajas 2002). While percent children is not used as often in en- 
vironmental justice studies as is social class, it has been shown to be significant 
in studies conducted in several different northern contexts and in this study of 
a southern city. Despite defying easy explanation (Downey and Hawkins 2008), 
similarities in findings from several contexts suggest the disturbing conclusion that 
children may be bearing a disproportionate share of environmental hazards across 
the globe. This is of critical concern because children are more vulnerable to envi- 
ronmental conditions than are adults; they spend more time outside and, relative 
to body size, they have higher metabolic demands for oxygen (Landrigan 2001). 

Across hazard types, percent migrant was perhaps the most important predictor
of hazard density.³ While this study was not designed to uncover which came first,
migrants or environmental hazards, these results suggest that migrants and hazards
are co-locating. Certainly, Juarez is home to a large population of impoverished
rural residents looking for jobs. This is likely due to the availability (or the percep-
tions of availability) of employment opportunities offered by the land uses studied
here. Employment-driven migration to border cities has accelerated post-NAFTA
as rural Mexico has continued to suffer economically (Tiano 2006), making a vari-
able capturing the floating population especially relevant in a globalizing city like
Juarez. We argue that a variable representing the in-migrating population should
be a staple in future quantitative environmental justice studies in the Global South.

The formal development variable also proves to be important as it captures
another aspect of socio-economic status that may be more important in predicting
hazard than one’s income and possessions (i.e., social class in this study). While
formal development did not affect the rail, ladrillera or composite findings, it did
change the maquiladora and porcicultura findings. This may be because a specific
level of infrastructure is best for operations in these two cases, more so than in the
others. As large transnational factories, maquiladora operators can afford to pay
the local premium for land that is plugged in to the civil infrastructure needed
for operations. Maquiladoras demand electricity, paved roads and a reliable water
supply and thus co-locate with residential developments catering to more affluent
people who also demand these services. Porcicultura operators, on the other hand,
seek out inexpensive land to reduce production costs, which leads them to peri-
urban areas away from the more expensive, developed zones.

Next, each of the dependent variables will be discussed in turn. For porciculturas, 
a key difference is that they are more closely tied to domestic consumption and 
semi-rural land use than are the other hazards. However, rapid growth has meant
encroachment of residences into areas near porciculturas. When we did not account
for formal development, we found that higher percentages of migrants were as- 
sociated with porcicultura density. This likely reflects the process whereby migrants 
settle in less desirable, less developed areas. Then, when we added development to 
the model, the social class sign changed from negative to positive. This demon- 
strates that porciculturas are located in poorer parts of Juarez that are less developed 
and home to poorer quality housing. However, when we account for residential 
development, people of higher class live most proximate to this hazard. It is likely 
that this finding captures the owner-operators of the feedlots who are economically 
better off than the other semi-rural residents. These are also individuals likely to 
benefit from living near a pork farm (i.e., source of livelihood). Other residents 
also benefit from the feedlots because they are a source of protein and a sink for 
organic waste from maquiladoras, restaurants and private homes. However, the 
smells and soil contamination, which increase during seasonal floods and wind 
storms, represent a health risk for proximate residents. As the growth of Juarez
envelops the current network of porciculturas, it is likely that risks will be amplified
and porcicultura operators will have incentives (selling increasingly valuable land) to
relocate their operations in a leapfrog fashion beyond the residential fringe.

Second, maquiladoras are a unique case given their tight linkages with transna- 
tional production and export markets, and the results reflect this. Social class (set 
1) and formal development (set 2) were positive predictors of maquiladora density, 
and this is intimately connected to their transnational status. Maquiladoras are 
able to occupy the most valuable, formally developed land in Juarez because it 
is relatively inexpensive to foreign investors. This is made possible by unequal 
exchange relations between transnational corporations from the Global North 
and cities of the Global South. As a consequence, Juarez’s more affluent residents
live disproportionately closer to maquiladoras. At the same time, transnational
industrial linkages have been facilitated by Mexican state and market institutions
as they have worked to attract foreign investment to Juarez by providing the neces-
sary infrastructure such as road building and water/sewer line installation, and by
offering power over cheap labor through union control (Zeisel et al. 2006). This
investment has been largely focused in the core of the city at the expense of poorer
communities on the urban fringe (Arreola and Curtis 1993).

Interestingly, our analysis finds that migrants are disproportionately at risk
within these more-developed residential areas. This pattern meshes with results
from a survey of maquiladora workers in Tijuana: long-time workers lived farther
away from the plants than did new workers (Kopinak and Barajas 2002).
However, it is likely that only a small percentage of workers actually live near
their plants, and the migrants may be settling in these areas in hopes of finding
work. In the North, numerous studies have demonstrated that factories rarely
employ neighboring residents (e.g., Lerner 2006). While there has been less
work done on this in the South, this survey of Tijuana maquiladora workers
revealed that only 5.6 percent lived in the same colonia (neighborhood) as the
factories where they worked, and 14.8 percent lived in the colonia adjacent to
their factories (Kopinak and Barajas 2002).

While it is difficult to comprehensively measure the costs and benefits of the 
maquiladoras, some have argued that maquiladoras have not brought widespread 
development to Mexico (Frey 2003). The benefits of maquiladoras are distributed 
unequally with the transnational corporations controlling production and earning
profits. Within Mexico, the local elite and the state receive some economic benefits, 
while the Mexican people—especially those marginalized by gender, age and class, 
including maquiladora workers and colonia residents—bear the costs (Frey 2003; 
Peña 1997). Maquiladora jobs are low-wage positions with little room for advance-
ment accompanied by risks of workplace environmental exposures (Quintero and
Romo 2001; Williams and Homedes 2001; Moure-Eraso et al. 1997). Instead of 
serving as a vehicle to spur development in Mexico, some have asserted that they 
have remained an enclave industry that provides low-wage jobs but little else (Barry 
2008; Paus and Gallagher 2008). As Frey (2003:336) states, “a crucial fact remains: 
the maquiladora industry has had little impact on Mexico's economic development 
beyond the creation of jobs (many of which are unskilled, though this has begun to 
change somewhat) and increased revenues from exports.” Because of their size and 
profit-generating capabilities, maquiladoras present a largely missed opportunity for
the surplus value generated to be reinvested in health care, roads and schools that 
would benefit workers throughout northern Mexico. 

Third, rail density, the only transportation-based hazard included in this study, 
was strongly tied to lower social class. While there are no obvious employment 
benefits to living near the rail line, the noise likely keeps adjacent lands from 
being desirable places to live. This is further underscored by the final model in 
which percent migrant also becomes a positive predictor of rail density. It is likely 
that neighborhoods near the rail are low-income, transient neighborhoods, and 
residents face acute risks in cases of derailment because one of the world’s most 
dangerous chemical plants ships hydrofluoric acid using the rail line (Mackler 
2000). While the rail line is obviously a key link in the global movement of goods, 
it provides the fewest direct employment benefits for proximate residents and is 
probably the least desirable locational hazard examined here, which likely explains 
the strong relationship that exists with lower social class. 

Fourth, ladrillera density represents an industry that is both export- and
import-oriented and, unlike maquiladoras, owned in part by local people. The
relationship between low class and ladrillera density is due to the fact that locals
make very small profits from their kilns and are under pressure to sell out to
larger operators because they cannot support their families. The average yearly
profit per kiln is only $360 in 2009 U.S. dollars (5,200 pesos), and 53 percent
of operators own only one kiln (Romo et al. 2004). The meager profits garnered
by ladrillera owner-operators are likely to be spent locally, unlike the larger
profits generated by maquiladoras. However, unlike maquiladoras, the ladrilleras
generate few direct employment opportunities; 75 percent have only one or
two employees (Romo et al. 2004). The positive migrant finding more likely
relates to the undesirability of these areas, coupled with a service provided by
the kilns, than to the few employment opportunities generated by ladrilleras.
Operators sell broken bricks inexpensively to nearby marginalized households
for home construction (Romo et al. 2004). The disproportionate presence of
children near ladrilleras, which likely connects to the family-owned nature of
the business, is a serious concern given the air pollution generated by the low-
technology wood- and sawdust-fired kilns (Figure 5). Because of the associations
with social marginality in the models (more children, more migrants and lower
class), the few employment opportunities, and the small profits made by opera-
tors, ladrilleras represent a clear case of environmental injustice.

Figure 5. A Ladrillera in Juarez
While glossing over important particularities, findings from the composite
model help to make sense of the divergent findings across the four hazards. At
an aggregate level, lower class and higher percentages of children and migrants were
statistically significant predictors of composite density. Thus these results compare
favorably with previous studies in the North demonstrating that socially marginal-
ized groups live closer to hazards.


Conclusion 
While Carruthers (2008a) suggested that all people would be similarly burdened by
hazards in Latin American cities, we did not find that to be the case in Juarez. The
overall pattern of marginalized people living closest to hazards aligns with many
studies from the Global North. The decomposable relationships within this broad-
scale pattern demonstrate that exactly which groups are at risk depends on the
market orientation of the hazard (i.e., domestic vs. transnational) and its location
within the urban structure. While maquiladora owners make locational decisions
in reference to global markets, domestic industries make locational decisions within
the space economy of the bounded city. People also make residential decisions in
accordance with their socio-economic characteristics within the local land market.
Unlike in the North, southern elites typically choose to live in formally developed
locations at the center of the urban space economy with access to basic infrastruc-
ture (Davis 2006). These same physical characteristics are positive influences on the
locational decisions of transnational industries, like maquiladoras.

At the other end of the socio-economic spectrum in the South, the most mar- 
ginalized households typically live in the least developed areas on the urban fringe, 
and often in places at great risk to natural hazards, such as steep hills where mud 
slides occur or flood-prone plains (Davis 2006). These risks are heightened by the 
lack of social protections made available because, unlike their counterparts living 
on metropolitan fringes in cities of the Global North, residents at the margins of 
cities of the Global South lack the power to appropriate state and market resources 
(which are also generally less well developed) to externalize the risks they confront 
(United Nations Human Settlements Programme 2003). These informally de- 
veloped locations at the margins of the urban space economy are prime areas for 
domestic industries, like porciculturas, that do not have the need (or the capital) 
to locate in more developed areas like the transnational industries. They are also 
not places where people with means would choose to live. 

This research demonstrates that quantitative studies in the South are possible 
and important because they allow for the testing of the robustness of meta-link- 
ages between social marginality and the hazardousness of place. In this case, we 
found complex patterns of risk that vary between social groups and hazards, with 
the general pattern being similar between this southern city and Global North 
contexts previously researched. This is despite their differing position within the 
global economy and trajectory of urbanization and industrialization. Clearly, fur-
ther research in more varied urban contexts is needed. Quantitative studies in the
South can provide another piece of evidence that can feed social movements for 
environmental justice in the South as they have in the North. While quantitative 
environmental justice studies can be powerful tools for justice in political and 
legal arenas, activists in the South have not had widespread access to this tool. The 
potential also exists for studies conducted in the South to help activists galvanize 
support across political boundaries, as many nations are implicated in the injustices. 


Notes 


1. While race/ethnicity is typically used alongside social class in studies done in the 
United States, this information is not reported in the Mexican census and therefore 
not included in the analysis. Race/ethnicity variables are traditionally considered in 
U.S. environmental justice studies, yet they are less commonly employed in studies 
done outside of the United States. Even if they had been available, this study makes 
transparent the socially constructed nature of these categories in environmental 
justice work. Hispanic, for example, would not be a very meaningful indicator of 
social marginality in a study in Mexico, despite its applicability in the U.S. context. 


2. The social class factor was created by combining the four highly correlated variables
into one using the data reduction factor option in SPSS. The Cronbach’s alpha score,
which measures how well a set of variables measures a latent construct, was .913
for this factor.


3. In the final set of models, a higher percentage of migrants was significantly or nearly
significantly associated with increased density of hazard for four of the five dependent
variables. For the fifth, the relationship was positive but not significant.
Movement Organizations, Synergistic Tactics and
Environmental Public Policy


Erik W. Johnson, Washington State University 
Jon Agnone, University of Washington 
John D. McCarthy, Pennsylvania State University 


This study builds on political mediation and movement infrastructure models to 
highlight contingent and synergistic ways in which social movements may impinge 
upon the U.S. national policy-making process. Analyses employ a variety of datasets 
to examine the role of environmental movement organizational capacity, protest and 
institutional activity in garnering Congressional attention to, and action on, salient 
issues from 1961 through 1990. We find all types of movement activity, but especially 
the development of national organizational infrastructures, to be positively associated 
with the convening of Congressional hearings on the environment. Only when there 
are high levels of both protest and institutional activity is there any evidence that the 
environmental movement directly influences the passage of environmental laws. 


Research examining social movement outcomes has primarily focused on policy- 
related consequences of mobilization (for reviews see Amenta and Caren 2004; 
Andrews and Edwards 2004). Increasingly, this literature has sought to incorpo- 
rate information on a wide variety of theoretically important covariates, including 
social movement organizations, movement activities, public opinion and elements 
of the political opportunity structure, as well as exploring the ways in which these 
factors interact and combine to affect policy change (Agnone 2007; Andrews 
2001, 2004; Amenta, Carruthers and Zylan 1992; Amenta, Caren and Olasky 
2005; Cress and Snow 2000; Soule and Olzak 2004). At the same time, there has 
been heightened interest in examining different stages in the policy-making pro- 
cess such as legislation and hearings (Burstein, Bauldry and Froese 2005; Johnson 
2008; King, Bentele and Soule 2007; King, Cornwall and Dahlin 2005; Olzak 
and Soule 2009; Soule and King 2006). 

Guided by extant scholarship on the impact of policy on social movements, we 
account for each of the major causal factors that influence change in public policy, 
assessing contingent and synergistic effects of these factors at two important stages 
in the policy-making process: agenda setting and law passage. In doing so, we test 
theoretically driven hypotheses that specify how and under what circumstances 
movements are more or less likely to elicit changes in public policy. We examine 
movement infrastructure claims (Andrews 2001, 2004) of a synergistic effect 
when movements simultaneously employ both protest and institutional influence 
activities (i.e., press conferences and legal maneuvers) as well as political media- 
tion claims (Amenta 2006; Amenta and Caren 2004; Amenta et al. 1992, 2005; 
Cress and Snow 2000) about the political conditions under which activities are 
likely to be more or less efficacious. We extend these theories by positing how 
social movement organizations and activities, or combinations of strategies and 
political conditions, differentially affect the agenda setting and law passage stages 
of the legislative process. 

We use as our test case the U.S. national environmental movement. The 
American environmental movement experienced an upsurge in organization 
building, institutional influence activity and protest beginning in the late 1960s 
and continuing through the mid 1980s. Meanwhile, environmental issues became 
increasingly institutionalized within the federal policy-making arena (Andrews 
1999; Portnoy 1990). Accordingly, our analyses are guided by the following
questions: (1) What has been the role of the U.S. national environmental movement
in attracting federal attention to, and affecting policy changes on, environmental
issues? (2) How has the movement exerted influence? Does greater organizational
capacity, the adoption of institutional influence activity, the use of protest, or
some combination of these factors best account for political success? (3) Are there
certain conditions under which national movements are more or less likely to
influence federal agenda-setting activity or the passage of laws?


How Movements Influence the Policy Process 


In attempting to achieve their social change goals, social movement groups
primarily target the state to influence public policy. Drawing on several
influential typologies (see Andrews and Edwards 2004; Amenta, Halfmann and
Young 1999; Giugni 1998; Kingdon 1984), we conceptualize the federal
policy-making process as consisting of at least four important stages: (1) agenda setting,
(2) content specifying, (3) legislation passage, and (4) policy implementation.
Before decisions can be made on any given issue, the problem must first come to
the attention of political decision makers. While increased agenda salience does
not necessarily assure desired outcomes (Burstein 1985), it is a necessary precursor
to the development of public policy that increases the likelihood of legislation 
(Baumgartner and Jones 1993; Cobb and Elder 1975; Kingdon 1984). Moreover, 
it is at the agenda-setting stage that interest groups and social movements are 
thought to have the greatest influence (Andrews and Edwards 2004). 

Following this initial phase, a wide variety of actors compete to frame the 
issue. In a simplified version of legislative action, a formal bill is then introduced, 
and legislators determine whether to pass a law. This stage is the most visible in 
the political process, but policy making does not end with the passage of new 
laws. Instead, considerable work remains to specify how laws and policies are 
implemented, funded and enforced. The entire process is subject to feedback,
and revisions are likely.

Despite the assumption “that each part of the policy process is shaped by dif- 
ferent explanatory factors” (Andrews and Edwards 2004:492), scholars have only 
begun the process of illuminating how movement strategies, or combinations of
strategies, affect various stages of the policy-making process. Recent scholarship
(King et al. 2005, 2007; Soule and King 2006) describes how the influence of
social movements is conditioned upon a legislative logic of increasingly stringent
rules applied at each stage of the process. This logic predicts that movements’ effects
should diminish at each successive stage of the policy-making process, as the
results of acting on potential legislation become increasingly consequential (i.e.,
closer to enforceable law) and more closely contested. Agenda setting is the least
consequential stage, and the one that should be most responsive to social movement
organization activity. And indeed, a growing empirical literature examining
multiple stages of the policy-making process suggests that women’s, civil rights and
environmental movements significantly affect the setting of legislative agendas,
but have only minimal or null effects on the passage of laws (Johnson 2008; King
et al. 2005, 2007; Olzak and Soule 2009; Soule and King 2006). Rather than
simply a question of diminishing effects of social movements across the
policy-making process, it may also be that different factors or combinations of factors
matter at different stages.

At the agenda-setting stage, more expansive organizational capacity and greater 
amounts of institutional influence and/or protest activities may be sufficient to 
spur action. We therefore hypothesize: 


H1: Social movement organizational capacity, protest and 
institutional influence activity will each be positively associated 
with legislative agenda-setting activity on relevant issues. 


Concerning law passage, however, we are much more pessimistic about the direct 
influence of social movements, particularly the role of movement organizations. 
This reflects our understanding of the theoretically contingent nature of law pas- 
sage and the legislative logic of the public policy process, as well as the results of 
previous empirical analyses testing for a direct relationship between organized 
mobilization and the passing of legislation (Burstein and Linton 2002). In short, 
we think it unlikely that movements directly influence the passage of laws and 
embrace Burstein and Linton’s (2002) challenge to posit a null effect. 


H2: Neither social movement organizational capacity, protest 
nor institutional influence activity will have a direct effect on 
the incidence of law passage. 


Political Context 


Originally developed to explain social movement mobilization (McAdam 1982; 
Tilly 1978), the application of political opportunity theory to analyses of move- 
ment outcomes has become standard practice (e.g., Almeida and Stearns 1998; 
Banaszak 1996; Meyer and Minkoff 2004). The political opportunity structure (POS)
refers to “dimensions of the political environment that provide incentives for 
people to undertake collective action by affecting their expectations of success or 
failure.” (Tarrow 1994:85) What precisely should be included as relevant dimen- 
sions of the POS is highly variable, however. Analysts suggest a wide range of 
models, from those focused narrowly on structural dimensions of the political 
system (McAdam 1996) to those suggesting the inclusion of broad cultural ele- 
ments (Gamson and Meyer 1996). Given the wide variance in its application, 
it is perhaps not surprising that political opportunity theory has regularly been 
critiqued as a “black box,” a potential “sponge that soaks up every aspect of the 
social movement environment” (Gamson and Meyer 1996:275) or, in even less 
flattering terms, a “winding snarling vine.” (Goodwin and Jasper 1999) 

In an attempt to add conceptual clarity to political opportunity theory, Meyer 
and Minkoff (2004) parse commonly employed measures of the POS along two 
theoretical dimensions. First, they argue that it is important to distinguish between 
issue-specific measures of the POS (those that apply only to a particular move- 
ment or issue) and general measures of the POS important across a range of social 
movements. Second, they distinguish between measures that speak to structural vs. 
signaling models of the POS and find that signaling models are closely linked to 
dynamics of social movement organization (SMO) formation but that “[m]ovement-related policy outcomes are
unequivocally determined by structural elements in the polity.” (Amenta 1998:1483)
Although analyzing the substantive case of environmentalism, we are interested in 
how social movements may affect a broad range of public policy domains. 

Political mediation theory (Amenta 2006; Amenta et al. 1992, 1999, 2005) 
asserts that, in addition to direct effects, political opportunities mediate the effect 
of social movement activities on legislative action. In the strongest version of the 
theory, movement mobilization is seen as a necessary, but not sufficient, condition 
for political action: “...mobilization and collective action are usually insufficient 
for policy changes; under some circumstances, however, mobilization and collec- 
tive action may bring about changes.” (Amenta et al. 1992:312) The circumstances 
under which movement mobilization and activity are likely to bring about change 
are openings in the political environment. Again, however, rather than focus 
on direct effects, researchers applying political mediation models have begun to 
specify how different movement activities may be more or less effectual under dif- 
ferent political conditions. The central argument of political mediation models is 
that movement strategies are more likely to result in political gains when there is 
a “fit” between movement activities and the political environment. Elite allies play 
a particularly important role in mediation models (Cress and Snow 2000; Soule 
and Olzak 2004). When sympathetic or allied elites are in power, less assertive
movement strategies (e.g., institutional influence activity) are needed to achieve
a favorable political outcome, and more assertive protest activity may even waste
good will and backfire. But under more restrictive political opportunities, more
assertive strategies are required. Thus, institutional influence activity and organiza- 
tional capacity are more likely to be successful in favorable political climates, while 
relatively restrictive opportunities are more responsive to the use of disruptive 
protest tactics (Amenta 2006; Amenta et al. 2005; Soule and Olzak 2004). 

In the political mediation approach, movement organizations, institutional 
influence activity and protest are presumed to be equally effective. Accordingly, it 
is the appropriateness of movement activities to existing political conditions that 
determines policy success. 


H3: Across the policy process, the effect of social movement 
organizational capacity and institutional influence activity 
will be increased, and protest decreased, under favorable 
political conditions. 


Movement Infrastructure 


The movement infrastructure model (Andrews 2001, 2004) emphasizes: (1) how
tactical diversity may heighten movement effects; and (2) the role of organizational
infrastructure in contributing to diverse movement tactics. SMOs, in this
view, are important beyond their contribution to mobilizing resources. Primarily, 
movements with larger organizational infrastructures are also likely to have a 
greater diversity of organizations, in turn facilitating the simultaneous adoption 
of multiple influence methods. 

The movement infrastructure approach points to the need for movements 
to employ both “outsider” and “insider” tactics in achieving policy gains by 
asserting that movements have the greatest policy impact “when they... create 
leverage through multiple mechanisms.” (Andrews 2001:76-77) In short, neither 
disruptive protest nor institutional influence activity alone is sufficient to make 
favorable changes in public policy. The movement infrastructure model builds 
on longstanding observations of a radical flank effect within social movements 
(Jenkins and Eckert 1986; McAdam 1982), whereby the existence of a radical 
fringe prompts elite concessions in an effort to co-opt more moderate elements 
of a social movement. In a diverse organizational field, highly stable profession- 
alized SMOs offer the possibility of more effectively maneuvering routinized 
political channels through a range of institutional influence activities such as 
petitioning, letter-writing campaigns, litigation, testifying at Congressional 
hearings, and even drafting legislation and regulations (Staggenborg 1988). 
By employing professional staff scientists, lawyers and lobbyists, SMOs act 
like interest groups, serving as stable partners that may influence public policy 


2272 © Social Forces 88(5) 


by providing information and assistance in framing issues to Congressional 
members and staff (Andrews and Edwards 2004; Banaszak 1996; Burstein and 
Hirsh 2007). 

Simultaneously, in a diverse organizational field other groups may focus on 
more disruptive activities, such as political protest, which may be more likely to 
provoke an elite response. While a focus on social movement organization build- 
ing has been challenged as counter-productive to direct action (e.g., Piven and 
Cloward 1979), there is evidence that the size of a movement's organizational 
infrastructure may be positively associated with protest capacity (McAdam 1982; 
Minkoff 1997). The organizational infrastructure model sidesteps tension in the 
literature over the relative efficacy of protest, institutional influence activity and 
organizations by focusing on potential synergies between various social movement 
strategies, whereby movements disrupt on the one hand and offer opportunities 
for reconciliation on the other (see Giugni 1998 for a review). 

Despite its theoretical appeal, evidence speaking to the notion of a synergistic 
effect between movement protest and institutional influence activity remains 
underdeveloped. Research has generally failed to include data on movement orga- 
nizational capacity, protest and institutional influence activity, typically focusing 
on one or the other (Amenta et al. 2005 and Cress and Snow 2000 are notable 
exceptions). Andrews clearly indicates the necessity of movements employing 
both institutional influence and protest tactics, but does not directly test for the 
existence of tactical synergism or, indeed, account for movement activities in 
his empirical analysis. Instead, there is a presumption that existence of a strong 
organizational infrastructure allows for the deployment of a wide range of stra- 
tegic activities. While there is some merit to this approach, it is an open empiri- 
cal question whether movements with strong organizational infrastructures are 
necessarily more diverse in the strategies they employ (but see Olzak and Ryo 
2007). Accordingly, we test for a synergistic effect between institutional influence 
and protest activities in affecting law passage, while controlling for the size of the 
movement’s organizational infrastructure. We do not consider multiple influence 
mechanisms a necessity at the agenda setting stage, however, because this part of 
the political process is relatively more open to influence. Echoing the idea that 
this fundamental proposition should apply at the law-passage, but not necessar- 
ily agenda-setting stage, Amenta et al. (2005:522) note that, “[f]undamentally 
altering policy, however, is likely to take both strong mobilization and extensive 
assertive shows of strength.” We therefore expect a differential influence of tactical 
synergy between the agenda-setting and law-passage stages of the policy process. 


H4: The simultaneous deployment of both protest and 
institutional influence activities is not necessary for influencing 
agenda-setting activities. 


H5: The simultaneous deployment of both protest and 
institutional influence activities is positively associated with 
the incidence of law passage. 
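
One way to make H4 and H5 concrete is to read them as statements about an interaction term in a count model. The notation below is an illustrative assumption rather than the specification reported in the article: Y_t is the yearly count of hearings or laws, P_t and I_t are protest and institutional influence activity, O_t is organizational capacity, and z_t collects the control variables. Questions of timing, such as lagging the covariates, are left out of the sketch.

```latex
\log \mathrm{E}[Y_t] = \beta_0 + \beta_1 P_t + \beta_2 I_t
  + \beta_3 \,(P_t \times I_t) + \beta_4 O_t
  + \boldsymbol{\gamma}^{\top} \mathbf{z}_t
```

Under this sketch, H5 corresponds to \beta_3 > 0 in the law-passage model, while H4 implies that \beta_3 need not differ from zero in the agenda-setting model.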


The Modern Environmental Movement 


Although the U.S. national environmental movement is rooted in the conservation
movement of the late 19th and early 20th centuries, it is the extensive period
of organization building beginning in the late 1960s and continuing through 
the 1980s that marks the beginning of the modern environmental movement in 
America (Brulle 2000; Gottlieb 1993). The rate of new national environmental 
movement organization foundings surged after 1965, peaked in 1970, and re- 
mained high throughout the 1980s (Johnson and McCarthy 2004; McLaughlin 
and Khawaja 2000). Many of the EMOs founded during this period can be 
characterized as proto-typical professional movement organizations (McCarthy 
and Zald 1977), with a heavy reliance on a paper membership and a full-time 
paid staff of scientists, lawyers and lobbyists rather than an active membership. 
Compared to other social movements, the professional skills these organizations 
brought to environmental issues, along with early legislative success, fostered a 
heavy reliance on more traditional institutional influence activity such as lobbying, 
litigation, testifying at Congressional hearings and public education. 

The rapid emergence and growth of a popularly supported national environ- 
mental movement with a large organizational infrastructure and increasingly 
diverse tactical repertoire is thought to have spurred the development of a fed- 
eral environmental regulatory framework. Indeed, during the 1970s, there was 
unprecedented attention paid to, and bi-partisan support for, ecological issues in 
national policy. The Environmental Protection Agency, founded in 1970, centralized
federal responsibility for environmental protection and regulation. Further,
30 major pieces of environmental legislation were passed (Miller 1991). While 
the inauguration of President Reagan decidedly shifted the political landscape, 
environmental issues remained firmly institutionalized within the U.S. political 
system throughout the 1980s. The erosion of federal environmental protections 
during the Reagan administration initially produced a backlash of support for 
environmental issues, resulting in a membership surge during the early 1980s 
for national EMOs (Dunlap and Mertig 1992). As the decade wore on, however, 
national organizations increasingly came to be perceived as incapable of protect- 
ing previous legislative gains. This, combined with the emergence of a vibrant 
grassroots strand of the movement focused on issues of toxic contamination 
and environmental justice and extremely critical of mainstream national EMOs 
(Bullard 1990; Brown and Masterson-Allen 1994; Cable and Cable 1995; Szasz 
1994), contributed to a growing crisis of legitimacy within the national movement 
(Dowie 1995).

Observers have tended to assume a link between citizen mobilization
around environmental issues and federal attention to those same issues,
although the topic has only recently garnered scholarly attention (Agnone 2007;
Johnson 2008; Olzak and Soule 2009). While informative, these analyses suffer 
from a weakness that has plagued movement outcomes research more generally: 
the inability to simultaneously account for the role of movement organizations 
and activities in models while controlling for other important covariates. 


Data 
Policy Outcomes 


Following previous research (Johnson 2008; Jones and Baumgartner 2004; King et 
al. 2007; Sheingate 2006), Congressional agenda-setting activities are measured by 
yearly counts of Congressional hearings convened on environmental issues. The 
convening of Congressional hearings on an issue signals the importance of that 
issue beyond the limited confines of the particular committee in which they are 
held (Diermeier and Feddersen 2000). Further, hearings provide outlets for SMOs 
and interest groups to express policy preferences and provide information that may 
have important consequences for relevant legislation (Burstein and Hirsh 2007). 

Measures of Congressional hearings and law passage are constructed using 
data from the U.S. Congressional Hearings database and the U.S. Public Laws 
data file of the Policy Agendas Project. Each hearing and law in these data is 
coded according to 19 major topics, and 225 subtopics. We initially defined en- 
vironmental hearings and laws to include all those coded under the Environment 
heading. Subsequently, textual summaries of each hearing and law were reviewed 
to determine whether each was consistent with the goals of the environmental 
movement. In the end, we included 1,701 total hearings, an annual mean of 57, 
over the 30-year observation period. A total of 309 laws (maximum yearly count 
of 27 and mean of slightly more than 10) were identified as directly relevant to 
the environmental movement. Examples of major environmental legislation are 
the Clean Water Act and Federal Environmental Pesticides Control Act of 1972 
and the Ocean Dumping Act of 1988. 


The Environmental Movement 


Movement Activities 


Data on the incidence of environmental movement collective action events were 
drawn from The Dynamics of Social Protest Project. We extracted all events (n = 
554) for which the organizing claim was environmental. This final dataset includes 
348 events that can be characterized as institutional influence activity on the part 
of the movement (e.g., information distribution, press conferences or lawsuits) 
as well as 180 protest events (e.g., demonstrations, marches, civil disobedience). 

Despite a long tradition in social movement research of relying on newspaper 
reports of protest events (see Earl et al. 2004 and Olzak 1989 for reviews), the 
method is not above reproach. Although detailed review of the limitations of 
newspaper data is beyond the scope of this article, critiques have focused on the 
extent of description and, especially, selection biases in newspaper coverage (Earl 
et al. 2004; McCarthy, McPhail and Smith 1996; Myers and Caniglia 2004; 
Ortiz et al. 2005). While selection bias is a potentially large problem for analyses 
that employ newspaper events as the dependent measure, when employed as the 
independent measure, newspaper data is less problematic because newspapers 
likely capture those events most salient to lawmakers and the public (Earl et al. 
2004; McAdam and Su 2002). 

Protest has been less central to the environmental than other social movements 
that have traditionally dominated the field of study (e.g., labor, civil, gay and 
lesbian rights). This is reflected in the relatively low rates of environmental protest 
incidents in the United States compared to other domestic social movements 
(Jenkins and Halcli 1999) and the popular characterization of U.S. national envi- 
ronmental organizations as proto-typical professional SMOs. This does not mean 
that protest is unimportant or that lawmakers ignore environmental protest. For 
example, protest activities by the Love Canal Homeowners Association, includ- 
ing the brief hostage taking of EPA officials, are widely credited with playing an 
important role in the development of federal Superfund legislation (Szasz 1994). 
And, the first Earth Day protests/celebrations were organized by Sen. Gaylord 
Nelson and his staff in what was a conscious, and ultimately successful, effort 
to pressure his peers into voting for a host of environmental reforms, including 
passage of the Clean Air Act of 1970. As Gottlieb (1993:127) notes, “[t]o policy 
makers in Washington, it was precisely the concerns about activism that pushed 
along the Clean Air debate.” Despite this anecdotal evidence that environmental 
protest matters to federal policy makers, it may be that protest plays a more im- 
portant role in terms of securing political outcomes for movements where protest 
is a more central political tactic. If true, our analyses are likely to underestimate 
the ability of protest to influence the policy-making process. 


Organizational Capacity 


Data on environmental movement organizational capacity come from the 
Encyclopedia of Associations, National Organizations of the U.S. (Gale Research Inc. 
1956-2003). This yearly survey of national non-profit associations active in the 
United States has been published annually since 1974 and intermittently back to 
1956. National EMOs were identified using headings that indicate environmental 
concern (see Johnson 2008 for detail). Both highly institutionalized issue advo- 
cacy organizations (e.g., Sierra Club or National Wildlife Federation) and more 
confrontational, loosely structured direct action groups (e.g., Earth First! or The 
Clamshell Alliance) are included in the sample. Collectively, these organizations 
exhibit a wide range of tactics, discourse frames, structures and constituencies. An 
average of 296 EMOs existed across our time series, with a minimum of 92 in 
1960 and maximum of 503 in 1989. 




As a measure of movement organizational infrastructure capacity, the total 
number of U.S. national EMOs active in each year, or population density, was 
computed from organizational birth and death information contained in the 
Encyclopedia. Population density is a commonly employed measure of social move- 
ment organizational infrastructure strength, including for the cases of women’s 
and civil rights (Minkoff 1997) and the environmental movement (Johnson and 
McCarthy 2004; McLaughlin and Khawaja 2000). 
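
The density measure described above can be sketched in a few lines. This is a minimal illustration, not the authors' procedure: the function name, the input format and the example organizations are hypothetical, and the rule that an organization stops counting in its dissolution year is an assumption, since the text does not say how boundary years were handled.

```python
def population_density(orgs, first_year, last_year):
    """Count organizations active in each year.

    orgs is a list of (founding_year, dissolution_year) pairs, with None as
    the dissolution year for organizations that are still active.
    Assumption: an organization is active from its founding year through the
    year before its dissolution year.
    """
    density = {}
    for year in range(first_year, last_year + 1):
        density[year] = sum(
            1
            for born, died in orgs
            if born <= year and (died is None or year < died)
        )
    return density


# Hypothetical organizations, not records from the Encyclopedia.
example_orgs = [(1951, None), (1960, 1964), (1962, None), (1963, 1963)]
density = population_density(example_orgs, 1961, 1965)
```

Counting in the dissolution year instead would only require changing `year < died` to `year <= died`; the choice shifts each yearly count slightly but not the overall trend.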


Political Context 


Democrats have long been identified as allies of the environmental movement in 
the United States (Dunlap and Allen 1976; Guber 2003), with the presence of 
Congressional Democrats and a Democratic President expected to increase the 
number of environmental hearings and the amount of environmentally favorable 
legislation passed. Democratic control of Congress is measured as the percentage of 
total Democratic seats held in the House of Representatives and Senate, the standard 
in the literature (e.g., Olzak and Soule 2009; Stimson, MacKuen and Erikson 1995; 
Soule and Olzak 2004; Wlezien 2004). The presence of a Democratic President 
is measured using a dummy variable coded 1 during years of a Democratic presi- 
dential administration (Meyer and Minkoff 2004). Democratic Party advantage in 
Congress and the presence of a Democratic President are both expected to facilitate 
environmental issue agenda setting and law passage. 

Another factor affecting legislative activity is the nature of the Congressional 
workload. Typically, greater numbers of hearings are held in the first year of a 
Congress, while a greater number of bills are passed in the second Congressional 
year, which is also an election year. We employ a dichotomous variable coded as 
1 during Congressional election years, with the expectation that the incidence 
of environmental hearings will decrease, and the passage of environmental laws 
increase, during election years (Agnone 2007; Meyer and Minkoff 2004). 

Accounting for movement opposition is important (Andrews 2002; Meyer and 
Staggenborg 1996). However, the operationalization of environmental movement 
opposition is complicated by the nature of changing alliances according to the 
specific environmental issues under consideration (Schnaiberg and Gould 2000) 
and the changing relationship between industry and environmental issues over 
time, whereby industries that once steadfastly opposed environmental protection 
measures have increasingly come to embrace them (Hoffman 2001). Opposition 
to pro-environmental legislation in Congress has traditionally come from the 
Republican side of the aisle, however, and especially from more conservative quarters.
To account for opposition to environmental legislation, we follow Olzak and
Soule (2009) by including a measure of Conservative Coalition votes in Congress 
(data from Ornstein, Mann and Malbin 2008). 

Finally, we control for prior congressional action in two ways: the number of 
environmental hearings held and the number of laws passed in the previous year. 
There are two reasons for controlling for past hearings. First, legislative inertia 
may be at work, as hearings on one topic may spur the need to hold hearings on 
related topics. Second, controlling for hearings held the previous year is necessary 
to investigate the impact of agenda setting on the passage of future legislation, as 
most bills begin as topics explored in hearings. A similarly dynamic process ex- 
ists for the passage of laws wherein past legislation often spurs future legislation 
designed to strengthen or weaken earlier laws. 


Control Variables 
Public Opinion 


Research on social movement political outcomes has been strongly criticized for 
failing to account for public opinion in theories and models (Burstein 1998). One 
obstacle is that polling firms do not ask about particular issues until they have 
become important or salient (Burstein 2006). In the case of environmental issues, 
this means that environmental public opinion trend data is nonexistent prior to 
1970 (Dunlap and Scarce 1991; Guber 2003). As a measure of public opinion 
we employ an environmental attitudes index constructed using Stimson’s (1999) 
WCALC program. This involved entering the positive responses calling for more 
action on the part of the government based on 64 “readings” of public opinion 
on the environment from 1954 through 2000 (see Agnone 2007 for details). This 
technique has been employed to compile time series public opinion data on a 
range of public policy issues (e.g., Smith 2000; Kellstedt 2003). 


Media Attention 


Mass media plays an important role in the policy process by focusing public at- 
tention on a particular issue (Downs 1972) or by raising the salience of a topic 
(Zaller 1992). We measure media attention using the annual number of stories 
in the New York Times Annual Index referring to the environment. The data come 
from The Policy Agendas Project. 


Environmental Pollution 


Objective environmental pollution is measured as the five-year average of annual 
U.S. emissions of five air pollutants: particulate matter less than 10 microns, car- 
bon monoxide, sulfur dioxide, nitrogen dioxide and volatile organic compounds 
(U.S. Census Bureau 1965-1995). Yearly emissions for each of these pollutants 
were standardized and combined to create an air pollution index, with each pol- 
lutant weighted equally. The resulting index is a direct measure of environmental 
degradation that represents four of the six criteria air pollutants, integrating data 
on a diversity of types and sources of emissions (Environmental Protection Agency 
2000). Descriptive statistics for the dependent and independent variables, as well 
as interactions described later, are shown in Table 1. 
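
The index construction described above can be sketched as follows. The order of operations is an assumption: each pollutant's yearly series is converted to z-scores, the z-scores are averaged with equal weights, and the result is smoothed with a trailing five-year mean. The function names and the example values are hypothetical and are not the Census Bureau figures.

```python
from statistics import mean, pstdev


def standardize(series):
    """Convert a yearly series to z-scores (population standard deviation)."""
    m, s = mean(series), pstdev(series)
    return [(value - m) / s for value in series]


def pollution_index(emissions, window=5):
    """Equal-weight index of standardized series, smoothed over `window` years.

    emissions maps a pollutant name to a list of yearly values; all lists
    must cover the same years. Years without a full window return None.
    """
    standardized = [standardize(series) for series in emissions.values()]
    combined = [mean(year_values) for year_values in zip(*standardized)]
    return [
        mean(combined[i - window + 1 : i + 1]) if i >= window - 1 else None
        for i in range(len(combined))
    ]


# Hypothetical emissions for two pollutants over six years.
example = {
    "carbon_monoxide": [1, 2, 3, 4, 5, 6],
    "sulfur_dioxide": [10, 20, 30, 40, 50, 60],
}
index = pollution_index(example)
```

Standardizing first keeps a pollutant measured in large absolute quantities from dominating the index, which is what equal weighting requires.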
Table 1: Descriptive Statistics 1961-1990 


Methods of Analysis 


Poisson and closely related negative binomial regression models are appropriate 
statistical inference techniques when fitting models with count data as the depen- 
dent variable (Long 1997) and common within the relevant literature (Agnone 
2007; Johnson 2008; King et al. 2005, 2007; Meyer and Minkoff 2004; Olzak 
and Soule 2009). In Poisson regression, a log transformation prevents the model 
from producing negative predicted values, adjusts for a skewed distribution, and 
models the variance in event counts as a function of the mean (Liao 1994; Long 
1997). The Poisson distribution does, however, have a rather restrictive assump- 
tion that the mean equal the variance, known as equidispersion. The violation of 
this assumption downwardly biases the standard errors while leaving the regression 
coefficients unaffected (Hoffmann 2003). Formal statistical tests for the presence 
of overdispersion in the data indicated that Poisson regression is more appropriate 
than Negative Binomial regression for the estimated models (results available from 
authors). Data were analyzed using Stata 10. 
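
The equidispersion assumption can be illustrated with a crude descriptive check. The sketch below is not the formal test the authors ran in Stata, which is not specified here and would condition on the covariates; it only compares the unconditional variance of a count series to its mean. The function name and both count series are hypothetical.

```python
from statistics import mean, variance

def dispersion_ratio(counts):
    """Sample variance divided by sample mean; about 1 under equidispersion."""
    return variance(counts) / mean(counts)

# Hypothetical yearly event counts, not data from the study.
roughly_equidispersed = [2, 5, 4, 7, 1, 5, 4, 4]
overdispersed = [0, 1, 0, 15, 2, 0, 22, 0]

ratio_a = dispersion_ratio(roughly_equidispersed)  # close to 1
ratio_b = dispersion_ratio(overdispersed)          # far above 1
```

A ratio far above 1 is the pattern that would normally motivate a negative binomial model; a ratio near 1 is consistent with retaining the Poisson specification.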


Results 
Setting the Agenda 


The results from multivariate Poisson regression models examining the determi- 
nants of federal agenda-setting activities from 1961-1990 are shown in Table 2. 
We began by fitting a baseline model (Model 1) that focuses on the legislative pro- 
cess net of any social movement activity. We control for party of the President, par- 
tisan control of Congress, opposition party strength, Congressional election cycles, 
media attention to environmental issues, pollution, public opinion and prior leg- 
islative activity on the environment. Results indicate that Congressional election 
years are negatively associated with hearings, as is opposition party strength; media 
attention to environmental issues increases the number of environmental hear- 
ings held, as does the occurrence of prior hearings. The percentage of Congress 
controlled by Democrats is not a significant predictor of environmental hearings. 

In Model 2, we include indicators of environmental movement organizational 
capacity, protest and institutional activity to establish their effect beyond what 
is accounted for by our baseline model. Independent of one another, each is 
positively associated with the incidence of Congressional hearings on environmen- 
tal issues (not shown). Organizational capacity remains a statistically significant 
predictor of Congressional agenda-setting activities. Exponentiating the coef- 
ficient for organizational capacity suggests that the existence of each additional 
environmental organization in the prior year, net of control variables, increases the 
likelihood of Congressional hearings on the environment being held the following 
year by .3 percent (e^b ≈ 1.003). 
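
The conversion from a Poisson coefficient to a percentage change can be shown as a short calculation. In a log-link count model, exponentiating a coefficient gives the multiplicative change in the expected count for a one-unit increase in the predictor. The coefficient value below is an assumption, back-calculated from the reported .3 percent rather than taken from Table 2, and the function name is illustrative.

```python
import math

def percent_change(coefficient):
    """Percent change in the expected count for a one-unit increase in a predictor."""
    return (math.exp(coefficient) - 1) * 100

# Hypothetical coefficient implied by the reported .3 percent; not from Table 2.
b_org_capacity = 0.003
change = percent_change(b_org_capacity)
```

For coefficients this small the percentage change is almost identical to 100 times the coefficient itself, which is why the two are often read interchangeably.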

Model 3 tests for a synergistic tactical impact on Congressional hearings (H4), 
but this coefficient, as well as the likelihood ratio test measuring overall model 
fit compared to the baseline linear movement model (Model 2), lacks statistical 
significance. The environmental movement does not increase its ability to set the 
Congressional environmental agenda when simultaneously engaging in protest 
and institutional influence activity. 
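
The likelihood ratio comparison used throughout these models can be sketched as follows. This is an illustration under assumptions, not the authors' Stata output: the log-likelihood values are invented, the function name is hypothetical, and the p-value formula applies only when the nested models differ by exactly one parameter (one added interaction term).

```python
import math

def likelihood_ratio_test(loglik_restricted, loglik_full):
    """LR statistic and p-value for nested models differing by one parameter."""
    statistic = 2 * (loglik_full - loglik_restricted)
    # Survival function of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(max(statistic, 0.0) / 2))
    return statistic, p_value

# Hypothetical log-likelihoods, not values from Table 2.
stat, p = likelihood_ratio_test(loglik_restricted=-120.4, loglik_full=-119.9)
```

A large p-value, as in this invented example, means the added term does not improve fit over the restricted model, which is the pattern reported for Model 3.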

Political mediation theory posits that the effect of social movement organizational 
capacity and institutional influence activity will be amplified, and protest moderated, 
under favorable political conditions. We test this hypothesis in a step-wise manner 
by including interaction terms between the percentage of Congressional seats 
controlled by Democrats and social movement organizational capacity (Model 4), 
institutional influence activity (Model 5), and protest (Model 6). Our hypotheses 
are supported if the individual interaction effects achieve statistical significance 
and likelihood ratio tests of the interaction model show an improvement in fit 
over the baseline linear model (Model 2).¹¹ All movement-political opportunity 
interaction effects are statistically significant at .01 or higher and the likelihood 
ratio tests comparing the improvement in fit of each interaction model to the linear 
movement model (Model 2) are statistically significant at .01 or higher, providing 
support for Hypothesis 3. Movement activity of any type (building organizational 
capacity, institutional influence activity or, unexpectedly, protest) is associated 
with greater Congressional attention when political conditions are favorable to 
movement goals. 


Law Passage 


Table 3 depicts results from multivariate Poisson regression models of the 
determinants of the passage of pro-environmental laws from 1961 through 1990. We 
employ the same modeling strategy as in the analysis of hearings, beginning by 
fitting a baseline model controlling for the political environment and its 
covariates. Model 1 shows environmental law passage to be positively associated with 
the presence of Congressional election years and the convening of environmental 
hearings the prior year. 

In Model 2 we include indicators of environmental movement organizational 
capacity and activities. The U.S. environmental movement does not appear to 
directly affect the passage of environmental legislation (Model 2). The coeffi- 
cients for organizational capacity and protest both fail to achieve statistical sig- 
nificance and the addition of these movement variables does not alter the overall 
fit of the model (the same pattern of results is evident in partial models with 
movement variables entered individually). No single tactical repertoire of the 
environmental movement appears sufficient to affect legislative policy. Model 
3 tests for a synergistic tactical impact on the passage of pro-environmental 
laws, as suggested by the movement infrastructure model. We find support for 
Hypothesis 5. The movement infrastructure coefficient measuring the interac- 
tion of protest and institutional influence activity is positive and statistically 
significant at .01 or greater, while the likelihood ratio of the overall model fit 
compared to the linear movement model (Model 2) achieves statistical sig- 
nificance at .05 or greater. Consistent with the expectations of the movement 
infrastructure model, the environmental movement positively affects the passage 
of pro-environmental legislation when simultaneously invoking protest and 
institutional influence activities. 

Models 4 through 6 offer empirical tests of political mediation theory, which 
suggests that SMOs and institutional influence activity will be received more 
favorably during times in which movement allies hold office, whereas protest will 
be more effective when movement adversaries are in power. Neither movement 
organizational capacity (Model 4) nor institutional influence activity (Model 
5) interacts with a Democratic Congress to significantly affect law passage. The 
interaction term for movement protest and a supportive political environment, 
conceptualized as Democratic control of Congress (Model 6), is negative and 
statistically significant at .01 or greater, with significant improvement in model 
fit at .05 or greater. Thus, consistent with political mediation models, our find- 
ings indicate that protest is associated with greater amounts of law passage during 
politically closed periods and can backfire during open political periods—in the 
case of the environmental movement diminishing effectiveness when Democrats 
have higher percentages of Congressional seats. 
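
The logic of this interaction test can be written out as a worked equation. The following is a schematic specification, not the authors' exact model: $P_t$ (protest), $D_t$ (percentage of Congressional seats held by Democrats), the control vector $\mathbf{x}_t$ and all coefficient symbols are notation assumed here for illustration, and any lag structure is omitted.

```latex
\begin{align}
\log \mu_t &= \beta_0 + \beta_1 P_t + \beta_2 D_t
              + \beta_3 (P_t \times D_t) + \mathbf{x}_t^{\top}\boldsymbol{\gamma} \\
\frac{\partial \log \mu_t}{\partial P_t} &= \beta_1 + \beta_3 D_t
\end{align}
```

Here $\mu_t$ is the expected number of laws passed in year $t$. With $\beta_3 < 0$, as reported for Model 6, the association between protest and law passage weakens as $D_t$ rises and, assuming $\beta_1 > 0$, turns negative once $D_t$ exceeds $-\beta_1 / \beta_3$.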


Discussion 


Our results provide evidence that social movements are able to successfully 
influence the agenda-setting process, but how movements are able to do this is not as 
straightforward as prior research suggests. Our findings support a legislative logic 
whereby social movements are more consequential at earlier stages in the political 
process. Using the U.S. environmental movement as our empirical case, we find 
evidence that movement mobilization is positively associated with Congressional 
agenda-setting activities, especially when operating in a favorable political context. 
These findings remain robust when looking at protest, institutional influence 
activity or organizational capacity, although organizational capacity appears especially 
efficacious at the agenda-setting stage of the policy process. 

These same factors are much less influential at the law passage stage, however. 
There is no indication of a direct effect for environmental movement institutional 
influence activity, protest or organizational capacity on the incidence of law pas- 
sage. Nor is there any indication that movement organizational capacity or insti- 
tutional influence activity is more efficacious during open political periods. Protest, 
however, is associated with increased law passage during closed political periods. 

Finally, we find evidence that the environmental movement influences law 
passage when high rates of institutional influence activity and protest coincide, as 
predicted by the movement infrastructure model. This finding should be of particular 
interest to movement scholars as it supports the notion that tactical diversity 
within a movement can be efficacious in achieving desirable political outcomes. 
Protest activities are likely to be more successful in garnering favorable policy 
outcomes when accompanied by a vibrant movement simultaneously working 
through institutionalized political channels. The combination of institutional 
influence activity and protest allows a movement to simultaneously persuade, 
disrupt and bargain. This suggests that long-standing debates over the relative 
efficacy of protest vs. more traditional institutional influence activity (Gamson 
1975; Piven and Cloward 1979) represent a false dichotomy. Rather than consti- 
tuting a zero-sum trade-off, institutional influence and protest activities interact 
to affect policy change, with movements hoping to secure legislative gains best 
served by employing diverse tactics (see also Olzak and Ryo 2007). However, it 
is noteworthy that the simultaneous deployment of these tactics offers no signifi- 
cant advantages at the agenda-setting stage, where Congress appears much more 
responsive to direct movement influence (Cornwall et al. 2007). 


Conclusion 


While the U.S. national environmental movement has impinged upon the policy- 
making process, the ways in which it has done so are both complex and context 
dependent. Our research confirms that movements have greater efficacy at the ear- 
lier agenda-setting stage of the policy process than at the more consequential and 
contingent law-passage stage. Our principal finding, however, is that high levels 
of institutional influence and protest activity interact to increase the incidence of 
law passage. Secondarily, we find the contextual effect of a supportive political 
environment limited to the agenda-setting stage of the policy process, while protest 
increases rates of law passage only when the political context is relatively closed. 

While we examine the importance of institutional influence activity, we are 
not able to account for the direct lobbying of legislators, which may be especially 
important in explaining public policy outcomes. In attempting to understand 
how movements influence the legislative process, it would be useful to observe 
and record information on direct contacts between movement representatives and 
individual legislators. Wright (1990) shows that PAC contributions are effective 
only in tandem with lobbying contacts. Whether lobbying activities would have 
an effect independent of the activities described here, or in combination with 
them, remains an open empirical question. 

There are also specific characteristics of the time period and movement studied 
here that may be unique. The observation period encompasses extremely high 
mobilization within the U.S. national environmental movement. The 1965-1985 
period is also one of heightened federal legislative attention to environmental 
issues, what Baumgartner and Jones (1993) refer to as a period of punctuated 
equilibrium. In periods of lower movement activity and/or legislative attention 
we might expect the relationship between movement and political dynamics to 
be altered. Unfortunately, the majority of analyses of movement outcomes have 
studied movements in periods of extremely heightened mobilization (e.g., Amenta 
et al. 1992, 2005; Andrews 2001; King et al. 2005, 2007; McAdam and Su 2002; 
Santoro 2002) or excluded cases from analysis after movements achieved a certain 
level of success, such as the passage of a state-level Equal Rights Amendment 
(Soule and Olzak 2004). Research assessing the outcomes of movements in periods 
of stable or declining mobilization is needed to help fill this lacuna. 

We hope that this research will move scholars to more routinely account for 
the variety of factors that influence public policy, the ways in which they interact 
to effect change, and their differential effects across stages of the policy process. 
The increased availability of data covering multiple stages of the political process 
and covariates of interest, combined with increasingly sophisticated theoretical 
specifications for the complex and contingent ways in which movements affect 
the policy making process, has great potential to improve our understanding of 
how and when social movements influence changes in public policy. 


Notes 


1. For a summary of the literature on non-governmental targets of social movements 
see Van Dyke, Soule and Taylor 2004. 


2. Openings in the POS are generally seen as providing enhanced opportunities for 
activism, thus contributing to elevated rates of mobilization. But, political threat 
may also inspire mobilization to protect existing resources (Goldstone and Tilly 2001; 
Tilly 1978). Meyer (1990, 1993) documents how perceived threats by the Reagan 
Administration provoked mobilization of the nuclear freeze movement. Similarly, 
Reagan’s Secretary of the Interior James Watt evoked a sense of threat that 
spurred U.S. environmental mobilization in the early 1980s (Sale 1993; Shaiko 
1999). Evidence of politically imposed threats leading to mobilization is widespread 
in literatures examining ethnic conflict (Jacobs and Wood 1999; Van Dyke and 
Soule 2002), the interactions between protestors and security forces (Earl, Soule 
and McCarthy 2003; Goldstone and Tilly 2001; Khawaja 1993), and to a lesser 
extent movements operating within authoritarian regimes (Almeida 2003). This 
literature documents ways in which a contracting POS may result in movement 
mobilization, but there is no corollary literature similarly arguing that contracting 
political opportunities enhance the odds of movements achieving political success. 


3. The data were originally collected by Frank R. Baumgartner and Bryan D. Jones, 
with the support of National Science Foundation grant number SBR 9320922, and 
were distributed through the Center for American Politics and Public Policy at the 
University of Washington and/or the Department of Political Science at Pennsylvania 
State University. For details, go to: http://depts.washington.edu/ampol/. 


4. All hearings on the Environment between 1960 and 1990 were initially selected, 
yielding 2,005 events. Of these, 304 were eliminated from the sample because of 
a focus on funding authorizations, procedural debates or topics otherwise deemed 
beyond the scope of environmental politics or inconsistent with the goals of the 
environmental movement (e.g., relaxing air pollution standards). Twenty-five laws 
were deleted from the sample for the same reasons. Although we think there are good 
theoretical reasons for excluding these hearings and laws in analyses, in practice their 
inclusion in models of the incidence of environmental hearings and laws does not 
change the substantive interpretation of results. 


5. These data are collected from daily editions of The New York Times as part of a 
larger research project initiated by Doug McAdam, John McCarthy, Susan Olzak and 
Sarah Soule (see McAdam and Su 2002; Earl, Soule and McCarthy 2003; Van Dyke, 
Soule and Taylor 2004). To be included in this dataset an event had to be collective 
in nature, have articulated some claim (either a grievance against, or expression of 
support for, some target) and must have occurred in the public sphere or have been 
open to the public. These data are publicly available and can be found at 
http://www.stanford.edu/group/collectiveaction/cgi-bin/drupal/. 


6. Twenty-six events coded as “other” were dropped from analysis. 


7. In models not shown, we also tried a series of dummy measures for divided party 
control of Congress, as well as a cumulative scale of Democratic power (see Agnone 
2007). These alternative specifications of the POS had no significant effects. 


8. Although we control for Democratic president, substantial environmental 
legislation was passed and the EPA established during the tenure of Richard Nixon, 
a Republican president. It has been convincingly argued that Nixon was primarily 
responding to political pressure from legislative opponents in the Democratic-controlled 
Congress, rather than being committed to environmental issues or an ally 
of the environmental movement (Flippen 2000), but there is potential for a “Nixon 
effect” on environmental legislation that may skew our results. To test for this, we 
explored alternative modeling strategies that included omitting the variable from 
analysis, recoding Nixon as a Democrat, and adding a Nixon period effect dummy 
variable. None of the alternative specifications significantly improved model fit or 
changed the central findings of the analysis (results available from authors). A Nixon 
period dummy variable was consistently and positively related to the incidence of 
environmental law passage, although its inclusion did not significantly improve the 
amount of variation explained in our models. 


9. Because our dependent variables represent broad Congressional processes (total 
number of hearings in the House, Senate and joint committees, as well as the number 
of laws passed), political opposition is measured using counts of the total number 
of Conservative Coalition votes in Congress, unlike Olzak and Soule (2009) who 
focus on voting patterns only in the House of Representatives. In models not shown 
we also controlled for political opposition using the annual budget for the National 
Association of Manufacturers, the largest trade association in the United States and 
an organization that has played an important role in the U.S. conservative movement 
(Soffer 2001). NAM budget was highly correlated with several of our central variables 
and thus not maintained in our final models. Regardless of measure of political 
opposition used, our central findings hold (results available upon request). 


10. Criteria air pollutants are those for which the EPA has set health based standards and 
include carbon monoxide, nitrogen oxides, sulfur dioxide, particulate matter, ozone 
and lead. No measure for ozone emissions is available as this pollutant is produced 
by photochemical reactions in the atmosphere rather than direct emissions. Data on 
air-borne lead emissions, available beginning in 1970, are excluded from analysis. 
Volatile organic compounds are included as they enable and facilitate the formation 
of other criteria air pollutants. 


11. Jaccard and Turrisi (2003) consider the interaction model a significant improvement if 
the interaction term is statistically significant. However, we only consider the interaction 
terms meaningful if the interaction model elicits an improvement in model fit above 
the linear specification (Soule and Olzak 2004). We employ a likelihood ratio test 
comparing each interaction model to the baseline model (Model 2). 


Scenes: Social Context in an Age of Contingency 


Daniel Silver, University of Toronto 
Terry Nichols Clark, University of Chicago 
Clemente Jesus Navarro Yanez, Universidad Pablo de Olavide 


This article builds on an important but underdeveloped social science concept—the 
“scene” as a cluster of urban amenities—to contribute to social science theory and 
subspecialties such as urban and rural, class, race and gender studies. Scenes grow 
more important in less industrial, more expressively-oriented and contingent societies 
where traditional constraints fall and self-motivated action around consumption, lei- 
sure and amenities is a more important feature of social cohesiveness and interaction. 
Scenes contextualize the individual through amenities and consumption-based expres- 
sions of shared sensibilities as to what is right, beautiful and genuine. This framework 
adds to concepts such as neighborhood and workplace by specifying 15 dimensions 
of the urban scenescape. Like neighborhood and workplace, scenes reduce anomie, 
but because of their focus on consumption and the use of specific amenities, they are 
more consistent with today’s ethos of contingency, moving beyond traditional ideas of 
the fundamental power of social, family and occupational background. We introduce 
a new amenities-focused database to measure and analyze scenes and their dimensions 
for each of 40,000 U.S. zip codes. We illustrate the framework by applying it to one 
distinct type of scene, bohemia, and analyze its position in the broader social system. 





The Salience of Scenes: Culture and Urban Attractiveness 


This article proposes a new analytical framework for the study of culture and place, 
developing the concept of “scenes” as clusters of urban amenities. Although “the 
cultural turn” brought culture “back in” to sociological research (Smith 1998; 
Bonnell and Hunt 1999; Steinmetz 1999; Molotch 2003; Mohr 2003; Harding 
2007; Alexander 2003; Swidler 1986), it did so usually without locating cultural 
practices in concrete cities, spaces and places.¹ Ironically, in the years when 
sociologists stressed economic and other constraints, economists began to study cultural 
activities systematically. Terming them “amenities,” defined broadly as “non-market 
transactions,” (Glaeser, Kolko and Saiz 2004) economists have built many models 
of consumption and lifestyle. But they largely assume that individuals act in isola- 
tion and that each amenity (e.g., restaurant or museum) can similarly be analyzed 
atomistically. This research seeks to join the amenities and consumption work from 
economics and cultural geography with core social and cultural processes. It aims 
to combine the (renewed) concern for culture in sociology with a more systematic 
and comparative approach to neighborhoods, cities and regions. 


Cities and Culture 


This turn to culture as connected with distinct places and spaces dovetails with 
a shift in urban development research, which in the past decade has increasingly 
stressed cultural amenities as attracting “high human capital individuals” whose 
innovations drive economic development (Glaeser, Kolko and Saiz 2004; Florida 
2002; Clark 2004; Markusen, Schrock and Cameron 2004). Vibrant artistic com- 
munities, thriving music and theater, lively restaurants, beautiful buildings, fine 
schools, libraries and museums contribute to a better local “quality of life.” In in- 
creasingly post-industrial societies, it is claimed, where labor-intensive production 
is giving way to knowledge- and information-intensive production, more individu- 
als have more time to enjoy and define themselves by their engagement with the 
“amenities of life” (Fogel 2000). Cities are quickly becoming centers of consumption 
rather than production (Glaeser, Kolko and Saiz 2004). Culture and tourism are 
gaining momentum, adding intangible value to what is there and restructuring the 
existing stock of capital (Sacco and Blessi 2006). Residents respond by exercising 
“symbolic ownership” over the aesthetic and ethical images projected by their local 
establishments (Deener 2007). But these formulations raise many questions. 

Earlier urban development theorists did not explore specifics of culture and 
amenities. Economists (such as Roback 1982) pioneered by adding culture and 
amenities to urban research. But typically they used climate-related amenities 
(i.e., humidity, clean air) and studied their impact on land value (Zelenev 2004 
reviews this tradition). Amenities were important to urban economists if they 
increased land value, but the process of how and why was largely ignored. Some 
Continental economists (e.g., Santagata 2004) write about cultural districts, ex- 
tending industrial district ideas, but these, as in some more Marxian studies of 
consumption (Zukin 1989), frame culture and consumption as largely driven 
by broad economic changes. So does the “post-modernism” of Inglehart (1990), 
downplaying specifics of culture and politics. Florida (2002) suggests that street 
life and bicycling, rather than opera and bowling, attract creative people who favor 
multi-tasking and autonomy, although differences among “creative” lifestyles are 
not explored (Florida 2008 begins to remedy this). 

There is considerable ferment over conceptual approaches to the sociological 
study of cultural activities, evidenced in the shift from mass culture criticism to 
increased questioning about the distinctiveness of broad divisions such as “high” 
vs. “low” culture, “formal” vs. “informal,” “elite” vs. “popular,” or “passive” vs. 
“participatory” as meaningful dimensions to capture cultural experiences (Peterson 
and Kern 1996; Abbing 2006; Lizardo and Skiles 2008). In both sociology in general 
and urban studies in particular, translating the “meanings of social life,” theoretically 
and empirically, into specific analyses of the concrete role of culture in 
defining the character of places and spaces and in driving social processes has 
been difficult. “Culture” remains a contested concept with a range of meanings.² 
“Culture” typically includes the traditional high arts of opera, Shakespearean theater 
and classical symphonies. Does it also include local, authentic items like Chicago 
blues or Carolina barbecue? How about experimental, innovative avant-garde art 
galleries, cutting-edge theater and novel architectural forms? Does it extend as 
far as adding an aesthetic perspective to more standard fare: street-level culture, 
beachfront entertainment, arts and crafts fairs? These and other definitional issues 
invoke distinct paradigms and can shape competing priorities for policymakers, 
to invest in or ignore. Class, race, gender, neighborhood and political culture, in 
turn, invoke competing criteria for theoretical, ideological and policy allocation 
debates by political leaders, foundation officials, public intellectuals and an urban 
populace increasingly divided along moral in addition to class axes (Sharp 2005). 

Empirical issues are complicated not only by these issues of “high” and “low,” 
but also by the fact that cultural activity involves more than the arts. Cultural 
meanings and codes are expressed in, and define, different styles of life and situa- 
tions, shaping what it means to frequent restaurants, cafes, sporting events, parks 
and more. And culture is more than the “cultural industry” or “cultural districts” 
(Alexander 2003) because cultural amenities are not only, or even mainly, sites 
of economic activity, and their attraction is not reducible to economic factors; 
cultural amenities may well generate jobs and economic development, but they 
do so (at least in part) because they provide places where people can express their 
lifestyles, generating independent value (Currid 2007). Culture is not disem- 
bodied; cultural products and meanings exist in geographic spaces, ecologically 
distributed across neighborhoods, cities, regions and nations. Distinct urban 
cultures may emerge spontaneously in response to citizens’ lifestyles, but private 
and public actors also seek to produce them intentionally; they are both top- 
down and bottom-up. How can we see form and structure amidst such variety? 
New conceptual and empirical resources are needed. 

Enter “scenes.” As settings structuring shared cultural consumption, scenes pro- 
vide a new conceptual fulcrum for cultural analysis. Scenes include the arts, but also 
beaches, cafes, restaurants, sporting events, street life and more. Scenes join these 
together, permitting a range of seemingly diverse activities—from sipping coffee to 
listening to music to reading poetry—to be analyzed as part of one social process. 
Scenes provide ways of social belonging attuned to the demands of a culture in 
which individuals increasingly define themselves less by primordial attachments to 
home or family background or class or party or confession and more contingently 
and expressively, in terms of lifestyle and sensibility (Joas 2004). Just as 
neighborhoods and family contextualize residence and heredity, and occupations 
contextualize achievement and work, scenes provide specific social contexts for 
individuals to interact on the basis of their contingently cultivated sensibilities as to 
how to dress, eat, listen to music, look at art and more. By articulating the concept 
of scene, developing techniques for measuring scenes, and showing how the social 
consequences and correlates of scenes vary across local contexts, we lay out a research 
program that injects culture into urban studies in a systematic and comparative way.³ 


What is a Scene? The Situated Character of Urban Culture 
Social Consumption, Culture and Territory 


The arts, in particular, and consumption, in general, occupy an important place 
in recent studies of urban development (Markusen, Schrock and Cameron 2003; 
Markusen and King 2003; Glaeser, Kolko and Saiz 2004; Clark 2004; Molotch 
2003; Currid 2007; Florida 2002; Lloyd 2006; Scott 2000). Yet these rarely specify 
how distinct types of arts and amenities differentially affect urban change. Nor 
do they identify contexts within which arts and amenities are embedded—not to 
mention the effects of geographically (and temporally) varying combinations and 
densities of amenities, as well as differences in their aesthetic and ethical symbolism. 
What they lack is a conception of cultural consumption as a structured, em- 
bodied and emplaced social activity that can come in varying forms and degrees. 
To fill this gap, we draw on recent work in cultural and youth studies that has 
begun to develop the notion of scene as just such a form of activity. Irwin (1977) 
links the importance of scenes to the rise of self-conscious niche expressive and 
leisure groups and their resulting social connections, identifying certain grand 
scenes such as surfing and hippiedom that influenced broader societal values and 
lifestyles. Straw (1991, 2002) defines music scenes as “geographically specific 
spaces for the articulation of multiple musical practices,” (Straw 2002:8) stressing 
the useful flexibility of the concept (compared to the more rigid counter-culture, 
class, art world or movement)⁴ and the way scenes function as sites of fluid but 
gripping urban community, both intimate and cosmopolitan (Straw 2002, 2004 
reviews many meanings of the term in discussions of popular music). Blum 
(2003) seeks to ground scenes in “urban theatricality,” providing a “grammar” of 
scenes that includes dimensions such as mortality, transgressiveness, exhibitionism, 
extensiveness, regularity and more. Bennett and Peterson (2004) compile descrip- 
tions of various music scenes, dividing them into groups of local, translocal and 
virtual scenes; Lena and Peterson (2008) treat scenes as one stage in the life course 
of music genres. Hitzler et al. (2005) link scenes to the “rise of youth” as a specific 
phase of the life course. They stress the function of scenes as localized, typically 
part-time, voluntary meeting points for the like-minded, linked to physical places 
(bars, clubs, cafes, etc.), whose loose membership criteria and boundaries are not 
reducible to other social structures or institutions. Leach and Haunss (2009) build 
on this approach to explore the connections between scenes and social movements. 
Lizardo and Skiles (2008) summarize many aspects of the emerging “scene per- 
spective” in popular culture studies. They highlight how sub-cultural codes (e.g., 
Goth unconventionality or Club glam or Salsa authenticity) can become key ways 
to determine membership and status within a scene. 
In these discourses, the concept of scene has turned attention to the specific 
locales and places, the constellations of establishments and activities, in which 
cultural practices are articulated. Some urban development scholars have drawn 
from this work (cf., Currid 2007 on Hebdige’s 1979 discussion of punk cultural 
places), stressing that scenes are less starkly demarcated and demanding than 
oppositional counter-cultures. Here, scenes are primarily indicated by “diverse, 
open and amenity-rich places” (Currid 2007:107), constituting nodes in which 
the cultural and social generate the economic (Florida 2008). 

Our approach to scenes synthesizes many aspects of these strands of research. 
We seek to join key insights from cultural, youth, music and urban studies to 
build a more flexible and differentiated notion of scenes suitable for compara- 
tive studies about how scenes vary in specific locales and how those variations 
affect key urban development variables. Generally, these diverse approaches to 
the phenomenon suggest to us that scenes should be conceived as places devoted 
to practices of meaning making through the pleasures of sociable consumption. The 
possibilities for, and practices of, sociable consumption available in a place (its 
restaurants, cafes, galleries, clubs, stores, theaters) articulate a range of experiences 
and values, and these are what define that place as the scene it is. Thus, a scene 
is more than (1) a neighborhood, (2) physical structures or (3) persons labeled by 
race, class, gender or education. We include these but stress (4) specific 
combinations of these and activities (like attending a concert). 

These four components are in turn defined by (5) the values and meanings scenes 
enable people to actualize. General dimensions of meaning stressed often in the 
scenes literature are legitimacy, a right or wrong way to live (Haenfler 2004); theat- 
ricality, a way of seeing and being seen by others (e.g., Blum 2003); and authentic- 
ity, as a meaningful sense of identity (Grazian 2003; Urquia 2004). Scenes enable 
participants to share in a certain mood: listening to a certain style of music, dressing 
in a certain manner, eating in a certain ambiance. Such moods embody distinctive 
feelings as to what is right, genuine and beautiful. These, in turn, are transformed 
when combined in different ways: a tattoo parlor, water pipe store and modernist 
art gallery make a different scene than do a tattoo parlor, motorcycle shop, gun shop 
and biker bar. Each involves an affirmation of transgression, but that transgression 
is fused with different dimensions of meanings, like self-expression and tradition, in 
which the meaning of the whole scene changes—from Avant-garde to Don't Tread on 
Me. Thus, such dimensions can join in ideal-typical combinations such as Bohemia. 


Scenes generate meaningful social spaces of consumption rather than of work 
or residence. What matters are the CDs one listens to (jazz or indie pop or coun- 
try, say), the types of foods and restaurants one enjoys (barbecue or fusion, for 
example), the clothes one buys and wears (leather or African print), and more. 
These are not necessarily determined by how creative one’s job is: we prefer to dis- 
aggregate occupations Florida calls creative. His creative class is not a homogenous 

Table 1: Contrasting Scenes, Neighborhoods and Industrial Clusters 

                       Scene                      Neighborhood                    Industrial Clusters 
[row label illegible]  Consumer                   Resident                        [illegible] 
Basis of social bond   Lifestyles/Sensibilities   Being born and raised nearby,   Work/production 
                                                  long local relations, 
                                                  residence, heritage 



the same music or go to the same restaurants. Jobs weakly predict how people 
play. Consumption groups and occupation groups need not align (Markusen 
2006). That one values a colleague’s drive at work does not straightaway mean that 
one welcomes him to the barber shop scene or country line dance. Nor is one’s 
consumption and leisure activity in scenes straightaway determined by ascriptive, 
particularistic ties of kinship and neighborhood; a younger brother deep into the 
vegan punk scene need not share this interest with his older brother, and within 
the scene their shared blood or heritage may not bring status to the older brother. 
More important for the scene is sharing and expressing its sensibilities. 

These are analytical distinctions, and it is possible to stress one perspective 
over the other: depending on the types of practices and relationships it promotes, 
a place can be more or less a scene, more or less a neighborhood, more or less 
an industrial cluster. Overlaps may generate considerable strains and produc- 
tive tensions, and advocates of one perspective often reduce the others to their 
own. A full study of the place of the scene in the broader social system would 
need to map out the potential interactions between scenes, neighborhoods, and 
work, not to mention politics, families, and religion. Nevertheless, what is clear 
is that scenes mark arenas in which consumption can become a shareable and 
meaningful activity, and that the dynamics of this general process merit study 
in their own terms so that we can develop more systematic theories for a spatial 
sociology of consumption. 


The Internal Dimensions of Scenes: Theatricality, Legitimacy, Authenticity 


Our conception of scene seeks to capture a range of key symbolic dimensions of 
consumption, going beyond identifying amenity-rich places to the specific values 
and experiences different constellations of amenities promote. Building on past 
work that highlights the performative character of social interaction in general 
(e.g., Goffman 1959, 1974; Alexander 2003) and the role of mutual self-display 
in past scenes literature (Blum 2003), we include theatricality as one of our three 
general dimensions of scenes. Yet we treat theatricality as multi-dimensional; it is 
not only raw exhibitionism, bodies on display. We thus include four additional sub- 
dimensions of theatricality, all relevant in past discussions about scenes: deviance 
(transgression) and conformity (the formal theatricality of manners and etiquette) 
as well as outward shine (glamour) and inner intimacy 
(the neighborliness of a pub or folk scene). 

[Table 2, fragment: only two rows are recoverable] 

Dimension: Self Expressive 
Example: Enjoying hearing a jazz musician play something that could only 
be improvised spontaneously at that particular moment 
Indicator amenities: Dance Companies; Fashion Shows/Designers; Yoga Studios; 
Art Dealers; Comedy Clubs; Body Piercing; Tattoo Parlors; Recorded Music Stores; 
Vintage & Used Clothing; Custom Printed T-Shirts; Music Festivals; Fine Arts 
Schools; Graphic Design Services; Independent Artists, Writers & Performers; 
Musical Groups & Artists; Performing Arts Companies; Sound Recording Industries; 
Hobby, Toy, & Game Stores; Interior Design Services; Karaoke Clubs 

Dimension: Charismatic 
Example: Watching a Chicago Bulls game because of the charismatic aura of 
Michael Jordan rather than because one is a Chicagoan 
Indicator amenities: Designer Clothes & Accessories; Fashion Shows/Designers; 
Motion Picture & Video Exhibition; Art Dealers; Dance Companies; Historical Sites; 
Motion Picture & Sound Recording Industries; Musical Groups & Artists; Performing 
Arts Companies; Promoters of Entertainment Events; Spectator Sports; Fine Arts 
Schools; Sports Bars; Sound Recording Studios 

Still, we deny that theatricality is the Ur-scene of 
Every Scene, and so add two other broad dimensions: 
first, legitimacy. We build on authors (like Weber 1978; 
Bellah 1996; Elazar 1975; and Habermas 1981) who 
stress the moral values implicit in everyday practices 
and in scenes (as in the Haenfler 2006 discussion of 
the strong moralism in straight-edge punk scenes). But 
again we include five sub-dimensions. These highlight 
temporal aspects of legitimacy: tradition, in stressing 
the authority of the past (as in classicism); charisma, 
the auratic presence of a star performer; and utility, the 
value of future outcomes (as in, say, benefit con- 
certs). They also highlight spatial aspects of legitimacy: 
egalitarianism, the value of universal ideals (as in the 
global moralism of fair trade coffee) and individual 
self-expression, the value of personal adaptiveness to 
particular situations (the non-repeatable uniqueness 
of an improvised solo or encounter). 

Our third general dimension builds on authors 
(like Taylor 1992, 2007; Heidegger 1996; as well as 
Grazian 2003 and Urquia 2004 in the scenes literature) 
who highlight authenticity. Authenticity in discussions 
of scenes often focuses on ethnicity, but the category 
has broader significance in that the artifice associated 
with scenes’ theatricality raises questions about what 
it means to be genuine rather than phony. Thus, fol- 
lowing Taylor (1989) we suggest that there are many 
ways of getting in touch with “the real” rather than the 
fake that have been central to the cultural traditions 
out of which scenes grow and thrive. We identify five, 
starting at the particular (the authenticity of the local 
contra the foreign), and moving outward to the most 
general, through notions of the authentic self rooted 
in state citizenship (rather than class or religious com- 
munity, famously articulated by Rousseau), ethnicity 
(as in Herder), corporation (Nike vs. knockoffs, cf. 
Taylor 2007) and reason (from Kant to Hegel). Table 
2 catalogs the 15 dimensions of scenes and provides 
illustrative examples of some of their indicators from 
our national database of 650 urban amenities. 
Recognizing Scenes: Towards Systematic and Comparative Analysis of 
Urban Cultural Life 


Others have noted that assessing urban attractiveness requires studying the mix 
of amenities, built environment and people. This has typically turned researchers 
toward ethnography (Lloyd 2006) or anecdote (e.g., the tattooed programmer 
invoked by Florida 2002). We seek to retain some of the conceptual holism and 
subtlety of ethnography, and do ethnographic work ourselves. But codifying scenes 
dimensions and measuring them helps place individual cases in broader context. 

The concept of scene, consistent with the phenomenological character of eth- 
nographic approaches, permits theorizing meanings internal to urban cultural 
spaces in terms of the qualities they manifest as valuable and the holistic networks 
within which any single amenity is located. How, then, do we know what sort 
of scene exists in a given place? Our proposal is to measure the dimensions of 
scenes empirically as they are indicated by clusters of urban amenities, allowing 
us to undertake what Baudelaire called a “botany of the sidewalk.” For example, 
the combination of amenities composing a given scene may promote a sense 
of self-expressive legitimacy, transgressive theatricality, local authenticity, anti- 
rational authenticity and anti-corporate authenticity—this combination we call 
a “bohemian scene.” Another cluster of amenities might promote neighborly the- 
atricality, traditional legitimacy and local authenticity—a more “communitarian 
scene.” Our conceptual apparatus focuses on the meaning of these distinct sets of 
values created by different combinations of the core 15. One can then analyze and 
interpret combinations with far more richness than by simply counting individual 
amenities or actors or producing case studies in splendid isolation. 

Our analytical framework thus grounds systematic and comparative analysis 
of embedded urban culture. Research may proceed from inductive and de- 
ductive standpoints, and pursue intensive and extensive research strategies (of 
individual cases or large Ns). Inductively, the empirical distribution and levels 
of the 15 dimensions can generate a scene profile for neighborhoods, cities or 
metropolitan areas. Deductively, the framework helps specify theoretical ideal- 
typical scenes by ex-ante-defined combinations of sub-dimensions, against which 
empirical scenes can be measured. 


Measuring Scenes: Clustering Individual Amenities into Meaningful Scenes 


Combining our conceptualization of scenes with our amenities database allows us 
to begin to formulate and test hypotheses about the relations between (varieties of) 
cultural attractiveness and more traditional developmental factors (income, cost of 
living, etc.). How to do so empirically? By systematically scoring the meanings of 
distinct physical spaces of consumption. Operationally a scene is a specific cluster of 
amenities constituted by the ensemble of meanings or value orientations offered to 
the potential consumer. By scoring the value orientations of individual amenities 
(coding each amenity in our database on each of the 15 sub-dimensions 
with a 5-point scale) and analyzing how they combine in distinct territories 
(neighborhood, city, MSA, region...), we capture the distinct cultural experiences 
of separate territories. In our framework, the analytical units are the 15 sub-dimensions 
measured for every amenity in a territory; these dimensions are the minimal 
analytical components of the scenes approach. By contrast, the amenity (like a 
restaurant or museum) is the observational unit. We do not count amenities, but 
analyze their implicit substantive meanings. The cultural life of cities is the focus, 
not the components or size of the cultural, leisure or tourist industry. 

Ideally, the specific amenities should meet at least two criteria. First, the analy- 
sis should highlight consumption-oriented rather than production-oriented es- 
tablishments (factories are not included, while cafes are). Second, the amenity 
should be potentially present across all territories under analysis in approximately 
similar form; local users should be able to reveal their preferences by patronizing 
a shoe store or Thai restaurant if they choose. But in other localities if citizens 
prefer catfish restaurants, the local market should not prohibit a catfish restaurant 
from emerging. The amenities, such as these types of restaurants, should be linked 
with similar meanings among potential cultural consumers; they should be “func- 
tionally equivalent” (Van Deth 1998). Standardized amenities such as Starbucks 
and McDonald’s meet this criterion relatively straightforwardly; less standardized 
amenities are more difficult, for example, cultural centers (which offer diverse 
activities) or restaurants (which differ by cuisine and price) (Kaple et al. 1996). 

Because there is no systematic database of all possible amenities across U.S. 
cities that could guarantee these two conditions, we assembled a national database 
of amenities from sources where the agency constructing each variable has ideally 
been sensitive to these criteria, such as the Yellow Pages or U.S. Census Bureau. 
This maximizes coverage of potential amenities (varieties of types) and territories 
(minimal units, as zip codes), and limits definitional ambiguity. Our database 
includes hundreds of amenities such as theaters, restaurants, bookstores, dance 
companies, jazz clubs, museums, gospel choirs and liberal arts colleges. It covers 
all U.S. metro areas and rural areas, some 40,000 zip codes. Levels and changes 
in more traditional factors such as education, crime, housing prices, and ethnic 
and class demographics can be analyzed to measure their relative contributions to 
various scenes. No such massive and comprehensive database has previously been 
generated. It took us five years with some dozen assistants, and is still growing. 

We coded roughly 650 amenities from high to low on each of the 15 scene 
dimensions. Hence the analysis can “travel empirically” from the observational 
unit (individual amenities) to the minimal analytical unit (the 15 scenes 
dimensions). To compare scenes, we created a performance index for each territory by 
multiplying the number of amenities of a given type in a zip code by that amenity 
type’s score on each sub-dimension, summing these products within each of 
the 15 sub-dimensions, and then dividing each sum by the total number of ameni- 
ties in the zip code. These provide a scene profile for each U.S. zip code based on 
the 15 dimensions of legitimacy, theatricality and authenticity. The whole scene 
emerges in the combination of amenities, distinguishing for example, places with 
cafes and lawyers offices from others with cafes adjacent to nightclubs and bars. To 
be sure, scenes and zip codes are not neatly aligned, and the limits of this measure 
are legion. Other measures are no doubt possible and necessary, as is on-the- 
ground ethnographic work that can capture subtle variations of amenity usage and 
meanings. Nevertheless, this analytical profile permits analysis of consumption 
activity as a situated social phenomenon using the criteria above: (1) meanings and 
value orientations, (2) interconnected in a holistic way and (3) situated in space 
and time. The appendix discusses index construction. 
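
The index described above can be illustrated with a minimal sketch. The amenity names, the two sub-dimensions shown and all scores below are hypothetical placeholders on the 5-point scale, not values from our database; the function simply computes, for one zip code, the count-weighted mean score on each sub-dimension. 

```python
from collections import Counter

# Hypothetical 5-point codes for three amenity types on two of the 15
# sub-dimensions. These values are illustrative assumptions only.
AMENITY_SCORES = {
    "tattoo_parlor": {"transgressive": 5, "traditionalistic": 2},
    "church": {"transgressive": 1, "traditionalistic": 5},
    "cafe": {"transgressive": 3, "traditionalistic": 3},
}


def performance_index(amenity_counts, scores=AMENITY_SCORES):
    """Return the count-weighted mean score per sub-dimension for one zip code.

    amenity_counts maps an amenity type to the number of such amenities
    in the zip code.
    """
    total = sum(amenity_counts.values())
    if total == 0:
        return {}  # no amenities, so no profile can be computed
    sums = Counter()
    for amenity, count in amenity_counts.items():
        for dimension, score in scores[amenity].items():
            # count of this amenity type times its score on the sub-dimension
            sums[dimension] += count * score
    # divide each summed product by the total number of amenities
    return {dimension: value / total for dimension, value in sums.items()}


if __name__ == "__main__":
    zip_a = {"tattoo_parlor": 2, "cafe": 2}  # hypothetical zip code
    zip_b = {"church": 3, "cafe": 1}         # hypothetical zip code
    print(performance_index(zip_a))
    print(performance_index(zip_b))
```

Because the sum is divided by the number of amenities, the profile reflects the mix of meanings in a zip code rather than the sheer size of its amenity stock. 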


Analyzing Scenes: Validation by Scenescapes Analysis 


What picture of the American scenescape emerges from these profiles? Do our meas- 
ures provide a valid proxy of the cultural life of cities? Because there are no similar 
measures to contrast our proposal against using “construct-validity” (convergent or 
divergent) (Webber 1990:18-19), we initially pursue validation by “face validity” 
(are theoretical concepts and measures adequate to the judgements of researchers or 
to previous knowledge; do the scenes measures discriminate among different cultural 
contexts that are well documented by previous literature?) and “hypothesis validity” 
(can the measure illuminate theoretical relationships (Webber 1990:18-22); are the 
scenes measurements confirmed by the “culture and cities” literature?). 


Basic Descriptive Insights: Confirming Expectations of Regional and Urban 
Cultural Life by Cross-Territorial Comparisons 


Simple statistical analysis of our measures of scenes helps to document the cultural 
variations among different regions, cities and local contexts. We have pursued 
many descriptive analyses for face validity and more. Some brief examples: Scenes 
in the Northeast and West have amenities that rely more on individual self-ex- 
pression for their legitimacy, while those in the South and Midwest express more 
traditionalistic legitimacy. Scenes in the South and Midwest contain amenities 
that offer neighborly theatricality, while northeastern and especially western scenes 
manifest more transgression (see Figure 1). These fit common views. As an ex- 
ample, we tabulated glamour for each Los Angeles zip code and found Hollywood 
close to the top and Watts near the bottom. These regional differences are 
striking, as they confirm that our methods yield results consistent with broad 
expectations from other sources, and identify cultural contexts varying within an 
emerging more expressively-oriented consumer society. Equally striking are varia- 
tions among New York City, Chicago and Los Angeles as widely discussed global 
centers identified with the new economy, where rents, education, arts and culture, 
technology jobs, and young people are rapidly increasing (Gyourko, Meyer and 
Sinai 2006; Cortright 2001; Currid 2007). Yet these three cities reveal strikingly 
different patterns: from the clustering of finance in downtown New York (Sassen 
Figure 1. Regional Variation in Scenes 

[Bar chart comparing the South, West, Midwest and Northeast on four 
sub-dimensions: Traditionalistic, Self-Expressive, Neighborly, Transgressive] 


These are simple correlations, Pearson r’s, of dummy variables of the four major U.S. 
regions with some of the sub-dimensions of scenes. Each zip code is assigned 1 if it is 
within the region, and 0 if it is not. 


2001), to Mayor Richard Daley’s enthusiastic embrace of culture and aesthetics as 
central to urban policy (Clark forthcoming), to the individualism, fragmentation 
and image-building that lead some to name Los Angeles as ground zero of the 
post-modern age (Dear 1981). Critical differences appear in Figure 2. 
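
The regional correlations reported with Figure 1 pair a 0/1 region dummy with a sub-dimension score for each zip code. A minimal sketch of that calculation follows; the six zip-code records and their scores are hypothetical, and the helper names are ours for illustration. 

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(ss_x * ss_y)


def region_correlation(records, region):
    """Correlate a region dummy (1 if in region, else 0) with the scores."""
    dummy = [1 if rec_region == region else 0 for rec_region, _ in records]
    scores = [score for _, score in records]
    return pearson_r(dummy, scores)


# Hypothetical (region, transgressive score) pairs, one per zip code.
ZIP_RECORDS = [
    ("West", 4.1), ("West", 3.8),
    ("South", 2.2), ("South", 2.6),
    ("Midwest", 2.9), ("Northeast", 3.5),
]

if __name__ == "__main__":
    for name in ("South", "West", "Midwest", "Northeast"):
        print(name, round(region_correlation(ZIP_RECORDS, name), 3))
```

A positive coefficient indicates that zip codes inside the region score higher on the sub-dimension than zip codes outside it, and a negative coefficient the reverse. 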
Compared to all U.S. zip codes (scored 0), scenes in these three cities are 
legitimated more by individual self-expression and utility than by tradition and 
egalitarianism; they encourage transgression, glamour and formal codes more 
than neighborliness; and they root identities in rational calculation, the state and 
corporation more than in local culture. Broadly, “urbanism as a way of life” (Wirth 
1938; Simmel 1971) continues in the late modern city, as more abstract, formal, 
distanced social relations are linked with heightened individualism and weaker 
primordial ties. The three cities also show striking differences. Los Angeles scenes 
are defined much more by individual self-expression and glamour, highlighting 
more amenities such as art schools, arts organizations and information, movie 
theaters, dance companies, yoga studios, exercise and fitness classes, and infant 
and children’s clothing accessories. New York scenes more strongly affirm that 
identity is based in the power of reason and the stamp of the corporate brand; they 
legitimate themselves by appeals to efficiency, activity and material success, and 
promote the formal theatricality of the business suit and the dress code, featuring, 
relative to Chicago and Los Angeles, more amenities such as night clubs, book 
stores and book publishers, art dealers, designer clothing and accessories, conve- 
nience stores and delicatessens, advertising agencies and newsstands. In Chicago, 
scenes are the most neighborly, traditionalistic and egalitarian of the three, stress- 
ing more amenities such as pizza restaurants, bowling alleys, churches, parks and 
playgrounds, cemeteries and public libraries. Similar demographic patterns are 
here mediated by different cultural settings, which in turn might well account for 
divergent economic and political outcomes that would be otherwise difficult to 
capture. This all has much face validity and is consistent with recent urban scholar- 
ship. These data are simply the first to document these patterns so systematically. 

Perhaps even more striking than these differences in levels are different relations 
among the sub-dimensions of scenes. Figure 3 shows correlations within New York 
City, Chicago and Los Angeles of charismatic legitimacy by zip code, and Figure 4 
shows correlates of self-expressive individualism scores with selected sub-dimensions. 

Strikingly, in Chicago, amenities that express charismatic legitimacy correlate 
strongly with amenities that support a sense of neighborliness and appeal to equal- 
ity. By contrast, in New York City and Los Angeles, the more charismatic scenes 
are more individually self-expressive and glamorously theatrical. In Chicago, 
scenes high on individual self-expression also show a sense of corporate identity 
(for example, the fabulously post-modern Millennium Park built with massive 
corporate donations). Further, self-expression in Chicago is less strongly separated 
from local roots and abstract reasoning, and less tied to transgressiveness and 
glamorousness. In New York and Los Angeles, zip codes high on self-expressive 
individualism tend to show more transgression and glamour, less rootedness in 
the local, less faith in reason and more hostility to corporate culture. The scenes 
of these cities channel the power of charisma in different directions, some into 
individualism and transgression, others into the local neighborhood. There is no 
single track for The City of the Future, but multiple scenes structuring alternative 
responses to a social life more attuned to cultural consumption. 


Theoretical Elaborations: How Scene Analysis Reframes Bohemia 


The New York Times columnist David Brooks (2000) is credited with identifying 
a “Bobo” orientation which joins 1960s bohemian values with 1980s bourgeois 
budgets, as illustrated by President Bill Clinton. Brooks subtly describes several 
cases, especially Bobo cities such as Burlington, Vermont and Bethesda, Maryland. 
Richard Florida (2002) used Brooks’ Bobos as his core concept, but retitled it 
the “Creative Class” at the publisher's suggestion; the book relies less on “class” 
and more on Bobo-like tolerance, which Florida holds is a key driver of urban 
innovation. Richard Lloyd’s “neo-bohemia” (2006) builds on these but makes 
the strongest case for a more literal bohemia as an urban dynamic, closer to 19th 
and early 20th century classic bohemias: contra Brooks and Florida, Lloyd claims 
that creativity requires breaking eggs, challenging authority. Brooks’ work is self- 
identified “comic sociology,” and offers only subtle anecdotes as evidence. Lloyd’s 
evidence is an ethnography of Chicago’s Wicker Park neighborhood that provides 
sensitive insight but does not locate this case in comparison with others. Florida’s 
prime systematic measures of bohemian tolerance are the percent of gays and 
artists in a metro area. He correlates these with patents and other innovation 
measures, but Clark’s (2004) reanalysis suggests that gays were largely spurious, 
and education was a more important indicator of creativity. 

The classic statements of Murger, Balzac and Baudelaire focused on Paris but 
(neo-)bohemia is increasingly woven into the post-industrial political economy, 
as a testing ground for new styles and patterns of consumption, analogous to 
that of scientific and technological research on the side of production (Campbell 
1989), and a defining half of the modern spirit (Grana 1964). Our scenes ap- 
proach provides more precise tools to capture and analyze these ideas. As Murger, 
Balzac and Baudelaire suggested, an ideal-typical Bohemian scene has a distinct 
shape (see also Grana and Grana 1990). Our coding of Bohemia draws on such 
past and recent discussions to determine how a Bohemian scene combines the 15 
sub-dimensions of scenes, as shown in Table 3. 

Thus defined, a scene is more Bohemian if it exhibits resistance to tradi- 
tional legitimacy, affirms individual self-expression, eschews utilitarianism, values 
charisma, promotes a form of elitism (Baudelaire’s “aristocracy of dandies”), en- 
courages members to keep their distance, promotes transforming oneself into an 
exhibition, values fighting the mainstream, affirms attending to the local (Balzac’s 
intense interest in Parisian neighborhoods), promotes ethnicity as a source of 
authenticity (cf., Lloyd 2006:76), attacks the distant, abstract state, discourages 
corporate culture and attacks the authenticity of reason (Rimbaud’s “systematic 
derangement of all the senses”). Scenes whose amenities generate profiles that are 
closer to this ideal type receive a higher score on our Bohemian Index (measured as 
the value distance from the “bliss point” defined by Table 3). This measurement 


Table 3: Ideal-Typical Bohemia 

Legitimacy: Traditionalistic, Self-Expressive, Utilitarian, Charismatic, Egalitarian 
Theatricality: Neighborly, Formal, Glamorous, Exhibitionistic, Transgressive 
Authenticity: Local, Ethnic, State, Corporate, Rational 

[The score for each sub-dimension is illegible in the source.] 

Note: 1 is negative, 3 is neutral and 5 is positive 
Table 4: A Continuous Scale of “Bohemianness” as Dependent Variable: National 
Regression Results 

Independent Variables                                        Standardized Regression 
                                                             Coefficients (a, b) 
% 18-24 year old (1990)                                      -.003         (-.292) 
Change in % 18-24 (2000/1990)                                -.018*        (-2.155) 
% 25-34 year old (1990)                                      -.035*        (-2.307) 
Change in % 25-34 year old (2000/1990)                       -.027**       (-2.647) 
% Non-White (1990)                                           -.086***      (-10.139) 
Change in % Non-White (2000/1990)                            [illegible] 
% Baby Boomers (1990)                                        .021          (1.212) 
Change in % Baby Boomers (2000/1990)                         [illegible] 
% Retiree (1990)                                             .046***       (3.766) 
Change in % retiree (2000/1990)                              .016          (1.594) 
Total population in 1990                                     .209***       (24.982) 
Population change 2000/1990, logged                          .061***       (8.571) 
Vote cast for president, % democratic 1992                   -.006         (-.717) 
Crime rate (1998)                                            .027***       (3.422) 
% College graduate (1990)                                    .009          (.597) 
Change in % college graduate (2000/1990)                     [illegible]   (4.397) 
% Graduate/profession degree (1990)                          -.061**       (-2.826) 
Difference in % profession/graduate degree (2000/1990)       .023          (1.467) 
Per capita income (1990)                                     [illegible]   (4.636) 
Change in per capita income (2000/1990)                      -.022**       (-2.738) 

Notes: (a) t-ratios are reported in parentheses 
(b) * p < .05, ** p < .01, *** p < .001 (tests are two-tailed) 
(c) Dependent Variable: Bohemian Index 
(d) Adj R²: .053 
from a bliss point is analogous to policy distance analyses in voting (e.g., Riker 
and Ordeshook 1973). While we continue to investigate alternative measures of 
Bohemia, in practice the index identifies many neighborhoods which others cite as 
distinctly Bohemian: in Chicago, the highest scoring neighborhoods in 2000 in- 
clude Bucktown, Wicker Park, Humboldt Park and Logan Square, all commonly 
perceived as Bohemian at the time (Lloyd 2006), even if they have since changed. 
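
As an illustration of scoring by distance from a bliss point, the sketch below uses Euclidean distance and negates it so that profiles closer to the ideal type score higher. Both choices are assumptions made for the example, since the text does not fix the distance metric, and the ideal-type values shown are hypothetical stand-ins for four sub-dimensions rather than the entries of Table 3. 

```python
import math

# Hypothetical ideal-type values on the 1-5 scale for four sub-dimensions.
# They follow the direction of the verbal definition (low tradition, high
# self-expression, high transgression, low corporateness) but are NOT the
# published Table 3 scores.
BLISS_POINT = {
    "traditionalistic": 1,
    "self_expressive": 5,
    "transgressive": 5,
    "corporate": 1,
}


def bohemian_score(profile, bliss=BLISS_POINT):
    """Negative Euclidean distance between a scene profile and the bliss point.

    A profile identical to the bliss point scores 0; more distant profiles
    receive increasingly negative scores.
    """
    squared = sum((profile[dim] - target) ** 2 for dim, target in bliss.items())
    return -math.sqrt(squared)


if __name__ == "__main__":
    # Two hypothetical zip-code profiles.
    near = {"traditionalistic": 1.5, "self_expressive": 4.5,
            "transgressive": 4.0, "corporate": 2.0}
    far = {"traditionalistic": 4.0, "self_expressive": 2.0,
           "transgressive": 2.0, "corporate": 4.0}
    print(bohemian_score(near), bohemian_score(far))
```

Other metrics (for example, a sum of absolute differences) would preserve the same logic of ranking scenes by closeness to the ideal type. 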

Analyzing our Bohemian Score as dependent variable in a regression including 
all U.S. zip codes and a number of standard urban development variables (sum- 
marized in Clark 2004) provides insight into where the most bohemian American 
scenes are (see Table 4). 

Bohemian clusters of amenities are stronger in locations with larger popula- 
tions, increasing populations, more retirees, higher income, increasing numbers 
of college graduates, more crime and fewer non-whites. Levels of baby boomers, 
youth and Democratic voting (in both simple correlations and regression coef- 
ficients) are not significant at the .001 level, nor are change in income, retirees, 
youth population and baby boomers. 
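
For readers less familiar with standardized coefficients, the following sketch shows one way they can be obtained: every variable is converted to z-scores and an ordinary least squares fit is run on the result, so each coefficient expresses change in standard-deviation units. The toy data and helper names are hypothetical, and this is not the estimation code behind Table 4. 

```python
import statistics


def zscore(values):
    """Rescale a list of numbers to mean 0 and standard deviation 1."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return [(v - mean) / sd for v in values]


def solve(matrix, rhs):
    """Solve matrix * x = rhs by Gauss-Jordan elimination with pivoting."""
    n = len(rhs)
    aug = [row[:] + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def standardized_betas(predictors, outcome):
    """OLS on z-scored variables via the normal equations.

    predictors is a list of columns (one list per variable). No intercept
    is needed because every z-scored variable has mean 0.
    """
    zx = [zscore(col) for col in predictors]
    zy = zscore(outcome)
    k = len(zx)
    xtx = [[sum(a * b for a, b in zip(zx[i], zx[j])) for j in range(k)]
           for i in range(k)]
    xty = [sum(a * b for a, b in zip(zx[i], zy)) for i in range(k)]
    return solve(xtx, xty)


if __name__ == "__main__":
    # Hypothetical zip-code data: two predictors and an index score.
    population = [1.0, 2.0, 3.0, 4.0, 5.0]
    crime = [2.0, 1.0, 4.0, 3.0, 6.0]
    index = [1.2, 1.9, 3.4, 3.8, 5.5]
    print(standardized_betas([population, crime], index))
```

Standardizing puts predictors measured in very different units, such as population counts and percentages, on a common scale, which is why the coefficients in a table of this kind can be compared in magnitude. 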

We comment on three of these results. First, bohemian scenes are stronger in areas with 
higher crime rates. The “established” or “bourgeois” theory that crime indicates 
social disorganization and “would repel most residents” does not hold in a 
Bohemian scene, which inverts this anti-crime value. Our finding confirms the 
River Styx theme from Baudelaire to Lloyd. While Baudelaire noted “the magic” 
in “murky corners of old cities,” Lloyd (2006:78) stresses that “the manifest 
dangers of the neighborhood coincide with the bohemian disposition to value 
the drama of living on the edge.” 

A second set of important findings concerns age. Florida and Lloyd both stress 
the youthful nature of their neo-bohemias, but we find that retirees are more nu- 
merous in bohemias than youth. “Youth” does not necessarily translate into edgy 
creativity—there are “square” and “establishment” and many other types of youth. 
What seems to matter more is how various contexts channel and transform the 
energy of youth. Bohemias may include “old-timers” that lend a form of authen- 
ticity to the neighborhood (Brown-Saracino 2007), and the denizens of “rural 
bohemias” such as Carmel, CA may be older (Austin 1990). Moreover, there may
be a particularly strong connection between bohemias and what we have elsewhere
termed the “grey creative class.”18


Third, the finding that voting patterns are not significantly connected with 
Bohemian neighborhoods suggests that whatever sense of political legitimacy and 
activism Bohemias create often operates outside of standard notions of parties. 
The Red and Blue map is too simple. To understand how scenes generate political 
identification —in cafes, poetry groups, punk clubs and galleries—it is necessary to 
move past models building heavily on party voting.

We find more when we repeat the same basic analysis of zip codes within the
three largest cities. The main finding in Figure 5 is that in Chicago the percent
of college graduates increases in more Bohemian zip codes; this same effect is
insignificant in Los Angeles and New York. The common explanation for such
dynamics is cost or income, but these Bohemian results hold strong after we
control income and the other variables in the model.

Figure 5 summarizes regression results within New York, Chicago, and Los Angeles.
Change in college graduate share of the population is the dependent variable and
the Bohemian Index, 25 through 34-year-old population, non-white population, total
population, and per capita income are independent variables. The standardized
regression coefficient appears as the height of each bar; starred variables are
significant at the .05 level. Because the N of zipcodes is much smaller in these
city-level regressions, we reduced the number of independent variables from the
model in Table 7. Despite stronger intercorrelations among the city-level
independent variables than in the national model, results stay generally similar.
Namely, bohemia is significant within Chicago but not in L.A. and N.Y. It is also
important to note that the adjusted R2’s are higher in the L.A. (.27) and N.Y. (.24)
models than in the Chicago model (.07), again illustrating Chicago’s distinctive
urban dynamics.

Interpretation? Bohemia is no silver bullet for urban 
development. In Chicago, the Neo-Bohemian thesis 
that artist neighborhoods fuse with educated young 
people to meet the needs of the new culture-driven 
economy is empirically supported. However, the 
thesis demands contextualization, as such Bohemian 
neighborhoods are not significant attractors of the college
educated in New York and Los Angeles. In New
York, zip codes with 25-34 year olds seem sharply 
distinct from bohemian neighborhoods. Moreover, 
in New York, both college graduates and 25-34 year 
olds reside in zip code scenes that feature corporate 
authenticity (r = .326 and .238, respectively), while 
both groups are declining over the 1990-2000 decade 
in more corporate-authenticated scenes in Chicago 
and Los Angeles. Los Angeles contrasts most with the 
“neo-Bohemia leads to growth thesis” because college 
grads in Los Angeles increase more in higher income 
zip codes and with more young persons. Related: 
glamorous scenes in Los Angeles attract the young 
and educated more strongly than in the other cities
(r = .493 vs. .32 in Chicago and .17 in New York).19
Perhaps the unique ways that Chicago (as shown in 
figures 3 and 4) combines individual self-expression 
with utilitarian legitimacy and corporate authenticity 
increase the likelihood that its Bohemians may be- 
come “useful labor” (Lloyd 2006); or perhaps timing 
matters, as Chicago’s bohemian scene has bloomed 
only recently in comparison to New York's. Still, these 
strong results document the power of local scenes in 
transforming simpler national patterns. By pointing 
to specific differences in both levels and dynamics of 
scenes across three major cities, scene analysis helps 
cultural analysts become more conscious of the multiple institutional and other
mechanisms that join to create specific types of scenes.


A Scenes-Based Program for Cultural Policy and Urban Studies


Leisure and consumption as vehicles of personal expression have increased in the 
past century. This brings, as Nobel economist Robert Fogel (2000) suggests, a need 
to engage in questions about quality of life that cuts across class divisions. If Fogel 
is right that, in a society where leisure time has massively risen, “non-material” or 
“spiritual” goods and inequalities are increasingly becoming key drivers of social 
change (as Inglehart 1990 and others stress), then differences about which spiritual 
goods and how to arrange them will become increasingly central social and policy 
questions. Scenes-based research suggests one way to address this more precisely. 

There is little use in speaking of the coming of the creative class or the rise of 
Neo-Bohemia in flexible capitalism or the advent of omnivorous cultural con- 
sumption among the new elite or the transformation to a knowledge economy or 
the transition to post-industrialism. Accepting these as important general trends, 
the more critical and sensitive question then becomes the concrete one about which 
creativity (and where), which Bohemianism (and where). The scene within which 
any of these processes occurs not only shapes the direction toward which they 
move, it helps to define what it means to be creative or Bohemian or omnivorous 
or knowledgeable or beyond industry, and so to pursue the goals associated with 
those terms. These are not clear uncontested concepts, as was illustrated by the 
positive and negative takes on crime in Bohemian and non-Bohemian scenes. 
As leisure and consumption increase in salience, disputes over how to answer 
questions of the sort captured in our 15 scenes dimensions are likely to heighten, 
sometimes as new points of conflict. 


These observations suggest eight axial points of a scenes-based agenda for urban 
and cultural policy studies: 


1. Conceptualize the city as pluralistic, diverse, filled with competing
subcultures. Government typically acts in distinct policy arenas
such as housing or culture, which differ just like neighborhoods.
We see the world more as an ecology of games and scenes than
as a monolithic unity.


2. Identify growth dynamics of distinct scenes. Identify scenes with
neighborhoods (via zip codes etc.). Invest in key amenities to
make each scene more vital, relying on its impact on the specific,
local scenescape.


3. No city represents the nation or the world. There is no Middletown. 
Disputing Michael Dear’s claim that L.A. is “the city of the 
future,” our more culturally relativistic perspective suggests in- 
stead: No one city is The Future. 


4. In addition to production, feature consumption.


5. Culturally strong neighborhoods remain separate from the workplace.
Chicago's remarkably rich neighborhoods differ from the
European social democratic tradition, where workers would 
reside in homes built near their factories, and social life was 
more driven by production. Explore the implications of such 
work/home contexts as they transform scene dynamics. 


6. Multiple research methods. Use in-depth cases, oral history, eth- 
nography, content analysis, archival history, voting, interviews 
of leaders, qualitative, quantitative and more. 


7. Include the metro area. Think not solely of a single metropolitan 
government, but look for cooperative, voluntary civic and in- 
tergovernmental patterns, some built from specific agreements 
among local governments and private contracting groups, oth- 
ers involving citizen values that lead them to prefer one location 
over another. 


8. Connect global changes in many urban dynamics with local
interpretations of those changes. Theorizing more precisely about
multiple levels of socio-economic processes—from global to 
metro to zip code—can lead to more precise operational models 
which methods such as Hierarchical Linear Modeling can help 
assess and calibrate. 


All of these require elaboration. The effort of this research has been to show 
how the concept and reality of scenes provide a new, powerful tool to help do so. 

Our concepts and our data can be fruitfully expanded and merged with other 
approaches to enhance the power of each. We are working with teams from 
Finland, France, Spain, Portugal and Korea to specify cross-national scenes
patterns. Our ongoing work seeks to expand and refine our concepts and methods to
distinguish globally-shared and nation-specific patterns, and then to generalize the 
nation-specific results. For example, “Korean scenes” often feature extended
families (in weddings, funerals and college preparatory schools). We analyze the
impact of these variables on others, such as rent. Indeed, rent differences across
Korean scenes are strongly influenced by college prep schools, which is not so in
the United States
(Lee, Clark and Anderson 2007). These analyses need to be combined with more 
narrow economic variables or individualistic/socialization-oriented discussions of 
identity or self-realization. Studies of race, ethnicity, religion, class as well as civic 
groups and neighborhoods can be enhanced and sharpened by adding these scenes 
components, highlighting the specific cultural setting within which any of these
operates, and allowing for systematic comparisons of the relative impact of each.


Notes


1. To be sure, some geographers and sociologists have recently discussed the concept of
place or space (much of which is summarized in Gieryn 2000 and Relph 2006).


2. See, e.g., Parsons 1951; Alexander 2003; Griswold 2004; Swidler 1986; Sewell 1999.


3. Even if we frequently use the term “urban,” most comments and data cover all local
contexts. 


4. By contrast, Tanner et al. (2010) stress that the concept of scene should not be
overextended, and that its applicability in particular to oppositional sub-cultures, such as
some types of rap music, is limited. 


5. There is no doubt that the emergence of scenes as an increasingly important social
formation generates new social strains, just as the differentiation of production and 
residence continues to do. Analysis of the interchanges and interpenetrations among 
scene, family, work, politics and religion is an important subject of our further work, 
as is how scenes change in and organize experience of time. Likewise, analysis of 
virtual scenes is an important field of research, although, following Gieryn 2000 and 
because of the distinct role of theatricality and hence visibility in scenes, we consider 
only non-virtual scenes here. 


6. From the perspective of work and class, the experiences in scenes are commonly
interpreted as promoting or opposing the interests of different classes—elite art for 
the elite class, mass art for the non-elite, both judged by how they block or support 
the dominating or emancipatory interests of classes, depending on where one stands 
(Bourdieu 1984; DiMaggio 1982). From the perspective of the traditional residential
neighborhood, the looser, more transient glue that holds a scene together can seem to 
offer short-term commitment, shallow friendships and anomie, unlike the deep ties 
of classic neighborhoods (Wirth 1938; Sennett 1998). From the scene perspective, 
the job one holds and the place one lives are driven by the scenes of which they are a 
part. For example, Lloyd (2006) shows that in Chicago's Wicker Park scene, “coolness” 
drives hiring decisions rather than the other way around. Clark (2004) shows that 
amenities drive location decisions (see also Florida 2002 and Brooks 2000). 


7. This move is understandable, as the data to study such questions have often
been unavailable or difficult to acquire. This is hardly surprising, given that the 
cultural sector has traditionally been subdivided: those interested in opera or ballet 
have not considered restaurants or bookstores, while others exploring football or 
country music have ignored museums and jazz clubs. Omitting these associated 
key elements of a scene, however, has meant that past estimates of how amenities 
have an impact on urban development have been misspecified, statistically biased 
by omission of key variables. 


8. We have elsewhere identified 12 ideal-typical scenes such as Disney Heaven, Bobo’s
Paradise, Black is Beautiful, that variously combine the 15 sub-dimensions. See Clark
(2007).


9. The coding process required many details and fine judgments and tests for validity and
reliability reported elsewhere. Our general strategy was to use the coding process to add
operational detail about each dimension. We computed intercorrelations among coders’
results almost weekly, and if they fell below .8, we would meet, discuss and add more
conceptual consistency in writing with more detail about how and what to code, to
make the criteria as explicit and consistent as possible for any future coder/analysts to
use or recode differently. See scenes.uchicago.edu for the complete database and scores.


10. This operational option does not preclude the possibility of applying the same
framework to non-physical amenities such as cultural events: regular annual
celebrations, festivals, fairs, bike rallies and the like.


11. In comparative analysis it is critical to define the “theoretical unit” to be compared, 
because the result should make reference to this unit, not to the unit used to observe 
and/or measure the analytical properties intended to be studied (cf., Przeworski 
and Teune on “levels of analysis” vs. “level of observation” (1970:49-50)). In cross- 
national or cross-city analysis, the analyst has to transcend names of the city or 
country and interpret the analytical meaning they represent (Przeworski 1987). Our
approach to scenes follows a similar logic. 


12. Our database combines information from the surveys of individual attitudes and 
behavior from the DDB Lifestyle Survey, amenities from The Urban Institute, online 
Yellow Pages, the Census of Economic Activity, and various socio-economic variables 
as causes and correlates of scenes (scenes.uchicago.edu). Considerations of feasibility
and cost-efficiency also guided our selections (Kaple et al. 1996).


13. For neo-bohemians, “sharing the streets with... nonwhite residents... is part of their 
image of an authentic urban experience.” (Lloyd 2006:78) 


14. Operationally, we subtract the distance of each zip code on each of the 15 dimensions 
from the Bohemian “bliss point” defined in Table 6. We then aggregate these 15 
distances and take the reciprocal score. 
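
The computation described in this note can be sketched in Python as follows. This is only a minimal illustration under stated assumptions: the use of absolute differences, aggregation by simple summation, the small constant `eps` that guards against division by zero, and the three-dimension bliss point and zip code scores are all hypothetical. The actual bliss point is the one defined in Table 6 and covers all 15 dimensions.

```python
def bohemian_score(zip_scores, bliss_point, eps=1e-9):
    """Reciprocal of the aggregated distance from the bliss point.

    Assumptions (not confirmed by the text): distance on each dimension
    is the absolute difference, and the distances are aggregated by sum.
    """
    if len(zip_scores) != len(bliss_point):
        raise ValueError("scores and bliss point must cover the same dimensions")
    total_distance = sum(abs(z - b) for z, b in zip(zip_scores, bliss_point))
    # eps keeps the score finite when a zip code sits exactly on the bliss point
    return 1.0 / (total_distance + eps)


# Hypothetical three-dimension example (the real index uses 15 dimensions).
bliss = [5.0, 1.0, 3.0]
zip_near = [4.0, 2.0, 3.0]   # total distance 2
zip_far = [2.0, 4.0, 3.0]    # total distance 6

print(bohemian_score(zip_near, bliss))  # roughly 0.5
print(bohemian_score(zip_far, bliss))   # roughly 0.167
```

Taking the reciprocal means that zip codes closer to the bliss point receive higher scores, which is the property the index needs.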


15. For the regression model, we included both level and change measures for a number 
of standard variables in the literature. Including so many independent variables raises 
the possibility of statistical bias generated by multi-collinearity. We omitted any 
variables with intercorrelations (Pearson r’s) over .5 and then substituted the omitted
variables in alternative specifications to look for consistent results. We applied log 
transformations to a few skewed variables like population size. Some variables still show 
high kurtosis scores, mainly generated, it seems, by a higher concentration of amenities 
in metropolitan areas. This is compounded by the fact that the U.S. Census omits many 
zip codes due to confidentiality concerns. We are currently conducting further statistical 
analyses to determine the extent to which these distributional biases affect results. 
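
One way to picture the screening step in this note is the short Python sketch below. It is an assumption-laden illustration, not the procedure actually used: the greedy ordering (a variable is omitted if it correlates above the threshold with any variable already kept), the variable names and the toy data are all hypothetical.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def screen_collinear(variables, threshold=0.5):
    """Split variables into kept and omitted lists.

    A variable is omitted when |r| with any already-kept variable
    exceeds the threshold; omitted variables can later be substituted
    into alternative specifications.
    """
    kept, omitted = [], []
    for name, values in variables.items():
        if any(abs(pearson_r(values, variables[k])) > threshold for k in kept):
            omitted.append(name)
        else:
            kept.append(name)
    return kept, omitted


# Toy data for five hypothetical zip codes.
toy = {
    "var_a": [1, 2, 3, 4, 5],
    "var_b": [2, 4, 6, 8, 11],    # nearly collinear with var_a
    "var_c": [1, -1, 0, -1, 1],   # uncorrelated with var_a
}
kept, omitted = screen_collinear(toy)
print(kept, omitted)
```

Because the result depends on the order in which variables are considered, rerunning the model with the omitted variables swapped in is a sensible check on consistency.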


16. Although change in income and change in 25-34 year olds are significant at the .01 
level, and change in 18-24 year olds and level of 25-34 year olds are significant at the 
.05 level (all negative). 


17. Lloyd cites the complete verse: “In murky corners of old cities where/everything—horror 
too—is magical,/ I study, servile to my moods, the odd/and charming refuse of humanity.” 

18. A paper called “The Grey Creative Class: Why it is Critical for Cities and Culture,” 
is in draft, and is available from the authors on request. 

19. It also may be worth noting that, while in N.Y. and L.A., youth and education tend
to point in the same direction (both groups tend to rise in relation to the same
dimensions), in Chicago the two often point in different directions (the educated are
rising in Chicago's glamorous scenes, but youth are declining).


Appendix. Building Measures of Scenes


Table 2 provides illustrative samples of our amenities indicators for the 15 dimen- 
sions. In producing our indexes, 140 amenities were used, drawing from the U.S. 
Census of Business, and over 500 more from online yellow pages services and the 
Urban Institute’s Unified Database of Arts Organizations. A team of coders scored 
each amenity 1-5 for each of the 15 scene dimensions, with 5 positive, 3 neutral, 
and 1 negative. Our approach to systemizing the coding was to try to lay out highly 
detailed criteria, then to structure them via simple decision-trees that would auto- 
mate the process. Whenever we found intercoder reliability fell below r=.8 we would 
add more detail to the criteria and decision-trees. Our operational definitions are 
thus highly detailed. We generated dozens of pages of definitional criteria for the 15 
scenes sub-dimensions, available to others with complete scoring and amenity lists
at http://public.me.com/tnclark1/Scenes Project Data and Syntax. 
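
The reliability check described above can be illustrated with a brief Python sketch. It assumes that reliability is computed as the pairwise Pearson correlation between coders' scores on the same amenities; the coder names and scores are hypothetical.

```python
import math
from itertools import combinations


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def low_reliability_pairs(scores_by_coder, threshold=0.8):
    """Return coder pairs whose correlation falls below the threshold."""
    flagged = []
    for first, second in combinations(scores_by_coder, 2):
        r = pearson_r(scores_by_coder[first], scores_by_coder[second])
        if r < threshold:
            flagged.append((first, second))
    return flagged


# Hypothetical 1-5 scores from three coders on the same five amenities.
scores = {
    "coder_a": [5, 4, 3, 2, 1],
    "coder_b": [5, 4, 3, 1, 2],
    "coder_c": [1, 5, 3, 2, 4],
}
flagged = low_reliability_pairs(scores)
print(flagged)
```

Any flagged pair would signal that the criteria and decision-trees for that dimension need more detail before coding continues.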

The performance score measures all of a scene’s amenities. A few stylized, extreme
examples illustrate how it works. Let us imagine an absolutely pure transgressive 
scene — Berkeley's Telegraph Avenue to the max. And let us say that this scene con- 
tains 5 amenities, and that, in this example, all of these are body piercing studios and 
tattoo parlors. All would receive a score of 5 on the sub-dimension of transgressive 
theatricality. If we then multiply each of the 5 amenities by its transgression score 
of 5, we find that each one is putting out 5 units of transgressiveness (5
“transgressies,” we could say). The sum total of transgressiveness in the scene would thus be
25 transgressies. If we then divide this total by the total number of amenities in the 
scene (5), we find that the average experience of transgression in this scene is...5! 

Now, let’s consider what the performance score on transgressive-theatricality 
would be for a stylized version of Chicago's Wrigleyville. Let us assume that this 
scene has 2 body piercing salons, 4 bars, 2 Chinese restaurants, and 1 Starbucks. 
In this case, each body piercing salon would receive a 5 for transgression, the bars
might receive 4’s, the Chinese restaurants 3’s (neither promoting nor denying 
resistant behavior), and the Starbucks a 2 (their standardization and ubiquity, let 
us say, may be viewed as impediments to instituting transgressive practices). So, by 
multiplying the number of each type of amenity by its transgressive-theatricality 
score, we see that the body piercing salons would be generating 10 transgressies, 
the bars 16, the Chinese restaurants 6, and the Starbucks 2. If we sum those, we 
find that the whole scene provides 34 units of transgression to its consumers. 
Divide that total by the number of amenities in the scene (9), and we see that the 
average experience of transgression per amenity in this scene is...3.8. Still more 
than a neutral experience (3), but not as intense as Telegraph's 5. 
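
The arithmetic in the two stylized examples can be reproduced with a few lines of Python. This is a minimal sketch: the `Amenity` record and the `performance_score` function name are introduced here for illustration only, and the counts and scores are the ones given in the examples above.

```python
from collections import namedtuple

# kind: amenity type; count: how many are in the scene;
# score: coder score (1-5) on one sub-dimension.
Amenity = namedtuple("Amenity", ["kind", "count", "score"])


def performance_score(amenities):
    """Average score per amenity: sum(count * score) / sum(count)."""
    total_amenities = sum(a.count for a in amenities)
    if total_amenities == 0:
        raise ValueError("scene has no amenities")
    total_units = sum(a.count * a.score for a in amenities)
    return total_units / total_amenities


telegraph = [Amenity("body piercing studio or tattoo parlor", 5, 5)]
wrigleyville = [
    Amenity("body piercing salon", 2, 5),
    Amenity("bar", 4, 4),
    Amenity("Chinese restaurant", 2, 3),
    Amenity("Starbucks", 1, 2),
]

print(performance_score(telegraph))               # 5.0
print(round(performance_score(wrigleyville), 1))  # 3.8 (34 units / 9 amenities)
```

Dividing by the number of amenities makes the score an intensity measure, so a small but pure scene can outscore a larger, more mixed one.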


Measuring Government Effectiveness and Its Consequences 
for Social Welfare in Sub-Saharan African Countries 


Audrey Sacks, University of Washington 
Margaret Levi, University of Washington and University of Sydney 


We introduce a method for measuring effective government and modeling its conse- 
quences for social welfare at the individual level. Our focus is on the experiences of 
citizens living in African countries where famine remains a serious threat. If a govern- 
ment is effective, it will be able to deliver goods that individuals need to improve their 
social welfare. At a minimum, effective governments facilitate reliable access to food
for their citizens. We assess this conception of effective government via a multi-level
model from 17 sub-Saharan countries sampled in 2005 by Afrobarometer. We find 
that citizens who live in regions and in countries with a civil bureaucracy, reliable law 
enforcement and good infrastructure enjoy higher levels of food security than those 
who live in regions with weaker institutional penetration. 








An effective government is one that is capable of protecting the population from 
violence while ensuring the honesty and competence of its bureaucracy and en- 
abling the provision and maintenance of infrastructure that makes possible the 
exchange of goods and delivery of services (Levi 2006).1 The more effective
government is, the higher the level of social welfare, ceteris paribus.

In countries that face the risk of famine, more effective governments should 
lead to higher levels of individual food security, our indicator of social welfare. In 
this article, we introduce and test this argument using individual-, neighborhood-,
district- and country-level data from 17 sub-Saharan countries sampled in 2005. 
Our results are intuitive but strongly grounded in the empirical evidence: effective
governments facilitate reliable access to food for their citizens.

Our modeling strategy can be extended to identify the links between the qual- 
ity of government and other indicators of social welfare, such as mortality and 
housing, which are particularly relevant to developed countries. However, in 
this article, we limit our focus to the most salient issues for some of the poorest
countries in the world.


Measuring Effective Government


A substantial and growing body of literature reveals a significant relationship be- 
tween the quality of government measured at the country level and country-level 
development outcomes including economic growth, infant mortality and adult 
literacy (Kaufmann et al. 1999; Knack and Keefer 1995; La Porta et al. 1999). 
The majority of these studies rely on indicators of government quality, including 
reliance on the rule of law and the probability of expropriation, political stabil- 
ity, voice and accountability and control of corruption. The resulting research 
significantly enhances our ability to measure and assess the quality and role of 
government institutions. 

Despite their usefulness, studies relying exclusively on country-level indicators 
to identify the impact of government on social welfare are limited for one key 
reason. There is reason to believe that the quality of government institutions varies 
within countries, as well as between countries. Government responsiveness, cor- 
ruption and reliance on non-public resources vary considerably among localities 
with consequences for citizen understanding of and relationship to government 
(Gibson and Hoffman 2005). Large regional discrepancies exist in the expansion 
and penetration of physical infrastructure, often because of underlying economic 
conditions. For example, the British, French and other colonialists concentrated 
infrastructure in high-value zones where minerals could be extracted or where 
plantation agriculture could be practiced, to the neglect of other locales (Banerjee 
et al. 2005; Herbst 2000; Mann 2008). Contemporary governments often do the 
same. Quantitative, cross-country studies that only model country-level indicators 
will miss within-country variations and therefore may under- (or over-) estimate 
the effect of governmental characteristics on local social conditions. 

In this research, we present and test a model that describes how individuals’ 
experiences with government activities have consequences for their social welfare. 
Our model promises to do what macro-models cannot: identify how government 
institutions affect individual-level outcomes. Our success depends, however, on 
the availability of certain kinds of micro-level data. We were fortunate to find a 
source in the Afrobarometer surveys. This data is drawn from Africa, the continent 
with the most widespread malnutrition and most widespread famines. Of the 21 
famines that occurred worldwide in 1970-1999, all but two (Bangladesh in 1974
and North Korea in the late 1990s) occurred in sub-Saharan Africa (von Braun
et al. 1999). Further, of the 39 countries worldwide that faced food emergencies
at the beginning of 2003, 25 were in Africa (Clover 2003).

Analysis of Afrobarometer data permits us to investigate whether the level 
of infrastructure development, the nature of the bureaucracy and the quality of 
law enforcement explain a significant amount of variation in individuals’ food
security beyond an individual’s and country’s wealth endowments. The empirical 
modeling strategy we adopt in this research allows us to make inferences about 
the impact that government (measured at the individual, neighborhood, district
and country levels) has on individuals’ food security. Although our empirical
modeling is limited to only 17 countries on one continent at one point in time, 
it nonetheless provides a new and promising approach for understanding what 
features of government, if any, matter for improving individuals’ social welfare. 


Defining Food Security 


As the most basic human need, food security is a core indicator of absolute physical 
well-being. Food security refers not only to an adequate aggregate supply of food but 
also to a condition when all people at all times have physical and economic access 
to sufficient, safe and nutritious food to meet their dietary needs for an active and 
healthy life (FAO 1996; Lovendal and Knowles 2005). The effectiveness of a govern- 
ment should not be evaluated solely on the basis of the number of people it helps to 
feed but also by the quality and reliability of the food supply it is helping to support. 
Those who are food insecure and vulnerable do not constitute a homogeneous 
group. For some, food insecurity is chronic. Others may experience food insecurity 
intermittently or on a seasonal basis. There are also individuals who are potentially 
food insecure; although they have enough food today, they face a high probability 
of not having enough food in the future. Households may achieve temporary food 
security at the cost of becoming highly vulnerable to future food insecurity (von 
Braun et al. 1992). Food security thus constitutes a necessary but insufficient condi- 
tion for an individual’s attainment of an adequate level of social welfare. 


Effective Government and Food Security 


Risks to food security are numerous and originate from a variety of public and 
private sources including a country’s political and economic institutions, the envi- 
ronment, nature, pandemics and idiosyncratic ill health. In his studies of famines, 
Sen (1976, 1981) observes that households can experience food insecurity even 
when there is an abundant food supply and a functioning market. An overview 
found that 21 of the 32 major 20th century famines were primarily caused by
conflict or poor government policies (Devereux 2000). Many other famines that 
were triggered by droughts or floods were aggravated by governments’ policies 
(Devereux 2001; Whiteside et al. 2002). 

We, too, focus on the relationship between the quality of government and food 
security. Our argument, consistent with a distinguished tradition of scholarship (e.g., 
Gerschenkron 1962; Hirschman 1971; O’Donnell 1978), is that sound government 
policies and planned and sustained government intervention are essential for social 
welfare and economic development. Specifically, we focus on the three distinct 
features of a state that have consequences for food security: the development of a 
civil bureaucracy, the prevalence of law and order and the extent of physical
infrastructure (see Figure 1). If a government fails to facilitate a minimal level of social
welfare for its citizens, it will likely face difficulties in establishing authority and 
building legitimacy in the long run (Lake 2009; Levi and Sacks 2009).

Figure 1. Conceptual Framework

Figure 1 illustrates our theoretical model. We argue that an effective government is one that
is capable of providing a civil bureaucracy, law and order, and physical infrastructure. We
expect social welfare to be higher where governments are more effective, ceteris paribus.

National governments have control over each of these to varying degrees, although
their implementation is likely to vary regionally within countries. Although
corporations or donors sometimes build infrastructure, its continued maintenance
and protection nonetheless depend on government actions and agencies.
Although non-state actors, in particular NGOs, and bilateral and multilateral 
donors continue to pay for, or directly deliver, health and food in developing 
countries, they can do so only to the extent that they know where the needy are 
and can get to them safely with their goods and services. In the relatively peaceful 
countries we are studying, the quality of government should influence where they 
concentrate their normal (as opposed to disaster relief) work. 

Given the territorial unevenness of bureaucracy, law enforcement and physical 
infrastructure within countries, we expect to observe within-country variations in 
these variables and thus, within-country variation in food security. We will also 
investigate the extent to which variation in food security correlates with geography, 
rural/urban divides and status groups defined by class, ethnicity, gender and religion. 


Civil Bureaucracy 


There is a long line of research originating with Max Weber emphasizing how 
the quality of bureaucracies affects economic development. The introduction of 
a relatively professional, meritocratic and efficient administrative structure was 
a critical factor in the rapid development of East Asia (Amsden 1989; Johnson 
1983; Wade 1992) and its absence in other areas is an obstacle to state-building 
(Callaghy 1984; Evans 1989; Roth 1968). Red tape, patronage and corruption 
generally limit economic growth (de Soto 1989). 

A corrupt and inefficient public administration tends to siphon public re- 
sources away from the poor, slowing the pace of improvements in social welfare 
and possibly even undermining the welfare of citizens. Corruption often leads to
lower net tax and customs revenue to the government. It drives into the informal
economy or to another country those firms that cannot afford the bribes or whose 
rents fail to secure the license they hoped to buy. Corruption can drive up the 
price and lower the level of government services, including the provision and 
financing of health care and education services. When teachers demand bribes 
for admitting students, the poor are crowded out of the market and lose access to 
critical services. Bribery also distorts sectoral priorities by, for example, creating 
incentives to contract for large defense projects rather than rural health clinics (de 
Soto 1989; Gray and Kaufmann 1998; Gupta et al. 2000; Mauro 1995, 1998; 
Schleifer and Vishny 1993). 

There are several reasons why competent and honest bureaucracies should have 
a positive impact on social welfare. First, by establishing meritocratic mechanisms 
for the selection and retention of qualified officials, bureaucracies reduce predatory 
activities and increase the capacity of bureaucracies to implement social policies 
that benefit a large portion of the population. States with competent and honest 
bureaucracies are therefore better able to implement their government’s objectives
and, should they choose, to deliver higher levels of goods provision (Ziblatt 2008).
Second, similar qualifications, tough selection processes, and prized and scarce 
skills combine with internal promotions and long-term careers to create among 
bureaucrats the sense of membership in a highly select club. Their shared norms 
can improve state cohesion while simultaneously increasing inhibitions against 
corrupt practices that hurt the poor (Chibber 2002; Evans and Rauch 1999). 

The third reason why a well-functioning national bureaucracy matters for social 
welfare is that it facilitates information and communication flows. Without de- 
pendable bureaucracies, governments or external aid agencies may not be able to 
acquire the information needed for accurate prediction of impending famines or 
widespread shortages of food. This is what happened in Africa in the mid-1980s 
(Buchanan-Smith et al. 1994). Good bureaucracies with good information are also
necessary for proper identification of who needs aid.


Law and Order 


A government’s ability to help ensure its citizens have sufficient access to food is
directly linked to the level of security it provides to both people and property. A
government’s failure to preserve internal peace can compromise its citizens’ food security.
There are multiple links between violent civil conflict and food insecurity. 
Recruitment of young men into militias reduces family income from agricultural 
productivity. During wars, employment opportunities also contract. Predatory 
activities of both militias and regular armies further diminish the food supplies of 
the unarmed population. To starve adversaries, militias and armies often resort to 
“scorched earth tactics” and even destroy food they cannot use. Anticipating theft 
and destruction, farmers will lose their incentive to plant crops (Paarlberg 1999). 
Wars and conflicts also disrupt normal trade activities that citizens depend on for 
their livelihoods. The destruction of bridges, roads and mines, and the diversion
of trucks and fuel for military uses interrupt the trade and transportation of food.
For these reasons, countries experiencing conflict in Africa also tend to experience 
a significant drop in food production (Paarlberg 1999). 

Even where there is relative peace, food security depends on protection against 
the loss of property. Where individuals fear expropriation or theft, they have fewer 
incentives to invest in enhancing their economic productivity through the acquisi- 
tion of education and technology. In such circumstances, they will be more likely 
to divert part of their incomes towards private security and away from insurance 
against famines. The failure of the state to protect property hurts the poorest 
disproportionately because they cannot afford to protect themselves. Individuals, 
in their need for safety, are also likely to become wary of others, even those with 
whom they once cooperated in their livelihood strategies (Bates 2001). 

Where the state fails to provide security, people seek ways of protecting them- 
selves against predation. One such way is to choose to live without goods worth 
stealing (Bates 2001). By choosing to live in poverty, people are less likely to be 
attacked. Yet, as their set of exchange entitlements declines, they are left more
vulnerable to food shocks. 


Physical Infrastructure 


The presence of physical infrastructure is likely to enhance food security. Physical
infrastructure enhances states’ ability to monitor agents and collect information 
on the population, whether it is for purposes of extraction or social service provi- 
sion. Transportation and communication technologies aid governments in reduc- 
ing corruption in a bureaucracy and improve its ability to collect taxes, leaving the 
government with more revenue to provide social services (Adams 1996; Herbst 
2000; Kiser and Sacks 2009). The expansion of infrastructure also contributes to 
the integration of communities into a larger economic and political unit. In the 
history of rural France, Eugen Weber (1976:196) noted, “Until roads spread,
many rural communities remained imprisoned in semi-isolation, limited partici- 
pants in the economy and politics of the nation.” 

The development of transportation and communication infrastructure reduces 
vulnerability to food insecurity for citizens living in areas with persistent droughts 
or floods. Despite almost analogous climatic conditions in northern China in the
1870s and the 1920s, an estimated 9 to 13 million people died during the former
period while mortality reached only half a million in the latter period.
One can attribute this difference, in part, to greatly improved communications 
infrastructure and the construction of 6,000 miles of railway in the interim, which 
made timely relief intervention possible in the 1920s. In the Soviet Union, too, 
the critical factor that reduced vulnerability to famine was most likely the inte- 
gration of historically famine-prone regions with the national economy, through 
the development of communication and transport networks (Devereux 2000). It 
was similar in India, where the expansion of railways diminished vulnerability to
droughts beginning in the 20th century (Ravallion 1996).3


Other Causes of Food Security 


With a few exceptions, most studies model food security as a function of house- 
hold and demographic characteristics, HIV/AIDS and environmental and natural 
disasters. We control for these variables in our model while also examining an 
alternative government institution that scholars have linked to food security—the 
presence of democratic institutions. 


Household and Demographic Factors 

There are a variety of household and demographic characteristics that may directly
affect food security. For example, a loss of income reduces one’s access to food.
Income loss can result from unemployment or reduced return from farming, small
trade or manufacturing. Different households have different levels of access to assets,
influencing their ability to prevent, mitigate or cope with shocks (Collier and
Gunning 1999). Particularly vulnerable to income shocks are households depending
on subsistence farming in semi-arid conditions or those lacking alternative
income sources or access to markets. Farmers face strikingly greater risks in Africa
than elsewhere. In some areas of Ethiopia, Zimbabwe and Tanzania, near total
crop failure has a probability of about 10 percent (Lovendal and Knowles 2005).

Female-headed households and females, in general, are particularly vulnerable to 
experiencing food insecurity and illness because of a lack of access to land, education 
and health services (Paarlberg 1999). There are certain religious and ethnic groups 
that are vulnerable to discrimination, repression and likely food insecurity, as well 
(Minorities at Risk Project 2009). The rural poor are also likely to be more vulner- 
able to food insecurity than their urban counterparts due, in part, to inferior living 
conditions (Garrett and Ruel 1999; Smith et al. 2005; von Braun et al. 1993) and 
lack of formal market-based insurance and credit and savings instruments (Devereux 
2001; Heitzmann et al. 2002). Some attribute this urban/rural discrepancy to the 
political power of urban elites, arguing that urban elites use government institutions 
to direct resources to the cities and exploit the rural poor, who, by comparison, have 
weak collective action capacities (Bates 1981; Lipton 1977). 

Illness, disability and injury not only burden households with medical expenses, 
but can also adversely affect one’s employment and income. HIV/AIDS, tuber- 
culosis, cholera and sicknesses that result from poor sanitation and water quality 
are likely to affect the food security of clusters of households and communities
(World Health Organization 2002).


HIV/AIDS Pandemic

Scholars have linked HIV/AIDS to growing food insecurity and famines in sub-Saharan
Africa. As a result of HIV/AIDS, entire communities are experiencing
shortages of food due to a loss of assets and skills associated with adult mortality,
the burden of caring for sick household members and orphans, and general
changes in dependency patterns (Barnett and Whiteside 2002; de Waal 2003).


Natural Disasters, Environmental Risks 

Floods, droughts, pest attacks, landslides, high winds and other natural disasters
obviously influence food production. The disaster may be national, but the effect
is likely to vary regionally. Natural disasters raise production costs to the extent
that they destroy crops and land, increase the difficulties in accessing quality water
and generate the need for insecticides. Depending on the stock levels and the ability
of traders to bring in food from other non-affected areas, these natural risks
may or may not interrupt food supplies or lead to price increases. Environmental
risks such as desertification and declining soil fertility lead to lower crop yields and
production levels while increasing unit production costs (Lovendal and Knowles
2005; Miguel et al. 2004a).


Democratic Institutions and Civil Liberties 

Studies show that democratic institutions are positively correlated with improved 
physical quality of life, the fulfillment of basic needs, more balanced urban-rural 
development and lower income inequality (Jenkins and Scanlan 2001; Moon 
and Dixon 1985; O’Donnell and Schmitter 1986). Democratic institutions, spe- 
cifically, a vigilant press that disseminates information about impending fam- 
ines and free elections, are crucial safeguards against famines. Political elites are 
likely to work harder to improve social welfare in a democratic, rather than an 
authoritarian, political setting to avoid punishment by their constituents at elec- 
tions. Democratic institutions are also likely to cultivate a vibrant civil society that 
responds to the needs of the poor through lobbying, protests and demonstrations 
(Drèze and Sen 1989; Sen 1990).

Although competitive elections and protests may help protect populations from 
experiencing famines, democracies are not an automatic safeguard against food 
insecurity. The most representative and accountable governments are democra- 
cies, but not all democracies have effective governments, and there are relatively 
effective and even responsive governments in non-democratic states (Levi 2006). 
China is a case in point. During Deng Xiaoping’s regime, China’s total grain out- 
put increased 65 percent, and the absolute number of Chinese living in poverty 
fell from 250 million in 1978 to 34 million in 1999. These accomplishments
were achieved not through competitive elections but rather through the
provision of public goods including roads in rural areas, property security including
household control over land and access to a system of market-based exchange
(Paarlberg 2002).


Data and Methods

Data


We test our conceptual model using the third round of Afrobarometer data. 
Afrobarometer surveys Africans’ views towards democracy, economics and civil 
society with random, stratified, nationally representative samples. In 2005, trained 
enumerators conducted face-to-face interviews in local languages with 25,397 
respondents across 18 countries. The sample is designed as a representative cross- 
section of all citizens of voting age in a given country. 

The dataset used for this research has a multi-level structure; individuals are
nested within primary sampling units, which are nested within districts, which 
are nested within countries. The PSUs, which we also refer to as neighborhoods, 
are the smallest, well-defined geographic units for which reliable population 
data are available, and they tend to be socially homogenous, thereby producing 
highly clustered data. In most countries, these will be Census Enumeration Areas 
(Afrobarometer 2007). Although respondents were not sampled based on their 
ethnic affiliation, there is likely to be a high level of clustering in the dataset
around ethnicity. Across Africa, ethnicity plays a highly salient role in the alloca- 
tion of public goods (Kasfir 1979; Posner 2004). Case studies from a number of 
countries underscore instances in which African leaders have distributed goods to 
members of their own ethnic group or to their home area. 


Modeling Strategy 


Many efforts to untangle the effects of state policy on social welfare rely on 
country-level cross-national comparison. We are fortunate to have individual-level
data drawn from enough regions that we can examine much more fine-grained 
patterns and sources of variation. However, this data structure complicates our 
analysis significantly; ignoring the multi-level nature of the data would generate 
a number of severe statistical problems. When observations are clustered into 
higher-level units, the observations are no longer independent. Failure to control 
for this clustering can result in biased parameter estimates and inefficient standard 
errors. Further, intercepts may be variable across countries, and failure to control 
for this may result in biased estimates. The individual-level variables may also have 
unequal slopes across countries. In this case, a pooled estimator may be biased for 
each particular country (Guo and Zhao 2000; Snijders and Bosker 1999). 
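
The cost of ignoring clustering can be illustrated with the standard design-effect approximation, deff = 1 + (m - 1)ρ, where m is the number of respondents per cluster and ρ is the intraclass correlation. The sketch below is illustrative only: the function names, the cluster size of 8 and the value ρ = 0.2 are assumptions chosen for the example and are not estimates from the Afrobarometer data.

```python
import math


def design_effect(cluster_size: float, icc: float) -> float:
    """Kish's approximation of the variance inflation caused by clustering."""
    return 1.0 + (cluster_size - 1.0) * icc


def effective_sample_size(n: int, cluster_size: float, icc: float) -> float:
    """Number of independent observations that n clustered cases are worth."""
    return n / design_effect(cluster_size, icc)


def se_inflation(cluster_size: float, icc: float) -> float:
    """Factor by which a naive standard error understates the true one."""
    return math.sqrt(design_effect(cluster_size, icc))


if __name__ == "__main__":
    # Hypothetical values: 8 respondents per PSU, intraclass correlation 0.2.
    n, m, rho = 25397, 8, 0.2
    print(f"design effect:         {design_effect(m, rho):.2f}")
    print(f"effective sample size: {effective_sample_size(n, m, rho):.0f}")
    print(f"SE inflation factor:   {se_inflation(m, rho):.2f}")
```

Under these assumed values, treating the respondents as independent would overstate the amount of information in the sample by a factor of more than two, which is why a pooled model yields standard errors that are too small.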

We deal with these issues by estimating a multi-level model that allows us to 
estimate varying intercepts and slopes while producing asymptotically efficient 
standard errors. In addition to correcting for biases in parameter estimates and 
standard errors, multi-level models offer additional advantages. They also allow 
us to examine how covariates measured at each level in our model affect our out- 
come variable. Because we hypothesize that food security is a function not only of 
individual characteristics but also of neighborhood, district, country and ethnic 
characteristics (or context), we also include variables at each of these levels (Guo
and Zhao 2000). 

Treating our dependent variable as a binary outcome and taking into account 
the multi-level nature of our data, we estimate random intercepts for PSUs, districts,
countries and ethnic groups with the following system of equations:


Level 1 (Individual): $\log\left[\frac{P_{ijk(lm)}}{1 - P_{ijk(lm)}}\right] = \beta_{0jk(lm)} + a_{ijk(lm)}$
Level 2 (PSU): $\beta_{0jk(lm)} = \delta_{00k(lm)} + b_{0jk(lm)}$
Level 3 (District): $\delta_{00k(lm)} = \gamma_{000(lm)} + c_{00k(lm)}$
Level 4 (Country and Ethnic Group): $\gamma_{000(lm)} = \lambda_{0000} + d_{000l} + e_{000m}$

In this model, i, j, k, l and m index the individual, PSU, district, country and
ethnic group levels. The Greek symbols refer to parameters estimated at particular
levels of analysis: $\beta$, $\delta$, $\gamma$, $\lambda$ and $\xi$ refer to fixed effects at the individual, PSU, district,
country and ethnic group levels, respectively. The random effects of intercepts at
the individual, PSU, district, country and ethnic group level are described by $a_{ijk(lm)}$,
$b_{0jk(lm)}$, $c_{00k(lm)}$, $d_{000l}$ and $e_{000m}$. The logistic multilevel model expresses the log-odds as
a sum of a linear function of the explanatory variables and random-group and
random effect deviations (Guo and Zhao 2000). The log-likelihood statistic and
the AIC statistic were used to determine model fit.
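
To make the log-odds formulation concrete, the following sketch shows how a fixed linear predictor and a set of random-intercept deviations combine into a predicted probability. It is a minimal illustration, not the estimation routine used here: the function names and the numeric deviations are hypothetical.

```python
import math


def inverse_logit(log_odds: float) -> float:
    """Convert log-odds to a probability."""
    return 1.0 / (1.0 + math.exp(-log_odds))


def predicted_probability(fixed_part: float, random_intercepts: dict) -> float:
    """Add group-level deviations to the fixed part, then map to a probability.

    `random_intercepts` holds one deviation per grouping factor
    (for example PSU, district, country, ethnic group).
    """
    return inverse_logit(fixed_part + sum(random_intercepts.values()))


if __name__ == "__main__":
    # Hypothetical deviations for a single respondent's groups.
    deviations = {"psu": 0.10, "district": -0.25, "country": 0.40, "ethnic_group": 0.05}
    p = predicted_probability(fixed_part=0.2, random_intercepts=deviations)
    print(f"predicted probability of food security: {p:.3f}")
```

Two respondents with identical covariates can therefore have different predicted probabilities solely because they belong to different PSUs, districts, countries or ethnic groups.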


Concept Measurement 
Food Security 


This study’s dependent variable is whether an individual and his or her household 
members enjoyed high levels of food security within the year preceding the survey. 
Access to food is measured by answers to the question “Over the past year, how 
often, if ever, have you or your family gone without enough food to eat?” We 
dichotomized a five-point ordered response into “ever” or “never.” Going without
enough food to eat constitutes food insecurity regardless of how frequently this
event occurred within the year preceding the survey. Further, our results do not
significantly differ when we treat our dependent variable as an ordered categori- 
cal variable with the original five response categories and estimate a multi-level 
ordered logit (see Table 3). Thus, we are confident that we did not lose any sub- 
stantial statistical information by dichotomizing the dependent variable. 
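
The recoding described above can be sketched as follows. The five response labels are an assumption about how the survey item is worded and are used only for illustration; the function name is likewise hypothetical.

```python
# Assumed labels for the five ordered response categories.
FOOD_RESPONSES = ("never", "just once or twice", "several times", "many times", "always")


def food_secure(response: str) -> int:
    """Return 1 if the household never went without food, 0 if it ever did."""
    key = response.strip().lower()
    if key not in FOOD_RESPONSES:
        raise ValueError(f"unrecognized response: {response!r}")
    return 1 if key == "never" else 0


if __name__ == "__main__":
    answers = ["Never", "Several times", "never", "Always", "Just once or twice"]
    coded = [food_secure(a) for a in answers]
    share = sum(coded) / len(coded)
    print(coded, f"share food secure = {share:.2f}")
```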

Across the 17 countries in our sample, there is wide variation in the percent- 
ages of respondents who report having experienced food security in 2004 (see 
Figure 2). Only 28.1 and 28.5 percent of Zambian and Malawian respondents 
report having experienced food security in the year preceding the survey, which 
is not surprising given that at the time, both countries were recovering from 
the devastating 2002-2003 famine. By contrast, 75.1 percent and 64.3 percent 
of Cape Verdeans and Ghanaians, respectively, report having experienced food 
security in the year preceding the survey. 
Figure 2. Percent of Respondents Reporting Food Security
Figure 2 displays the percent of respondents for each country sampled by Afrobarometer
in 2005 (excluding Zimbabwe) who report food security. 


Independent Variables 


While controlling for a variety of household and demographic characteristics 
and country-level factors including precipitation and GDP per capita, we test 
the effect of the civil bureaucracy, law and order and physical infrastructure on 
food security (see Table 1 for a summary of our concepts and measurement). We 
demonstrated why these particular components of effective government should 
have consequences for social welfare and food security. 


Civil Bureaucracy 
Our model includes three measures of bureaucratic competence that are created by
aggregating individual responses to the district level because we believe that individuals
access bureaucratic services in their district headquarters. Given Africa’s generally
poor communication and transportation technologies (Herbst 2000), we do not
expect individuals, especially in large countries, to travel to their country’s capital to
obtain identity cards, for example. We selected three items from the Afrobarometer
to capture the extent to which the bureaucracy has penetrated the country and the
extent to which the bureaucracy is competent: (1) the ease or difficulty individuals
face in getting identity documents (such as a birth certificate, voter card, passport
or driver’s license); (2) the availability of places in primary school for their children;
and (3) access to household services (i.e., piped water or electricity).
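
Aggregating individual responses to the district level amounts to taking a within-district mean of each item. The sketch below assumes a hypothetical 0 to 3 "ease" score and made-up district identifiers; missing answers are skipped rather than imputed, which is also an assumption.

```python
from collections import defaultdict


def district_means(responses):
    """Average an individual-level score within each district.

    `responses` is an iterable of (district_id, score) pairs; a score of
    None marks a missing answer and is ignored.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for district, score in responses:
        if score is None:
            continue
        totals[district] += score
        counts[district] += 1
    return {district: totals[district] / counts[district] for district in totals}


if __name__ == "__main__":
    sample = [("A", 3), ("A", 1), ("B", 2), ("B", None), ("B", 0)]
    print(district_means(sample))  # {'A': 2.0, 'B': 1.0}
```

Each respondent in a district is then assigned that district's mean, so the measure varies between districts but not within them.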

We include three individual-level variables that indirectly measure the honesty 
of bureaucrats and government officials. Respondents were asked about their 
experiences with offering a bribe, gift or favor in exchange for the following: place- 
ment in schools, identity documents and household services. In states where it is 
common for bureaucratic agents to exchange services for bribes, we may observe 
that wealthier individuals, who are able to afford bribes, enjoy higher levels of 
food security. If this is the case, then what may be driving food security is not 
bureaucratic competence but the effectiveness of government in facilitating the 
exchange of bribes. We also include a World Bank country-level governance indi- 
cator, control of corruption, as a measure of bureaucratic honesty. This indicator 
measures the extent to which public power is exercised for private gain, as well as 
capture of the state by elites and private interest (Kaufmann et al. 2006). 

We include two individual-level measures of bureaucratic fairness: a variable 
indicating whether respondents perceive that the government treats members of 
their ethnic group fairly and a variable indicating whether respondents belong 
to the same ethnic group as their country’s head of state (Fearon et al. 2007).
Together, these two variables should capture the extent to which certain ethnic
groups benefit from, or perceive that they are harmed by, government patronage.


Infrastructure Measures 

Afrobarometer includes indicators of physical infrastructure that are measured 
at the PSU-level. Interviewers and survey supervisors indicated on the surveys 
whether they detected the presence of specific aspects of infrastructure in each of 
the PSUs. We include three variables that indicate whether a post office, electricity 
grid, and tarred or concrete roads were observed in each PSU. 


Law and Order 
We include three measures of law and order that we aggregate to the district level:
whether a respondent had been physically attacked, whether a respondent had
something stolen from their house, and whether a respondent faced difficulties in
obtaining help from police.


Control Variables 


Socio-Demographic Variables 

Our control variables measure demographic and general household characteris- 
tics that can affect an individual's access to food. Measuring purchasing power 
poses unique obstacles. We cannot include a direct control for household income.
Asking respondents to quantify their income can be problematic in the context
of developing economies, where individuals are often embedded in barter or
commodity exchange, rather than market economies. Thus, a question probing
respondents about their household income was not included in the third round of 
Afrobarometer surveys. There are reasonably good proxies, however. We include 
variables for whether respondents own each of the following household items: a 
television, radio, bicycle and book. 

We also control for age, whether a respondent missed work frequently due to 
physical health problems and whether the respondent is unemployed. Because 
subsistence farmers are particularly vulnerable to food insecurity, we control for 
whether the respondent produces food only for home consumption. We include a 
variable indicating whether the respondent lives in an urban or rural community. 
We also include respondents’ gender, religion and ethnicity, which may explain 
variation in food security. 


Natural Disasters and Environmental Risks 

We control for unfavorable climates by including a measure of precipitation for 
2003. The Global Precipitation Climatology Project database relies on a combi- 
nation of actual weather station rainfall gauge measures and satellite information 
on the density of cold cloud cover, which is closely related to actual precipita- 
tion. GPCP is the only source on climate that includes both gauge and satellite 
data, corrects for systematic errors in gauge measures and rejects gauge measures 
thought to be unreliable. Estimates are made at 2.5-degree latitude and longitude
intervals (Miguel et al. 2004b). The units of measurement are millimeters
of rainfall per day and the estimates are averaged annually. We averaged each
yearly rainfall estimate per 2.5-degree latitude/longitude node over all nodes in a
given country to produce an estimate of total yearly rainfall per country (Miguel
et al. 2004a, 2004b; Huffman et al. 2009).10
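
The node-to-country averaging can be sketched as below. The country labels and node values are invented, and the conversion from millimeters per day to an annual total (multiplying by 365) is an assumption about how a yearly figure would be derived, not a step documented in the text.

```python
from statistics import fmean


def country_rainfall(node_estimates):
    """Average annual-mean rainfall (mm/day) over all grid nodes in each country.

    `node_estimates` maps a country label to a list with one value per
    2.5-degree node; countries with no nodes are dropped.
    """
    return {country: fmean(values) for country, values in node_estimates.items() if values}


def annual_total_mm(mm_per_day: float, days: int = 365) -> float:
    """Convert a daily average to an annual total (assumed conversion)."""
    return mm_per_day * days


if __name__ == "__main__":
    nodes = {"X": [2.0, 4.0], "Y": [1.5], "Z": []}
    means = country_rainfall(nodes)
    for country, value in means.items():
        print(country, value, annual_total_mm(value))
```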


HIV/AIDS Pandemic 

Prevalence data for HIV/AIDS is, unfortunately, not available at the district level 
for several countries included in our sample. Although country-level HIV/AIDS 
indicators disguise the vast within-country variance of HIV/AIDS, we include two 
measures of its impact: 2005 UNAIDS estimates of both the HIV/AIDS preva- 
lence level among 18 to 49 year olds and HIV/AIDS-related mortality (UNAIDS 
2006). Because these variables are highly skewed, we use their natural log. 


Democratic Institutions and Civil Liberties 

To test an alternative argument that emphasizes the relationship between democratic
institutions and food security, we include three variables: Polity IV’s democracy
index and Freedom House’s measures of civil liberties and political rights
(Freedom House 2003-2005; Marshall and Jaggers 2009).11
Although we certainly acknowledge the importance of protest in ensuring
government accountability, the time lag between citizens’ expression of grievances
and the government’s response makes it impossible to test the effect of
collective action without longitudinal data.


Table 1: Concepts and Measurement

                                                        Level of
Variable                                                Measurement   Source
Civil Bureaucracy
  Competency
    Ease of obtaining an identity document              District      Afrobarometer
    Ease of obtaining a place in primary school         District      Afrobarometer
    Ease of obtaining household services                District      Afrobarometer
  Honesty
    Experience with offering a bribe, gift or favor in  Individual    Afrobarometer
      exchange for an identity document
    Experience with offering a bribe, gift or favor in  Individual    Afrobarometer
      exchange for a place in primary school
    Experience with offering a bribe, gift or favor in  Individual    Afrobarometer
      exchange for household services
    Control of corruption                               Country       World Bank
  Fairness
    Perception of government treatment of respondent’s  Individual    Afrobarometer
      ethnic group
    Membership in same ethnic group as head of state    Individual    Fearon et al. 2007
Infrastructure Provision
  Presence of an electricity grid                       PSU           Afrobarometer
  Presence of tarred or concrete roads                  PSU           Afrobarometer
  Presence of a post office                             PSU           Afrobarometer
Law and Order
  Experience with theft                                 District      Afrobarometer
  Experience with assault                               District      Afrobarometer
  Ease of obtaining help from police                    District      Afrobarometer


Other Country-Level Variables 


We include four country-level variables for 2004 that may affect food security, as
well as a state’s ability to provide public goods: the log of GDP per capita, the
log of the aggregate amount of food aid shipments countries received, the log of
foreign aid per capita and the log of population size (FAO 2003-2005; World
Bank 2003-2005).


Table 1 continued

                                                        Level of
Variable                                                Measurement   Source
Demographic and Household Characteristics
  Ownership of a television                             Individual    Afrobarometer
  Ownership of a radio                                  Individual    Afrobarometer
  Ownership of a bicycle                                Individual    Afrobarometer
  Ownership of a book                                   Individual    Afrobarometer
  Age                                                   Individual    Afrobarometer
  Physical health                                       Individual    Afrobarometer
  Subsistence farmer                                    Individual    Afrobarometer
  Employment status                                     Individual    Afrobarometer
  Gender                                                Individual    Afrobarometer
  Religion                                              Individual    Afrobarometer
  Ethnicity                                             Individual    Afrobarometer
  Urban or rural residence                              Individual    Afrobarometer
HIV/AIDS Pandemic
  Percent of population with HIV/AIDS (18-49 years)     Country       UNAIDS
  AIDS deaths in adults and children                    Country       UNAIDS
Natural Disasters and Environmental Risks
  Rainfall estimates                                    Country       GPCP
  Arable land per capita                                Country       World Bank
Democratic Institutions and Civil Liberties
  Civil liberties                                       Country       Freedom House
  Political rights                                      Country       Freedom House
  Democracy index                                       Country       Polity IV
Economic Development
  GDP per capita                                        Country       World Bank
  Aid per capita                                        Country       World Bank
  Amount of food aid                                    Country       FAO


Results


Using these data, we estimated the model described in the equations. Likelihood-ratio
tests of model fit suggest that including the intercepts for each of the five levels significantly
improved our model’s fit (see Table 2 for Model 1’s coefficients and z-ratios).12
Because of the difficulties of directly interpreting multi-level logistic parameters,
we focus our discussion on predicted probabilities and first differences, or changes
in the expected values, of reporting food security. To calculate the first differences,
we estimated the multilevel parameters and drew 10,000 sets of simulated coefficients
and simulated random errors from their posterior distribution. We then
set our conceptual measures for civil bureaucracy, infrastructure and law and
order that were significant at the p < .05 level in Model 1 to their minimum and
maximum values, while holding all other variables at their central tendencies,
and calculated 10,000 expected values. The first difference is E(Ymax) - E(Ymin).
We then averaged the simulated values to obtain a point estimate and sorted the
values to obtain a 95 percent confidence interval. These point estimates of the first
differences of reporting food security are graphically illustrated in Figure 3. The
vertical bars on the figure indicate 95 percent confidence intervals.
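
The simulation procedure can be sketched in simplified form as follows. This is a minimal illustration under strong assumptions, not the procedure actually run: it uses a single intercept and a single slope, draws them from independent normal distributions (ignoring their covariance and all random effects), and the means and standard deviations are invented. All function names are hypothetical.

```python
import math
import random


def inverse_logit(x: float) -> float:
    """Convert log-odds to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def simulate_draws(n, mean_b0, sd_b0, mean_b1, sd_b1, seed=0):
    """Draw n (intercept, slope) pairs from independent normals (assumed)."""
    rng = random.Random(seed)
    return [(rng.gauss(mean_b0, sd_b0), rng.gauss(mean_b1, sd_b1)) for _ in range(n)]


def first_difference(draws, x_min, x_max):
    """Return the mean and a 95 percent percentile interval of E(Ymax) - E(Ymin)."""
    diffs = sorted(
        inverse_logit(b0 + b1 * x_max) - inverse_logit(b0 + b1 * x_min)
        for b0, b1 in draws
    )
    n = len(diffs)
    point = sum(diffs) / n                 # average of the simulated differences
    lower = diffs[int(0.025 * n)]          # 2.5th percentile of the sorted values
    upper = diffs[int(0.975 * n) - 1]      # 97.5th percentile of the sorted values
    return point, lower, upper


if __name__ == "__main__":
    draws = simulate_draws(10_000, mean_b0=0.3, sd_b0=0.05, mean_b1=0.4, sd_b1=0.1)
    point, lower, upper = first_difference(draws, x_min=0.0, x_max=1.0)
    print(f"first difference: {point:.3f} (95% interval {lower:.3f} to {upper:.3f})")
```

Because the interval comes from sorting the simulated differences, it need not be symmetric around the point estimate.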

Overall, the results from these models provide considerable evidence that food 
security is related to infrastructure provision, bureaucratic competence and hon- 
esty, and law and order. We obtained almost identical results when we tested the 
model using the second round of Afrobarometer (2002-2003), which included 
controls for income and household size.

Figure 3. First Differences of Experiencing Food Security
Regression Estimates
Figure 3: To calculate the first differences, we estimated the multilevel parameters and drew
10,000 sets of simulated coefficients and random errors from their posterior distribution.
We then set the following conceptual measures to their minimum and maximum values,
while holding all other variables at their central tendencies, and calculated 10,000 expected
values: civil bureaucracy (ease of obtaining household services and perceptions of ethnic
discrimination); physical infrastructure (all three measures); and law and order (all three
measures). The first difference is E(Ymax) - E(Ymin).

First, we expected the presence of a civil bureaucracy to minimize the incidence
of food insecurity by providing individuals with easy and equitable access
to critical services. As hypothesized, there is a positive relationship between the
competence of bureaucracies, which is measured at the district level, and food 
security. When we include all of our indicators of bureaucratic competence in the 
same model, only the variable measuring the average ease of obtaining household 
services in a district is statistically significant (p < .01). Living in a district where 
it is easy to obtain household services corresponds to an increase of 9 percentage 
points in the probability of enjoying high levels of food security. 

When we include one measure of bureaucratic competence at a time, our indi- 
cators of the average ease of obtaining an identity document and a place in primary 
school are also statistically significant (p < .05) (see Table 1 in the appendix). The
first difference in the impact of a bureaucracy that provides identity documents
easily on an individual’s probability of having food security is 11 percentage points.
Similarly, living in a district where it is easy to obtain places in primary school
corresponds to an increase of 7.8 percentage points in an individual’s probability
of having food security.

We also find evidence of a relationship between bureaucratic honesty and food 
security (see Table 2 in appendix). Respondents who report having offered
bribes, gifts or favors in exchange for documents or permits or to get a child into 
primary school are more likely to go without food (p < .001). We do not find 
any correlation between reported experiences with offering bribes in exchange for 
household services and food security. 

Results suggest that there is a positive relationship between perceptions of 
government fairness and food security. A perception that the government treats
members of one’s ethnic group unfairly translates into a .53 probability that
respondents and household members have enough food. By contrast, a perception that
the government always treats members of one’s ethnic group fairly corresponds 
to a .6 probability that respondents and household members have enough food, 
a difference of 7.7 percentage points. The relationship between belonging to the 
same ethnic group as the country’s head of state and enjoying high levels of food 
security is positive but not significant (p > .05). We do not find any evidence of a 
relationship between the World Bank's country-level indicator of corruption and 
our measure of food security. 

Second, we suggested that the provision of functioning infrastructure is neces- 
sary for individuals to be able to attain food security. The presence of electricity 
grids, tarred or concrete roads, and post offices in PSUs has a direct positive effect 
on individuals’ food security. The probability of experiencing food security is 
.58 for individuals who live in neighborhoods that lack basic infrastructure. By 
contrast, living in a district with infrastructure translates into a .67 probability that
respondents and household members did not go hungry. The first difference
between those living in districts without and with infrastructure is 9 percentage
points in the probability of experiencing food security in the year preceding the survey.
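
The first-difference calculation described in the note to Figure 3 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the estimation code behind the reported results: the coefficient means and standard errors are hypothetical, the model is reduced to one intercept and one slope, and the two coefficients are drawn independently, which ignores any covariance between them.

```python
import math
import random
import statistics


def inverse_logit(x):
    """Map a linear predictor to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def first_difference(draws, x_min, x_max):
    """Average E(Y | x_max) - E(Y | x_min) over simulated coefficient draws.

    Each draw is an (intercept, slope) pair; all other covariates are assumed
    to be held at their central tendencies and absorbed into the intercept.
    """
    differences = [
        inverse_logit(b0 + b1 * x_max) - inverse_logit(b0 + b1 * x_min)
        for b0, b1 in draws
    ]
    return statistics.mean(differences)


# Hypothetical coefficient means and standard errors, for illustration only.
rng = random.Random(0)
draws = [(rng.gauss(0.3, 0.05), rng.gauss(0.4, 0.1)) for _ in range(10_000)]
fd = first_difference(draws, x_min=0.0, x_max=1.0)
print(f"first difference: {fd:.3f}")
```

Averaging over the simulated draws, rather than plugging in the point estimates once, is what lets the spread of the draws serve as a measure of uncertainty around the first difference.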

Third, we find support for our hypothesis regarding the influence of law and 
order on food security. Poor law and order, which is measured at the district level, 
is negatively correlated with food security. Living in a neighborhood with wide- 
spread crime and a poor record of police responsiveness reduces the probability an 
individual has enough food to .33. By contrast, living in a neighborhood with no 
crime and responsive police increases the probability that the average individual and 
household members have enough food to .74, a difference of 41 percentage points. 

Overall, effective governance as measured by the presence of civil bureaucracies, 
law and order, and a high level of infrastructure development, has a large cumu- 
lative effect on food security. The combined effect of developed infrastructure, 
reliable law and order and a competent bureaucracy corresponds to an extra 61 
percentage points in the predicted probability of enjoying high levels of food 
security. This finding is significant and substantial and is consistent with our
expectation that government can either help citizens attain or hinder them from
attaining high levels of food security through the provision of infrastructure, law
and order, and bureaucratic capacity.

We find that variation in food security can be attributed more to within-country 
than between-country differences. We calculate the intraclass correlation coefficient (ICC)¹⁵
for each level in the model. The ICC can be defined as the proportion of variance in
the dependent variable that can be accounted for by the group level (Snijders and
Bosker 1999:17). The ICC is only .05 for the country level, meaning that only 5 percent
of the variation in food security can be attributed to country-level characteristics.
By contrast, the ICCs for the PSU and district levels are .15 and .14, respectively.
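
As a rough illustration of the ICC definition above, the following Python sketch computes the share of total variance attributable to each grouping level. It assumes the common latent-variable convention for logit models, in which the individual-level residual variance is fixed at π²/3. That convention and the variance values are assumptions made for illustration; they are not necessarily the formula referenced in note 15, and the example is not meant to reproduce the ICCs reported here.

```python
import math

# Residual variance of the standard logistic distribution. Treating it as the
# individual-level variance is an assumed convention, not taken from the article.
LOGIT_RESIDUAL_VARIANCE = math.pi ** 2 / 3


def intraclass_correlations(level_variances):
    """Return the share of total variance attributable to each grouping level."""
    total = sum(level_variances.values()) + LOGIT_RESIDUAL_VARIANCE
    return {level: variance / total for level, variance in level_variances.items()}


# Hypothetical random-intercept variances, chosen only to show the arithmetic.
example_variances = {"country": 0.20, "district": 0.60, "psu": 0.65}
iccs = intraclass_correlations(example_variances)
for level, icc in iccs.items():
    print(f"{level}: {icc:.2f}")
```

Under this convention the ICCs and the individual-level share sum to one, so a small country-level ICC directly implies that most of the variance lies at lower levels.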


Control Variables 


As expected, we find statistically significant relationships between membership in 
certain status groups and food security. Household wealth has a positive impact 
on levels of food security. All else equal, as the socio-demographic status of
respondents shifts from low to high, the probability of experiencing food security shifts
from .47 to .78. Respondents who live in an urban rather than a rural area (p <
.01) are more likely to enjoy food security, as are women (p < .01).

The ICC for ethnicity, .04, is evidence that ethnic group affiliation has a weak 
but significant effect on food security. Membership in certain religious groups is 
associated with lower levels of food security (see Table 3 in appendix). 

We do not find any correlations between our country-level indicators of HIV/ 
AIDS prevalence, HIV/AIDS-related mortality and food security. We suspect that 
our model is capturing governments’ capacity to respond to HIV/AIDS through our 
measures of administrative capacity, law and order and infrastructure development. 
Our indicator of rainfall levels for 2004 is significant at the p < .05 level and is negatively
correlated with food security. Yet, when we include this variable in our model, the 
predicted probabilities and first differences for our measures of bureaucratic com- 
petence, infrastructure and law and order do not change (see Table 4 in appendix). 

We do not find any evidence that GDP per capita, aid per capita and food aid 
are linked to food security (see Table 5 in appendix). Economic development may 
affect food security by providing governments with the incentives and resources 
necessary to build infrastructure and to maintain a civil bureaucracy. We find 
moderate correlations between GDP per capita and our measures of infrastructure 
development and administrative capacity (Pearson’s r > .10). 

Neither do we find any correlation between our measure of democracy, Polity
IV’s democracy index, and food security. Regression results suggest that Freedom
House’s measures of political rights and civil liberties are both significantly related
to food security (p < .05). Including these two variables does not affect our estimates
of bureaucratic competence, law and order and infrastructure development.¹⁶ (See
Table 6 in appendix.)
Table 2: Multi-level Logistic Regression on High Levels of Food Security

                                                          Estimate            z-ratios
Socio-demographic Variables
(Intercept)                                               -2.16 **            -8.74
Age                                                       -.01 [illegible]    -7.50
Rural                                                     [illegible]         -2.65
Female                                                    [illegible]         3.14
Physical health: missed work once or twice                [illegible]         -13.30
Physical health: missed work many times                   [illegible]         -14.65
Physical health: missed work always                       -.48 ***            -6.06
Physical health: don't know                               -.48 *              -2.42
Employed                                                  [illegible]         4.56
Own a television                                          .52 [illegible]     13.47
Own a book                                                .44 ***             13.04
Own a radio                                               [illegible]         4.29
Own a bicycle                                             [illegible]         5.47
Bureaucratic Competence and Honesty
Ease of obtaining identity document - easy                .03                 .20
Ease of obtaining identity document - never tried         -.45                -1.66
Ease of obtaining identity document - don't know          [illegible]         2.26
Ease of obtaining place in primary school - easy          [illegible]         1.52
Ease of obtaining place in primary school - never tried   [illegible]         1.08
Ease of obtaining place in primary school - don't know    -.13                -.28
Ease of obtaining household services - easy               .42 **              2.60
Ease of obtaining household services - never tried        [illegible]         [illegible]
Ease of obtaining household services - don't know         .21                 .59
Government treatment of ethnic group - always fair        [illegible]         9.57
Government treatment of ethnic group - NA                 [illegible]         [illegible]
Government treatment of ethnic group - don't know         [illegible]         4.15
Ethnic group of head of state                             .20                 1.76
Law and Order
Ease of obtaining help from police - easy                 [illegible]         [illegible]
Ease of obtaining help from police - never tried          .20                 .84
Ease of obtaining help from police - don't know           .24                 .44
Experience with theft - never                             [illegible]         4.93
Physically assaulted - never                              .08 [illegible]     2.75
Infrastructure Provision in PSU
Roads                                                     [illegible]         4.49
Post office                                               .12 [illegible]     2.96
Electricity grid                                          .13                 2.92
Random Intercepts                                         Variance            SD
τ² (PSUs) (N = 2,638)                                     .06                 .24
τ² (Districts) (N = 1,268)                                .19                 .43
τ² (Ethnic groups) (N = 341)                              .15                 .39

*p < .05  **p < .01  ***p < .001
Table 3: Multi-Level Ordered Logit on High Levels of Food Security

                                                          Estimate            z-ratios
Socio-Demographic Variables
Age                                                       [illegible]         -11.27
Rural                                                     [illegible]         -3.90
Female                                                    [illegible]         3.21
Physical health: missed work once or twice                [illegible]         7.67
Physical health: missed work many times                   [illegible]         3.08
Physical health: missed work always                       -.02                -.41
Physical health: don't know                               [illegible]         1.11
Employed                                                  [illegible]         8.87
Own a television                                          [illegible]         13.46
Own a book                                                [illegible]         15.57
Own a radio                                               [illegible]         6.60
Own a bike                                                .40                 5.14
Bureaucratic Competence and Honesty
Ease of obtaining identity document - easy                .00                 .03
Ease of obtaining identity document - never tried         -.07                -.68
Ease of obtaining identity document - don't know          [illegible]         2.92
Ease of obtaining place in primary school - easy          [illegible]         2.50
Ease of obtaining place in primary school - never tried   .22                 1.94
Ease of obtaining place in primary school - don't know    .30                 1.41
Ease of obtaining household services - easy               [illegible]         4.01
Ease of obtaining household services - never tried        [illegible]         [illegible]
Ease of obtaining household services - don't know         .14                 .80
Gov't treatment of ethnic group - always fair             [illegible]         12.37
Gov't treatment of ethnic group - NA                      [illegible]         5.90
Gov't treatment of ethnic group - don't know              [illegible]         4.61
Ethnic group of head of state                             [illegible]         6.96
Law and Order
Ease of obtaining help from police - easy                 [illegible]         3.73
Ease of obtaining help from police - never tried          -.05                -.49
Ease of obtaining help from police - don't know           -.20                -.19
Experience with theft - never                             [illegible]         7.92
Physically assaulted - never                              [illegible]         [illegible]
Infrastructure Provision in PSU
Roads                                                     [illegible]         4.97
Post office                                               [illegible]         2.86
Electricity grid                                          [illegible]         [illegible]
Cutpoints
1                                                         .05                 .48
2                                                         [illegible]         8.74
3                                                         [illegible]         15.41
4                                                         [illegible]         19.69
Random Intercepts                                         Variance            SD
τ² (PSUs) (N = 2,639)                                     .09                 .01

*p < .05  **p < .01  ***p < .001
To summarize, our empirical tests suggest that some features of state institutions
affect the food security of individuals and their households above and beyond the effect of
socio-demographic factors. Our results suggest that, by improving the quality and quantity of
roads, electricity grids, post offices, police forces and bureaucracies, the state
can improve individuals’ food security and thus their overall welfare.


Conclusion 


Recent empirical analyses on good governance, with a few exceptions, focus on 
how cross-country variation in the quality of governance affects economic growth 
or social development. One study estimates that an improvement in governance, 
such as an improvement in the rule of law or a reduction in corruption, leads to 
a two- to four-fold improvement in per capita incomes and in infant mortality 
rates, and about a 20 percent improvement in adult literacy (Kaufmann et al. 
1999). We argue that political economists must widen their research agenda to 
examine within-country variation on the impact of governance on social welfare. 
We illustrate how we can measure governance at the individual, neighborhood,
district and country levels in developing countries. 

Our findings are consistent with recent studies that find strong correlations 
between the quality of government and economic and social outcomes. We have 
presented data and empirical tests that suggest that the quantity and quality of 
infrastructure, civil bureaucracy, and law and order affect individuals’ food security.
These results are independent of respondents’ household and demographic char- 
acteristics and country-level characteristics including GDP, foreign aid, rainfall, 
food aid, democratic institutions, civil liberties and political rights. Our evidence 
reveals that there is more variation within than between countries in how effectively
governments have facilitated food security.

We do not find any relationship between our country-level democracy index 
and our individual-level measure of food security. Our results also suggest that,
combined, the availability of infrastructure, police and a bureaucracy has a
substantially stronger effect on individual-level food security than our country-level
measures of political rights and civil liberties. This supports our claim that the
availability of infrastructure, police and a bureaucracy is more important for
individuals’ food security and social welfare than free and open multiparty elections.

We suspect that there may be a feedback loop in our framework that we cannot 
capture without longitudinal data. As the effectiveness of government increases, 
citizens’ social welfare is also likely to increase. Consequently, citizens will be more 
likely to perceive government as fulfilling its end of the “fiscal contract” and thus, 
quasi-voluntarily comply with taxes (Levi 1997). The more taxes a government is 
able to collect from its citizens without resorting to coercion alone, the more public
goods it is able to provide, leading to further improvements in social welfare.

Our article has shed light on how governments can positively affect citizens’
food security. Although food security is rarely problematic outside of Africa and
parts of Asia, malnutrition, infant and adult mortality, disease and poverty are of
concern in both developing and developed countries. With appropriate measurement
tools, we can begin to identify where effective governments exist and how
to improve governments that are failing to provide basic goods citizens need to
maintain and improve their social welfare above and beyond food security. The
ability to assess government performance and its effect on individuals can facilitate
the capacity of governments and aid agencies to identify how best to allocate
resources to improve individuals’ food security, health and general well-being.


Notes 


1. Some scholars, including one of the authors of this piece (e.g., Levi 1988, 2002), 
have sometimes used the term “state” to mean the range of institutions that produce 
social order and that regulate (and extract from) civil society and government to 
mean the actual individuals and parties in office. However, in recent years governance 
and government are the more common terms of use, particularly in reference to 
the institutions and implementation processes that affect the goods and services 
that public officials demand of and provide to citizens and other inhabitants of the 
relevant polity. We follow this more recent terminology. 


2. We expect that there is a curvilinear relationship between state penetration and
government effectiveness. Excessive state penetration is likely to reduce a government’s
effectiveness. This is not a hypothesis we can test with the available data. 


3. Evidence suggests that high and unstable food prices were largely responsible for the 
famines in 19th-century India (Ravallion 1996).


4. For information on Afrobarometer and their sampling methods, see the 
Afrobarometer website and technical manuals (Afrobarometer Network 2007). 
We excluded Zimbabwe from our analysis because of concerns with data quality 
and because of missing data on key covariates. 


5. We estimated a multilevel ordered probit using the GLLAMM package in Stata
(Rabe-Hesketh et al. 2004). One difference between the results of the multilevel 
ordered probit and the multilevel logistic regression is the estimate of our variable, 
head of state (i.e., whether the respondent is a member of the same ethnic group 
as the head of state). In the former model, the variable is significant at the p < 
.000 and in the latter, the variable is not significant at the p < .05 level. 


6. We could not find the ethnicity of Mozambique’s current President Armando 
Guebuza, so we coded the head of state variable for Mozambique as all zeros. 


7. It is possible that food insecurity causes ethnic scapegoating and that ethnic 
discrimination causes food insecurity. We only examine the latter hypothesis. 


8. Low self-reported experiences with crime may be more of an indicator of respondents’ 
ability to pay for private protection than an indicator of effective governance. By 
aggregating the law and order variables, we can test whether living in a district with 
low self-reported crime and high levels of accessibility to the police increases the 
probability of experiencing food security. 


9. We also estimated a model using the amount of arable land per capita as a proxy 
for droughts. This variable is not significant and adding it does not change our 
coefficient and variance estimates. 
10. We use the country nodes collected by Miguel et al. (2004a). No degree grid
node was listed for Cape Verde; we use the longitude and latitude for Praia, the 
capital of Cape Verde. 


11. This variable is calculated from three other Polity IV variables: the competitiveness 
of political participation; the openness and competitiveness of executive recruitment; 
and, constraints on the chief executive. 


12. To conserve space, we left the appendix with the results from all of our alternative 
models out of this article. This appendix is available at: http://faculty.washington.edu/ 
mlevi. 


13. We hold interval-ratio variables at the mean and categorical variables at their mode. 


14. We excluded the indicator, the average ease of obtaining help from the police, from 
the regression that includes the variable, the average ease of obtaining an identity 
document, because of a moderately strong correlation between these two variables. 


15. The formula we use to calculate the intraclass correlations (Pebley et al. 1996) is 
posted on the appendix, http://faculty.washington.edu/mlevi. 


16. The predicted probabilities and first differences shift by less than 1 percentage point 
when we include political rights and civil liberties in two separate regression models. 
All else equal, a shift in political rights from a Freedom House rating of 4 to a rating
of 1 corresponds to an increase of 25 percentage points in the predicted probability that
individuals will enjoy food security. Similarly, all else equal, a shift in political rights
from a Freedom House rating of 5 to a rating of 1 corresponds to an increase of 22
percentage points in the predicted probability that individuals will enjoy food security.
The Lasting Effect of Civic Talk on Civic Participation:
Evidence from a Panel Study 


Casey A. Klofstad, University of Miami 


Extant research shows that individuals who discuss politics and current events 
with their peers also participate more actively in civil society. However, this 
correlation is not sufficient evidence of causation due to a number of analyti- 
cal biases. To address this problem, data were collected through a panel study 
conducted on students who were randomly assigned to dormitories during 
their first year of college. In addition, the data were preprocessed before analy- 
sis with a matching procedure. These data show that discussing politics and 
current events caused these students to participate in civic activities during 
their first year of college. A follow-up study conducted on the same popula- 
tion during their fourth year of college shows that the positive effect of civic 
talk on civic participation still exists despite the passage of three years. Further 
analysis shows that the boost in civic participation initially after engaging in 
civic talk is the mechanism by which the effect of civic talk lasts into the future. 





Because civic participation is integral to the performance of democracy, the ques- 
tion of what causes a person to step out of his or her private life and enter the 
public sphere has been a subject of constant study in the social sciences. Within 
this research tradition, a growing number of political scientists have focused their 
work on the sociological antecedents of civic participation. More specifically, a 
number of studies have shown a positive correlation between “civic talk” —infor- 
mal discussion of politics and current events that occurs in an individual’s “peer 
group” or “social network” —and civic participation. However, it is challenging 
to analyze this phenomenon with precision because it is difficult to determine 
if our peers influence us, or if our own patterns of behavior influence how we
select and interact with our peers (Klofstad 2007, 2009; Laver 2005; Nickerson 2008).
Consequently, political scientists largely ignore the role of social-level antecedents
of civic participation, and instead focus on individual-level factors (e.g., strength
of political preferences, psychological engagement with politics and the like).¹
To address this problem, new data were collected over three points in time from
a panel of undergraduate students at a large public university in the Midwestern
United States. This study allows for a more precise examination of civic talk
because it more closely resembles a controlled experiment than previous studies;
the study is quasi-experimental because the students who participated in it were
randomly assigned to their first-year dormitory roommates. In addition, the panel
aspect of this study allows for examination of whether the effect of civic talk on 
civic participation lasts beyond the initial point of exposure. 

The results generated from this study show that there is a meaningful causal 
relationship between civic talk and participation in civil society. Initially after 
engaging in civic talk with their roommates, the population under study increased 
their participation in voluntary civic organizations by 38 percent. Moreover, the 
effect of civic talk is lasting. Study subjects who engaged in civic talk during their 
first year of college continued to participate in more voluntary civic organiza- 
tion activities during their fourth year of college. Further analysis shows that the 
initial boost in civic participation caused by civic talk is the mechanism by which 
the effect of civic talk lasts into the future. These findings illustrate that social 
scientists need to consider both individual- and social-level antecedents of civic 
participation in order to gain a comprehensive understanding of how participatory 
democracy functions. 


Sociological Explanations of Civic Participation 


A number of different lines of social science research assert that the individuals
in one’s social environment have an effect on one’s opinions and behaviors. For
example, works in sociology and social psychology show that individuals emulate 
the attitudinal and behavioral norms of their social network (e.g., Festinger et al. 
1950; Latané and Wolf 1981; Michener and DeLamater 1999). Economists and 
sociologists have shown that one’s college roommate can influence a variety of 
behaviors, such as educational attainment (Sacerdote 2001) and binge drinking
(Duncan et al. 2005). Research on households shows that people living under the 
same roof can influence each other to vote (e.g., Nickerson 2008). The literature 
on public deliberation shows that individuals become more informed about poli- 
tics through the process of formulating public policy options with other citizens 
(Barabas 2004; Delli Carpini, Cook and Jacobs 2004; Page and Shapiro 1992; 
Mendelberg 2002). Works on social capital and cooperation illustrate that inter- 
acting with fellow citizens causes individuals to have a greater sense of attachment 
to their communities, which leads to more frequent participation in civic activities 
(Dawes, van de Kragt and Orbell 1990; Putnam 2000; Sally 1995). Research 
on political communication, opinion formation, the mass media and political 
socialization shows that the individuals around us influence how we learn about 
politics because civically-engaged individuals provide the rest of us with informa- 
tion (Barker 1998; Dawson, Prewitt and Dawson 1977; Downs 1957; Lazarsfeld, 
Berelson and Gaudet 1968; Silbiger 1977; Stimson 1990; Zaller 1992). 

With regard to civic talk among peers—the specific focus of this article—the 
literature on social networks contends that talking about politics with the people 
in our immediate social environment leads us to participate in civic activities (e.g.,
Campbell and Wolbrecht 2006; Huckfeldt and Sprague 1991, 1995; Huckfeldt et 
al. 1995; Kenny 1992, 1994; Klofstad 2007; Lake and Huckfeldt 1998; McClurg 
2003, 2004; Mutz 2002). For example, Lake and Huckfeldt (1998) show that the amount
of political discussion occurring in an individual’s network of friends correlates 
with his or her level of political participation. More recent research has also begun 
to identify the mechanisms that allow individuals to translate discussion into ac- 
tion (Klofstad 2007; McClurg 2003). These studies suggest that civic talk causes 
civic participation because such discussions provide individuals with the motiva- 
tion and resources that are necessary in order to participate in civil society. For 
example, McClurg (2003) shows that peers are an important source of informa- 
tion on politics and current events. Information motivates participation because 
it increases civic competence (the ability to participate) and civic engagement 
(having an interest in participating in the first place). Klofstad (2007) comes to a 
similar conclusion on the role of information. 


Civic Participation as a Self-Reinforcing Process 


While there is a growing literature concerned with social networks, the question 
of whether the influence of civic talk on civic participation lasts beyond the ini- 
tial point of exposure has not been answered. Nonetheless, the political science 
literature on civic participation suggests that the effect of civic talk could be 
lasting because civic participation is a self-reinforcing phenomenon. For example, 
Gerber and colleagues (2003) show that individuals who have been induced to 
vote in the past are more likely to vote in the future (also see Fowler 2006; Plutzer 
2002). Additional research suggests that other forms of civic activity may also be 
self-reinforcing (e.g., Brady, Schlozman and Verba 1999; Burns, Schlozman and 
Verba 2001; Putnam 2000; Rosenstone and Hansen 1993; Verba, Schlozman and 
Brady 1995). For example, Verba and colleagues (1995) find that individuals who 
participate in civic activities through their churches or voluntary civic organiza- 
tions also tend to be active in other civic activities such as political campaign 
voluntarism. Research on political socialization also shows that past patterns of 
civic participation, especially the experiences a person has during adolescence and 
young adulthood, are highly influential in determining how civically active he or 
she will be in the future (e.g., Campbell 2006; Jennings and Niemi 1981). 

Civic participation is self-reinforcing because the more civically active an in- 
dividual is today, the easier it becomes for him or her to participate in the future. 
Individuals are not automatically equipped to participate in civil society. Instead,
we require resources (e.g., knowledge on how to participate) and psychological
motivations (i.e., civic engagement) in order to participate in civic activities (Verba
et al. 1995). These prerequisites can be obtained as individuals take the resources
and motivations they acquire through participating in civic activities today and
apply them to participation in the future (Verba, Schlozman and Brady 1995).
For example, a person can apply the experience he or she gained organizing a
public service project for his or her church to organizing a partisan “get out the
vote” drive. Citizens who are mobilized to participate in civic activities also tend
to already be civically active because agents of civic mobilization (e.g., political
parties and other civic organizations) are “rational prospectors” (Brady, Schlozman
and Verba 1999). These agents want their mobilization efforts to result in civic
activity, so they target individuals who are already participating in civil society.
Also, in a recent study Campbell (2006:5) finds that “...the civic norms within 
one’s adolescent social environment have an effect on civic participation well 
beyond adolescence...” 

If we assume that civic participation is a self-reinforcing behavior, as the extant 
literature suggests, past patterns of participation will help determine future pat- 
terns of participation. Consequently, if engaging in civic talk causes an individual 
to become more active in civil society, that effect could still be felt after the initial 
point of exposure to civic talk as the individual parlays his or her past participatory 
experience into future participation in civic activities. In other words, causing an 
initial increase in civic participation could be the mechanism by which the effect 
of civic talk lasts into the future. 


The Collegiate Social Network Interaction Project 


Despite the growing list of scholars who are concerned with social-level anteced- 
ents of civic participation, research on peer networks has been heavily criticized 
because it is difficult to provide evidence of a causal relationship between civic 
talk and civic participation. Existing works struggle to produce definitive results 
because it is difficult to determine if our peers influence us, or if our own patterns 
of behavior influence how we select and interact with our peers (Klofstad 2007, 
2009; Laver 2005; Nickerson 2008). For example, the central argument made in 
this literature is that talking about politics and current events with our peers leads 
us to become more active in civil society. However, an equally plausible explana- 
tion is that being active in politics causes you to talk about politics with your 
peers (reciprocal causation). Individuals who are more active in politics may also 
explicitly choose to associate with peers who are more interested in talking about 
politics (selection bias). Finally, some factor that has not been accounted for could 
be causing people to have political discussions with their peers and participate in 
civic activities (endogeneity bias). 

Traditionally, non-recursive regression models are used to overcome analytical 
biases such as these. Specifically, the independent variable of interest (in this 
case, the amount of conversation about politics and current events an individual 
has with his or her peers) is modeled with instrumental variables that do not 
correlate with the outcome variable being predicted (in this case, the amount 
of civic participation an individual engages in). However, it is difficult to think 
of any variable that could reliably predict the level of civic talk occurring in an 
individual’s peer group, yet not be correlated with how civically active he or she
is. Instrumental variables like these have not been identified.²

An ideal method to ameliorate these analytical problems would be to randomly 
assign one group of individuals to engage in civic talk (the treatment group), and 
another group of like individuals to not engage in civic talk (the control group). 
Under random assignment, treated and untreated subjects are identical to one an- 
other, save that one is exposed to the treatment while the other is not. This research 
design would allow us to be confident that the outcomes of the study are actually 
being caused by civic talk instead of any other observed or unobserved factors.³

With this ideal research design in mind, data were collected from first-year 
college students who lived in university housing at the University of Wisconsin- 
Madison during the 2003-2004 academic school year. Random assignment is 
incorporated into the Collegiate Social Network Interaction Project (C-SNIP) Panel Survey
design because study participants were assigned to their first-year college dormitory
roommate based on a lottery. Incoming first-year dormitory residents ranked
the 16 dormitories on campus in order of where they wanted to live. Subjects were 
then randomly sorted by a computer in order to determine the order in which 
they would be assigned to dormitories. If space was available in the student's first 
housing choice at the time that his or her name was reached in the randomly- 
sorted list, the student was placed in a room in that dormitory. If space was not 
available, an attempt was made to place the student with a roommate in his or her 
second-choice dormitory, and so on. 
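
The lottery described above can be sketched in a few lines of code. This is a minimal illustration under stated assumptions, not the housing office's actual software: the function name `assign_dormitories`, the `preferences` and `capacity` structures, and the `seed` argument are all hypothetical, and the sketch covers only placement into dormitories, not the pairing of roommates within a dormitory.

```python
import random


def assign_dormitories(preferences, capacity, seed=0):
    """Place students in dormitories by lottery order and ranked preference.

    preferences: dict mapping a student id to a list of dormitory names,
                 most preferred first (hypothetical input format).
    capacity:    dict mapping a dormitory name to its number of open places.
    Returns a dict mapping each student id to a dormitory, or to None when
    none of the student's ranked dormitories had space left.
    """
    rng = random.Random(seed)
    order = sorted(preferences)   # fixed starting order, so the seed alone decides the draw
    rng.shuffle(order)            # the random sort that determines who is placed first
    remaining = dict(capacity)    # copy, so the caller's dictionary is not modified
    placement = {}
    for student in order:
        placement[student] = None
        for dorm in preferences[student]:      # first choice, then second choice, and so on
            if remaining.get(dorm, 0) > 0:
                remaining[dorm] -= 1
                placement[student] = dorm
                break
    return placement
```

Because the order of placement is random, which students obtain their first choice does not depend on their own characteristics, which is the property the research design relies on.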

C-SNIP participants initially completed two survey questionnaires: one at 
the beginning of the 2003-2004 academic year before they were affected by their 
randomly-assigned roommate, and a second at the end of the 2003-2004 school 
year. During the first wave of the study, students were asked about their patterns 
of civic participation during high school. During the second wave of the study 
students were asked about their patterns of civic participation during their first 
year of college, as well as about their roommate. In the spring of 2007, during 
their fourth year of college, this same population was re-interviewed. The 2007 
questionnaire repeated most of the questions asked in the 2003-2004 studies. 
This additional data point allows for an assessment of whether the effect of civic 
talk felt by these students during their first year in college lasted into their final 
year of college. These data also further reduce problems associated with reciprocal
causation, the possibility that civic participation causes civic talk, since the two
phenomena are temporally separated from one another by three years (with talk
occurring before participation).

Because these data are collected from a sample of college students they are not
representative of the wider public. This is, however, the inevitable tradeoff between
internal and external validity.⁴ In order to obtain a more accurate estimate of the
causal relationship between civic talk and civic participation, it was necessary to 
study a more controlled, and subsequently less representative, population. In addition
to increased internal validity, college is a uniquely useful setting in which
to study civic talk because it represents a “crucial” case of peer influence (Eckstein
1975; Gerring 2001). When young people leave their families to begin life as
independent adults, peers are likely to become highly influential (Beck 1977;
Campbell et al. 1960). Thus, if we do not find evidence of a causal relationship 
between civic talk and civic participation in this environment, we are less likely 
to find it in other contexts where peers may be less influential. An individual’s 
first year of college is also a crucial case because it is a “paradigmatic” case of peer 
influence (Gerring 2001). Collegiate peers are a paradigm of peer influence be- 
cause peers are a central facet of the individual’s life as he or she begins adulthood. 
Moreover, collegiate peers illustrate the importance of peer influence because they 
are likely to influence the patterns of civic participation that young people carry 
with them throughout life. 


Measures 
Dependent Variable: Civic Participation 


Civic participation is measured based on how active students reported being 
in voluntary civic organizations during their first and fourth years of college. 
In total, seven different types of group affiliations are accounted for: charitable
and voluntary service, leadership and civic training, groups that “take stands on
political issues or current events,” partisan groups, student government, student
publications (e.g., newspaper), and speech clubs and teams (e.g., forensics, debate).
For each organization, students were asked to rate how active they were in that
organization on a four-point scale, ranging from “not at all active” to “very active.”
Civic participation is operationalized as the total amount of organizational activity
that each student engaged in (the sum of the seven four-point scales).⁵
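
As a minimal sketch of this operationalization, the index can be computed by scoring each of the seven responses and adding the scores. The 0 to 3 coding is an assumption (it is consistent with the maximum of 21 reported in Table A1, but the article does not state the numeric codes), and the variable and function names below are hypothetical.

```python
# Hypothetical short names for the seven organization types listed in the text.
ORGANIZATION_TYPES = [
    "charitable_service",
    "leadership_civic_training",
    "issue_groups",
    "partisan_groups",
    "student_government",
    "student_publications",
    "speech_clubs",
]

# Assumed numeric coding of the four response options (0 = lowest activity).
ACTIVITY_SCORES = {
    "not at all active": 0,
    "not very active": 1,
    "somewhat active": 2,
    "very active": 3,
}


def civic_participation(responses):
    """Sum the seven four-point activity scales into one participation score."""
    missing = [org for org in ORGANIZATION_TYPES if org not in responses]
    if missing:
        raise ValueError("missing responses for: " + ", ".join(missing))
    return sum(ACTIVITY_SCORES[responses[org]] for org in ORGANIZATION_TYPES)
```

Under this coding a student who is “very active” in all seven types scores 21, and one who is “not at all active” in every type scores 0.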


Independent Variable: Civic Talk 


The independent variable of interest in this analysis is the amount of civic talk that 
occurred between college roommates. Specifically, in the C-SNIP questionnaire 
each student was asked, “When you talk with your roommate, how often do you 
discuss politics and current events: often, sometimes, rarely, or never?” 

Two features of this measure need to be explained in greater detail. First, while
use of self reports is standard practice in studies of social networks (e.g., Campbell
and Wolbrecht 2006; Huckfeldt and Sprague 1991, 1995; Huckfeldt, Beck, Dalton
and Levine 1995; Kenny 1992, 1994; Lake and Huckfeldt 1998; McClurg 2003,
2004; Mutz 2002), an alternative approach would be to use a more exogenous
measure, the report supplied by each subject's roommate. Because only a small number
of subjects were willing to report their dormitory address, however, only 84
roommate pairs were identified. That said, this small amount of data shows that
roommates did not differ significantly in the amount of civic talk they reported
(t = -1.14, p = .16). As such, in this population self reports of civic talk behavior are
likely to be observationally equivalent to an exogenous measure of civic talk.
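
The article does not say which t-test produced the reported statistic. One natural way to compare the two reports within each roommate pair is a paired t statistic, sketched below as an assumption rather than as the author's procedure; the function name `paired_t_statistic` and the toy data are hypothetical.

```python
import math


def paired_t_statistic(own_reports, roommate_reports):
    """t statistic for the mean within-pair difference between two reports."""
    differences = [a - b for a, b in zip(own_reports, roommate_reports)]
    n = len(differences)
    mean_difference = sum(differences) / n
    # Sample variance of the differences (n - 1 in the denominator).
    variance = sum((d - mean_difference) ** 2 for d in differences) / (n - 1)
    return mean_difference / math.sqrt(variance / n)


# Toy data: four hypothetical roommate pairs on a 0-3 civic talk scale.
t_value = paired_t_statistic([3, 2, 1, 2], [2, 2, 2, 1])
```

A statistic close to zero indicates that, on average, the two roommates in a pair report similar amounts of civic talk.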

Second, one might question what types of conversations the subjects in this 
study recalled when asked to report how often they discussed “politics and current 
events.” This question was worded broadly in order to prompt study participants 
to report on a wide variety of relevant discussions. For example, if the C-SNIP 
questionnaire were to have only asked for a report of conversations about “politics,” 
study participants may not have been cued to recall conversations about events 
on campus that motivated them to participate in campus-based voluntary civic 
organizations. Moreover, focus group data collected from the 2007-2008 cohort of
first-year students⁶ give greater insight into what types of conversations are recalled
when subjects are asked to recall discussions they had about “politics and current
events.” These data show that the majority of conversations about “politics and
current events” are centered on sharing information about civically-relevant events
that are happening on campus and/or that are covered in the news. 


Control Variables 


Based on data collected in the first C-SNIP survey, the analysis controls for how 
active each subject was in voluntary civic organizations during high school, before 
they engaged in civic talk in college (i.e., a lag of the dependent variable). This allows 
for an assessment of the effect of civic talk on civic participation, given the subject's 
a priori predilection to participate in civic activities. To increase the precision of the 
analysis, the analysis also uses fixed effects to account for how the dormitory assign- 
ment process was executed (i.e., a dichotomous indicator variable for each dormitory). 
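
The dormitory fixed effects amount to one 0/1 indicator per dormitory, with one dormitory left out as the reference category (Table 1 reports 15 indicators for the 16 dormitories). A minimal sketch of how one row of the regression design could be built follows; the function names are hypothetical, and which dormitory serves as the reference is an assumption, since the article does not say.

```python
def dormitory_indicators(dormitory, all_dormitories, reference):
    """Return one 0/1 indicator per dormitory, omitting the reference category."""
    return [1 if dormitory == d else 0 for d in all_dormitories if d != reference]


def design_row(high_school_participation, civic_talk, dormitory,
               all_dormitories, reference):
    """Build one row: constant, treatment, lagged outcome, dormitory indicators."""
    return ([1, civic_talk, high_school_participation]
            + dormitory_indicators(dormitory, all_dormitories, reference))


# Sixteen dormitories labeled "1" to "16"; "16" is an arbitrary reference choice.
dormitories = [str(i) for i in range(1, 17)]
example_row = design_row(6, 1, "3", dormitories, reference="16")
```

A subject living in the reference dormitory has zeros in every indicator column, so each dormitory coefficient is read as a difference from that dormitory.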


Method: Data Preprocessing 


While the process of assigning C-SNIP participants to dormitory roommates was 
random, subjects were allowed to discuss politics and current events with their 
roommates as much or as little as they wished. Because of this deviation from 
random assignment, exogenous factors could be affecting both the treatment (the 
amount of civic talk each student was exposed to) and the outcome of interest 
(civic participation) in this study (Dunning 2008; Achen 1986).⁷ This feature of
the C-SNIP study can be accounted for, however, by preprocessing the data with 
a “matching” procedure (Dunning 2008; Ho, King and Stuart 2007a, 2007b). 


Under this procedure the effect of civic talk is measured by comparing the civic
participation habits of subjects who are similar to one another, save the fact that
one engaged in civic talk with his/her roommate and the other did not. By comparing
the participatory habits of similar subjects who did and did not engage in
civic talk, we can be confident that any observed difference in civic participation
between them is unrelated to the factors that the subjects were matched on, and
as such is a consequence of civic talk.⁸ A full description of how this procedure
was conducted is included in the appendix.
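
To make the logic of matching concrete, the sketch below pairs each treated subject with the most similar untreated subject on a single pre-treatment covariate and averages the outcome differences within pairs. It is an illustration only: the greedy one-to-one nearest-neighbor rule, the single covariate, the `caliper` parameter, and all names are assumptions, and the procedure actually used in the study (described in the appendix) may differ.

```python
def matched_difference(subjects, caliper=1.0):
    """Average treated-minus-control outcome difference across matched pairs.

    subjects: list of dicts with keys 'treated' (bool), 'prior' (pre-treatment
              participation) and 'outcome' (later participation).
    caliper:  largest allowed gap in 'prior' between matched subjects.
    Returns None when no treated subject can be matched.
    """
    treated = [s for s in subjects if s["treated"]]
    controls = [s for s in subjects if not s["treated"]]
    used = set()          # indices of controls that are already matched
    differences = []
    for t in treated:
        best_index, best_gap = None, None
        for i, c in enumerate(controls):
            if i in used:
                continue
            gap = abs(t["prior"] - c["prior"])
            if gap <= caliper and (best_gap is None or gap < best_gap):
                best_index, best_gap = i, gap
        if best_index is not None:
            used.add(best_index)
            differences.append(t["outcome"] - controls[best_index]["outcome"])
    if not differences:
        return None
    return sum(differences) / len(differences)


# Toy data: the control with prior = 20 has no comparable treated subject.
toy = [
    {"treated": True, "prior": 2, "outcome": 5},
    {"treated": True, "prior": 8, "outcome": 9},
    {"treated": False, "prior": 2, "outcome": 4},
    {"treated": False, "prior": 8, "outcome": 8},
    {"treated": False, "prior": 20, "outcome": 15},
]
effect = matched_difference(toy)
```

In the toy data the raw difference in mean outcomes is negative (7 for treated subjects against 9 for untreated subjects) only because one very active untreated subject has no treated counterpart; once that subject is left unmatched, the estimated effect is positive.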
Table 1: The Effect of Civic Talk on Civic Participation

                                                          Year in College
                                                          1st            4th
Peer Influence
  Civic talk among roommates                              [illegible]    .62*
                                                          (2.70)         (2.00)
Pre-Treatment Level of Civic Participation
  Participation in civic organizations in high school     [illegible]    [illegible]
                                                          (7.33)         (5.67)
Treatment Assignment Controls: Dormitories
  1                                                       -.32           -1.27
                                                          (-.13)         (-.51)
  2                                                       -1.04          [illegible]
                                                          (-.55)         (-.31)
  3                                                       -.30           -1.03
                                                          (-.16)         (-.43)
  4                                                       -1.66          -.19
                                                          (-.89)         (-.09)
  5                                                       -1.44          -.32
                                                          (-.80)         (-.16)
  6                                                       -.88           -.87
                                                          (-.51)         (-.40)
  7                                                       -.14           -.91
                                                          (-.08)         (-.44)
  8                                                       -.79           -.19
                                                          (-.45)         (-.09)
  9                                                       -.87           -.58
                                                          (-.51)         (-.29)
  10                                                      -1.20          -.48
                                                          (-.68)         (-.23)
  11                                                      -1.22          -.44
                                                          (-.64)         (-.20)
  12                                                      -1.60          -1.07
                                                          (-.91)         (-.53)
  13                                                      -2.44          -1.23
                                                          (-.26)         (-.14)
  14                                                      -1.44          -.29
                                                          (-.84)         (-.14)
  15                                                      -1.51          -.56
                                                          (-.87)         (-.28)
Constant                                                  1.67           2.50
                                                          [illegible]    [illegible]
Adjusted R²                                               [illegible]    [illegible]
N                                                         1,044          1,044

Source: Collegiate Social Network Interaction Project Panel Study
Model Type: Ordinary Least Squares (Imai, King and Lau 2007c)
*p < .10 **p < .05 ***p < .01 (t-values in parentheses)
Results
Civic Talk Has a Lasting Effect on Participation in Voluntary Civic Organizations 


To what extent does civic talk influence how active a person chooses to be in civil 
society? I start to answer this question by examining how active subjects were in 
voluntary civic organizations during their first year of college. The results of a 
multivariate regression analysis of participation in voluntary civic organizations 
are presented in Table 1. 

In the first column of Table 1 the data show that subjects who reported engaging 
in civic talk when conversing with their roommate were more likely to participate 
in voluntary civic organizations during their first year of college. All other factors 
equal, participation for treated subjects was 38 percent higher than that of untreated 
subjects (an increase from 2.1 to 2.9 on the voluntary organization participation 
scale).⁹ Similar results appear in the second column. Here, instead of estimating the
immediate influence of civic talk on civic participation, the effect of the treatment is 
estimated three years after the subject engaged in civic talk. Regardless of this multi- 
year gap between treatment and outcome, the influence of civic talk is still statistically 
significant and substantively meaningful. All else equal, exposure to civic talk during 
the 2003-2004 academic school year increased civic participation by 20 percent in 
2007 (an increase from 3.0 to 3.6 on the voluntary organization participation scale).¹⁰
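
The percentage figures follow directly from the expected values reported in the text. As a brief check, using a hypothetical helper named `percent_increase`:

```python
def percent_increase(untreated, treated):
    """Percentage change from the untreated to the treated expected value."""
    return 100.0 * (treated - untreated) / untreated


first_year = percent_increase(2.1, 2.9)   # first-year effect, roughly 38 percent
fourth_year = percent_increase(3.0, 3.6)  # fourth-year effect, roughly 20 percent
```

Note that the absolute gains are similar in size (0.8 and 0.6 scale points); part of the drop in the percentage reflects the higher baseline level of participation in the fourth year.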

While the results in Table 1 show that the influence of civic talk is lasting, they 
also suggest that the effect might diminish over time; the treatment effect esti- 
mated from the matched data set drops from a 38 percent increase in participation 
in voluntary organizations in 2004 to a 20 percent increase in 2007. However, 
Figure 1 shows that this difference is not statistically significant. The light-colored
bars in this figure represent the estimated increase in civic participation due to 
engaging in civic talk. While the estimated effect declines between 2004 and 2007, 
the confidence intervals around the 2004 and 2007 estimates (represented by the 
vertical lines running through each bar) overlap. This suggests that the positive 
impact of civic talk on participation in voluntary civic organizations did not 
decrease even after the passage of three years. 


In order to assess the magnitude of the effect that civic talk has on participation
in voluntary civic organizations, Figure 1 also compares the effect of civic talk to the
effect of having participated in voluntary civic organizations in high school before
engaging in civic talk in college. These results illustrate that while the effect of civic
talk is statistically significant and lasting, the effect is not as large as that of prior
participatory experience. In both 2004 and 2007, the effect of engaging in civic talk
is less than the effect of having above average prior participatory experience.


Why Does the Effect Last? 


What explains the lasting effect of civic talk on civic participation? It was hypothesized
that the effect of civic talk would last into the future by having an effect on
patterns of civic participation in the present. In other words, causing an initial
increase in civic participation is the mechanism by which the positive effect of
civic talk on civic participation lasts into the future.

2362 • Social Forces 88(5)

Figure 1. Comparing the Effects of Civic Talk and Past Participation on Civic
Participation

Source: Collegiate Social Network Interaction Project Panel Study

Notes: The line on each bar represents the 95 percent confidence interval about the
estimate. Figures are based on the regression analysis presented in Table 2. The civic
talk first difference is calculated between treated and untreated subjects. The past
participation first difference is calculated by comparing the expected levels of participation
for subjects with average levels of prior experience to those with the maximum level of
prior experience.

Table 2 offers two tests of this hypothesis. In Table 1, the lasting effect of civic
talk on civic participation was estimated while controlling for the amount of civic
participation subjects engaged in during high school. In the first column of Table
2, the analysis now also accounts for the amount of civic activity subjects engaged
in during their first year of college. The goal of adding this variable to the analysis
is to “explain away” the peer effect. If the boost in civic participation caused by
civic talk during one’s first year in college explains why the effect of civic talk lasts
into one’s fourth year in college, the civic talk coefficient should no longer be
statistically significant after a measure of the subject’s level of civic participation
during his or her first year in college is added to the model. This will only occur
if participation during one’s first year in college accounts for the variance in civic
participation during one’s fourth year in college that was originally accounted for
by civic talk. The results in the first column of Table 2 show that this is the case.
Once civic participation during one’s first year in college is added to the analysis,
the civic talk coefficient is no longer statistically significant.

Table 2: Explaining the Lasting Effect of Civic Talk on Civic Participation

                                                                       One-Stage      Two-Stage
                                                                       Modelᵃ         Modelᵇ
Peer Influence
  Civic talk among roommates                                           .33            --
                                                                       (1.06)
Pre-Treatment Level of Civic Participation
  Participation in voluntary civic organizations in high school        .09**          -.04
                                                                       (2.25)         (-.50)
Level of Civic Participation Initially After Treatment
  Participation in voluntary civic organizations/1st year of college   [illegible]    .94***
                                                                       (9.00)         (3.36)
Treatment Assignment Controls: Dormitories
  1                                                                    -1.14          -2.36
                                                                       (-.49)         (-1.10)
  2                                                                    -.38           -1.20
                                                                       (-.17)         (-.81)
  3                                                                    -.94           -1.71
                                                                       (-.43)         (-1.18)
  4                                                                    .43            .62
                                                                       (.23)          (.38)
  5                                                                    .20            -.17
                                                                       (.12)          (-.12)
  6                                                                    -.55           -1.03
                                                                       (-.28)         (-.73)
  7                                                                    -.86           -1.63
                                                                       (-.46)         (-1.14)
  8                                                                    .10            -.33
                                                                       (.06)          (-.24)
  9                                                                    -.26           -.97
                                                                       (-.15)         (-.66)
  10                                                                   -.03           -.31
                                                                       (-.02)         (-.22)
  11                                                                   .04            -.12
                                                                       (.02)          (-.08)
  12                                                                   -.48           -.83
                                                                       (-.27)         (-.59)
  13                                                                   -.34           [illegible]
                                                                       (-.01)         (.05)
  14                                                                   .24            .01
                                                                       (.13)          (.007)
  15                                                                   -.004          -.46
                                                                       [illegible]    [illegible]
Constant                                                               1.88           1.88
                                                                       (.98)          (1.33)
Adjusted R²                                                            .24            n/a
N                                                                      1,044          1,044

Source: Collegiate Social Network Interaction Project Panel Study
Model Type: ᵃOrdinary Least Squares (Imai, King and Lau 2007c); ᵇTwo Stage Least
Squares (Alimadhi, Lu and Villalon 2007)
*p < .10 **p < .05 ***p < .01 (t-values in parentheses)

A second examination of why the effect of civic talk lasts appears in the second 
column of Table 2. These results are the final outcome of a two-stage least squares 
analysis of civic participation during one’s fourth year in college. The first stage 
of the analysis uses civic talk to estimate the amount of civic activity that each 
subject participated in during his or her first year in college. The second stage of 
the analysis uses the estimate of civic participation during one’s first year in col- 
lege from the first stage of the model to estimate civic participation during one’s 
fourth year in college. The results of this analysis show that civic participation 
during one’s first year in college is the only variable in the model that can account 
for civic participation during one’s fourth year in college. What these results show, 
as expected, is that civic talk has an immediate effect on how civically active an 
individual chooses to be. This initial effect on an individual's patterns of behavior 
then has a direct effect on patterns of behavior in the future. 
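
The two stages described above can be illustrated with a stripped-down sketch. It is an assumption-laden simplification, not the model reported in Table 2: it omits the high school and dormitory controls, uses simple one-predictor regressions, and runs on invented toy data, and all function and variable names are hypothetical. A manual second stage of this kind also does not yield correct standard errors, which is one reason dedicated two-stage least squares routines are used in practice.

```python
def mean(values):
    return sum(values) / len(values)


def simple_ols(x, y):
    """Return (intercept, slope) of the least squares line of y on x."""
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope


def two_stage(civic_talk, first_year, fourth_year):
    """Stage 1: talk -> first-year participation.
    Stage 2: predicted first-year participation -> fourth-year participation."""
    intercept_1, slope_1 = simple_ols(civic_talk, first_year)
    predicted_first_year = [intercept_1 + slope_1 * t for t in civic_talk]
    _, slope_2 = simple_ols(predicted_first_year, fourth_year)
    return slope_1, slope_2


# Invented toy data for six subjects (three untreated, three treated).
talk = [0, 0, 0, 1, 1, 1]
year_1 = [1, 2, 3, 3, 4, 5]
year_4 = [2, 3, 4, 3, 5, 7]
stage_1_slope, stage_2_slope = two_stage(talk, year_1, year_4)
```

In the toy data civic talk raises first-year participation by 2 points, and each point of talk-induced first-year participation carries over into 1 point of fourth-year participation.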


Discussion and Conclusion 


Of the myriad explanations for why individuals choose to participate in the pro- 
cesses of democratic governance, no one theory has a monopoly on the truth. 
However, one thing we do know is that the people in our immediate social en- 
vironment have a place on this list of explanations. Human beings may not be 
Aristotelian political animals, but we are social animals. We experience politics 
with and through our peers. 

Against this logical presumption, political science research on civic participa- 
tion has been dominated by theories that focus on individual-level characteristics 
and largely ignore the role of social context. A number of studies assert that an 
individual's peers have an impact on whether he or she decides to participate 
in civic activities. However, this argument has been heavily criticized because 
researchers have been unable to accurately measure the causal relationship between 
peer influence and individual-level civic participation. As such, the question of 
how much influence civic talk has on participatory democracy has remained 
largely unresolved. 

I have addressed this important methodological and substantive question 
with new evidence. Using quasi-experimental panel data, in concert with a 
matching data preprocessing procedure, this research shows that civic talk can 
have a causal influence on how citizens participate in the processes of self-governance.
This is the case even after accounting for how civically active subjects
were before they engaged in civic talk, arguably one of the best measures of an 
individual's predilection to participate in civic activities. Moreover, the effect of 
civic talk is substantively meaningful because it is lasting. The evidence shows 
that subjects were still more likely to participate in voluntary civic organiza- 
tions three years after they engaged in discussions of politics and current events. 
Further analysis shows that the initial boost in civic participation caused by civic 
talk is the mechanism by which the effect of civic talk lasts into the future. In 
other words, engaging in civic talk early in their college careers placed the sub- 
jects in this study on a self-reinforcing path of higher levels of civic participation 
compared to those who did not engage in civic talk. 

Despite the significant and meaningful effect that peers have on civic participa- 
tion, however, the results of this study do not suggest that sociological explana- 
tions of civic participation should supplant individual-level explanations. To the 
contrary, the estimated effect of civic talk on civic participation is less than that of 
having prior participatory experience. As such, the results presented in this analy- 
sis show that in order to more comprehensively understand how contemporary 
participatory democracy functions, both social- and individual-level antecedents 
of civic participation need to be considered. Neither factor on its own is a suf- 
ficient explanation for why an individual chooses to participate in civil society. 

While these results add to our understanding of participatory democracy, fur- 
ther research is needed in order to understand the relationship between civic talk 
and civic participation. The data utilized in this study come from one group of 
college students at one university. As such these results should be verified in other 
contexts. Future studies should make further use of methods that allow for more 
effective study of complex causal relationships, such as the quasi-experimental 
panel study described in this article, fully-controlled experiments (e.g., Nickerson 
2008), participant observation (e.g., Eliasoph 1998; Harris-Lacewell 2004; Walsh 
2004), focus groups (e.g., Hibbing and Theiss-Morse 1995), and agent-based 
modeling (e.g., Johnson and Huckfeldt 2004). 

Future research should also give consideration to the complex mechanisms that 
govern the relationship between civic talk and civic participation. Extant research 
on civic talk shows that the mechanism that links talk to action is information 
transfers (Klofstad 2007; McClurg 2003). The analysis presented in this article, 
however, did not account for whether the quantity and/or quality of information 
being exchanged during civic talk discussions affects how active one chooses to 
be in civic activities. For example, is one conversation sufficient to encourage 
participation, or is more sustained interaction necessary? Does the answer to this 
question change if we are talking about the long-term effects of civic talk? In a 
similar vein, future studies should also address the potentially dynamic relation- 
ship between civic talk and civic participation. The initial positive effect of civic 
talk on civic participation is the mechanism which governs the lasting relationship 
between these two variables (i.e., talk begets participation today, which begets
participation in the future). However, it could also be the case that civic talk en- 
courages more civic talk, which in turn encourages civic participation in the future. 
Data on subsequent civic talk interactions, as well as a more complex modeling 
approach, would be necessary to test this proposition. 

In conclusion, I note that there is currently a great amount of concern over the 
strength of participatory democracy, largely because of declines in civic participation
that have occurred over the past half-century (e.g., Putnam 2000; but also
see McDonald and Popkin 2001). As such, it is incumbent upon social scientists 
to continue to examine why individuals choose to participate in the processes 
of democratic governance. Social-level factors such as peer networks deserve a 
meaningful place in this agenda. 


Notes 


1. A quintessential example is the seminal “Michigan School” of political behavior. 
The founders of this research tradition, Campbell et al. 1960:27, went so far as to 
say that “[b]y and large we shall consider external conditions as exogenous to our 
theoretical system.” 


2. Non-recursive models have been used when the independent variable of interest is 
peer behavior (e.g., vote choice), not political discussion (Kenny 1992). 


3. Nickerson (2008) utilizes this type of research design to test whether individuals 
living in the same household influence each other to vote. However, this study 
does not examine whether civic talk is the causal agent behind civic participation. 
Moreover, the study does not examine whether the influence of peers lasts beyond 
the initial point of exposure to treatment. 


4. See Kam et al. 2007 for a detailed discussion of this issue. 


5. Alternatively, one could operationalize this variable as a count of the number of
organizations that each subject was active in. The dependent variable was coded as the
sum of the full activity scales, however, for two reasons. First, the full scales were used
in order to avoid discarding information provided by respondents. Second, there are
substantive differences between individuals who are passive and active participants in
voluntary civic organizations (e.g., Putnam 2000). This said, Poisson and Negative
Binomial analyses of the event count version of the dependent variable yielded results
that are comparable to those presented in this study.


6. With regard to case comparability with the C-SNIP panel survey data, the focus
group participants were also randomly assigned to their dormitories.


7. For example, students who were civically active before they came to college were more 
likely to discuss politics and current events with their new roommates (r = .17, p <
.01), and more likely to participate in civic activities in college (r = .37, p < .01).


8. Matching is less precise than a controlled experiment because the procedure does 
not account for unobserved differences between treated and untreated subjects (e.g., 
Arceneaux et al. 2006). However, given the extensive set of pre-treatment covariates 
that were used in the matching procedure, it is difficult to think of any meaningful 
unobserved factors that are not accounted for in the analysis. Moreover, unobserved 
differences between treated and untreated subjects are likely to correlate with observed 
differences (Stuart and Green 2008). Also, given the fact that a true experiment is
an extremely difficult (if not impossible) research design to execute for this research 
question, matching (in concert with quasi-random assignment to treatment and 
controlling for a lag of the dependent variable) is a next-best alternative. 


9. Substantive interpretations of regression coefficients were calculated with the “setx” 
and “sim” procedures in the “Zelig” package for R (Imai et al. 2007a and b). The 
estimated treatment effect in the unmatched data set is a 45 percent increase in 
participation, suggesting that I would have slightly overestimated the influence of 
civic talk if I had not matched the data.


10. The treatment effect in the unmatched data set is estimated to be a 29 percent increase 
in participation. 


11. ASM (The Associated Students of Madison) is the student government body. 
Appendix
Descriptive Statistics and Survey Questions 


Table A1: Descriptive Statistics

                                                               Standard
                                  Min      Max      Mean       Deviation      N
Civic Talk Among Roommates
  Full Scale                      -.55     4.26     1.40       .89            1,044
  Dichotomous Treatment Coding    .00      1.00     [illegible] .50           1,044
Civic Participation
  High School                     -.76     19.00    6.60       3.96           1,044
  1st Year of College             -6.54    21.00    2.43       2.91           1,044
  4th Year of College             -4.98    14.00    3.20       2.66           1,044
Dormitory Assignment
  1                               .00      1.00     .002       .05            1,044
  2                               .00      1.00     .06        [illegible]    1,044
  3                               .00      1.00     .09        .28            1,044
  4                               .00      1.00     .02        [illegible]    1,044
  5                               .00      1.00     .08        .26            1,044
  6                               .00      1.00     [illegible] .32           1,044
  7                               .00      1.00     .10        .30            1,044
  8                               .00      1.00     .10        .30            1,044
  9                               .00      1.00     .07        .29            1,044
  10                              .00      1.00     .07        .26            1,044
  11                              .00      1.00     .04        .21            1,044
  12                              .00      1.00     .06        .20            1,044
  13                              .00      1.00     .001       .04            1,044
  14                              .00      1.00     [illegible] [illegible]   1,044
  15                              .00      1.00     .09        .29            1,044

Source: Collegiate Social Network Interaction Project Panel Study

Notes: The values presented in this table are means calculated from five imputed data
sets. The minimums of some variables are negative because a range prior was not
specified for ordinal variables.


Participation in Voluntary Civic Organizations 


“How active were you in the following types of organizations [during high school/during
your first year here at the University of Wisconsin/at the University of Wisconsin
this year]: very active, somewhat active, not very active, or not at all active?”

• Student government (for example, student council/ASM,¹¹ etc.)

• Partisan political groups (for example, Young/College Republicans or
Democrats, etc.)

• Organizations that take stands on political issues or current events (for
example, a group interested in protecting the environment, etc.)

• Charitable or voluntary service organizations (that is, working in some
way to help others without pay and not for course credit)

• Leadership training or civic organizations (for example, community service
organizations, etc.)

• Student publications (for example, yearbook, newspaper, etc.)

• Forensics, debate, or other speech clubs or teams


Civic Talk 
“When you talk with your roommate, how often do you discuss politics and 
current events: often, sometimes, rarely, or never?” 


The Collegiate Social Network Interaction Project Panel Survey (C-SNIP) 


The population surveyed was all 4,358 first-year students at the University of 
Wisconsin-Madison living in university housing during the 2003-2004 academic 
year (82% of the 5,322 first year students who entered the University in 2003). 

Study participants initially completed two questionnaires over the internet during
the 2003-2004 academic year: one at the beginning of the year (October-
November, 2003), and a second at the end of the year (March-April, 2004). A
third questionnaire was administered April-May of 2007. During each wave of 
the study, three attempts were made by email to recruit the population to fill out 
a questionnaire. Email addresses were obtained from the University of Wisconsin- 
Madison Office of the Registrar and from publicly accessible student directories. 
Unique login names and passwords were assigned to each respondent to prevent 
subjects from completing more than one questionnaire. To increase participa- 
tion from a broad cross-section of the population under study, each student who 
completed a questionnaire was entered into a prize drawing for 1 of 50 $20 prizes. 
The recruitment emails also were worded to make the prospect of participating in
the study appealing to a wide audience. 

In total, 23 percent of the population fully completed the first two question- 
naires in 2003-2004 (N = 999). Just under 24 percent of the eligible population 
completed at least some portion of both questionnaires (N = 1,044). Of the 1,044
students who at least partially completed both questionnaires in 2003-2004, 53 
percent of subjects (N = 557) fully completed, and more than 57 percent of 
subjects (N = 598) at least partially completed, the 2007 questionnaire. These 
response rate figures exclude subjects who were eliminated from the analysis to 
reduce bias: subjects who moved from the dormitory room they were initially 
assigned to, subjects who chose their own roommate, and subjects who had no 
roommates. To account for missing data, the data set was preprocessed using the
Amelia II multiple imputation package for R (Honaker, King and Blackwell 2007;
see also King et al. 2001). The data were imputed five times. To aid the imputation
process, the tolerance level was set to .001, and a ridge prior of 5 percent of the
cases in the dataset was used. All dichotomous variables were imputed using the
nominal transformation; no other transformations were used.
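
With five imputed data sets, each quantity of interest is estimated five times and the results are then combined. The article reports that the descriptive statistics are means across the five data sets; the sketch below additionally shows the standard combining rule for variances (Rubin's rules), which the article does not describe and which is included here only as an assumption about how such estimates are usually pooled. The function name and the example numbers are hypothetical.

```python
def pool_imputations(estimates, variances):
    """Combine one estimate and its variance across m imputed data sets.

    Returns (pooled_estimate, total_variance), where the total variance is the
    average within-imputation variance plus an inflated between-imputation
    variance: W + (1 + 1/m) * B.
    """
    m = len(estimates)
    pooled = sum(estimates) / m
    within = sum(variances) / m
    between = sum((e - pooled) ** 2 for e in estimates) / (m - 1)
    total = within + (1 + 1 / m) * between
    return pooled, total


# Hypothetical estimates of one coefficient from five imputed data sets.
pooled_estimate, total_variance = pool_imputations(
    [1.0, 2.0, 3.0, 4.0, 5.0], [1.0, 1.0, 1.0, 1.0, 1.0]
)
```

The between-imputation term grows when the five data sets disagree, so uncertainty due to the missing values is carried into the final standard error.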
Table A2: Characteristics of Respondents and Non-Respondents 

Note: > or < indicates a significant difference of means at p <= .10, = indicates an insignificant difference of means at p < .10 (two-tailed t-tests) 


While imputation compensates for 
missing data, it is still important to ad- 
dress the issue of response rate. While 
the C-SNIP study recruitment proce- 
dures were designed to attract a broad 
cross-section of participants, it could be 
the case that certain types of individuals, 
say those who are more interested in the 
subject of the study, choose to partici- 
pate at higher rates. For example, after 
participating in the first survey, each 
respondent knew that subsequent waves 
of the study would address politics and 
current events. This might prompt in- 
dividuals who are more interested and 
active in these matters to complete the 
study. If this is the case, the civic talk 
effects could be inflated. 

Table A2 offers a test of this propo- 
sition by examining the characteristics 
of respondents and non-respondents in 
each of the three waves of the C-SNIP 
panel study. The top portion of the table 
examines the demographic characteris- 
tics of respondents and non-respondents. 
Fortunately these three measures were 
available for the entire population that 
was surveyed, and as such they can be 
used to assess response bias in all three 
waves of the study. These data show that 
when compared to non-respondents, re- 
spondents scored higher on their ACT 
college entrance exams, were more 
likely to be female, and were less likely 
to belong to a racial or ethnic minority 
group. However, while these differences 
are statistically significant, in the case of 
ACT score and race the substantive differences 
between respondents and non-respondents 
are small. Moreover, all three of these demographic characteristics 
were included in the matching data pre-processing procedure. Consequently, any 
differences between respondents and non-respondents on these variables are au- 
tomatically accounted for in the analysis. The bottom portion of Table A2 shows 
survey responses provided by respondents in previous waves of the C-SNIP study 
to gauge differences between respondents and non-respondents in subsequent 
waves of the study. No differences are found between these two subsets of the 
population in either Wave 2 or Wave 3 of the study. 


Matching Procedure 


For this analysis a “full matching” procedure was used (Gu and Rosenbaum 
1993; Hansen 2004; Ho, King and Stuart 2007a; Ho, King and Stuart 2007b; 
Rosenbaum 1991; Stuart and Green 2008). The procedure was conducted using 
the “MatchIt” package for R (Ho, Imai, King and Stuart 2007a; Ho, Imai, King 
and Stuart 2007b), which makes use of the “optmatch” package (Hansen 2004). 
The C-SNIP panel data set is tailor-made for matching because subjects were 
surveyed about their characteristics before they engaged in civic talk with their 
college roommate (i.e., before they were exposed to the civic talk “treatment”). In 
total, 109 pre-treatment variables were used in the matching procedure. Matching 
on a large number of pre-treatment covariates increases the validity of the final 
analysis because it is more likely that all covariates of assignment to treatment are 
accounted for (Ho et al. 2007b). This set of variables included measures of civic 
participation in high school, measures of why each student ranked the dormitories 
before being placed, indicators of which dorm each subject was eventually placed 
into, pre-treatment information on the subject's roommate and dormitory, demo- 
graphics, measures of home life before coming to college, and civically-relevant 
attitudes and characteristics. 

The full matching procedure involves three steps. First, subjects who engaged 
in an above-average amount of civic talk with their roommate were classified 
as having been treated (N = 495), while those who engaged in a below-average 
amount of civic talk were classified as untreated (N = 549). Second, the variables 
included in the matching procedure were used to estimate a score of one’s pro- 
pensity to engage in civic talk (Hansen 2004; Ho, King and Stuart 2007a; Ho, 
King and Stuart 2007b). Third, at least one untreated subject was matched to 
at least one treated case based on how close the propensity scores were between 
treated and untreated cases (i.e., a process of creating “subclasses” where more 
than one treated subject could be matched to an untreated subject and vice- 
versa). Each untreated case was only matched to one treated case, and vice-versa 
(i.e., matching without replacement). Also, after a case was initially matched it 
could have been moved and matched to a different case in order to improve the 
overall similarity between treated and untreated subjects in the data set (i.e., the 
process is “optimal” not “greedy”). 
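
The first of the three steps, dividing subjects at the mean level of civic talk, can be illustrated with a minimal Python sketch. The subject identifiers and scores are hypothetical, and the handling of a subject who falls exactly at the mean (coded here as untreated) is an assumption, since the text does not address that case. The propensity score estimation and optimal full matching of the second and third steps were carried out with MatchIt and optmatch in R and are not reproduced here.

```python
from statistics import mean


def classify_by_mean(civic_talk):
    """Map each subject id to 1 (treated) or 0 (untreated).

    A subject is coded as treated when his or her civic talk score is
    above the sample mean. Scores exactly at the mean are coded as
    untreated, which is an assumption made for this sketch.
    """
    average = mean(civic_talk.values())
    return {subject: int(score > average) for subject, score in civic_talk.items()}


# Hypothetical civic talk scores for four subjects.
scores = {"s1": 1.0, "s2": 2.0, "s3": 3.0, "s4": 6.0}
treatment = classify_by_mean(scores)
print(treatment)
```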
Table A3: Percent Improvement in Balance Between Treated and Untreated Cases 

                                  Percent Improvement 
Overall                           99.70 
QQ Plot Summary Statistics 
  Median                          94.10 
  Mean                            93.42 
  Max                             89.53 

Source: Collegiate Social Network Interaction Project Panel Study 
Note: For the purposes of standardization, the overall balance measure is measured in 
standard deviations. 


The results of the matching procedure were incorporated into the analysis by 
weighting the regression models. All treated cases were given a weight of 1, while 
untreated cases were assigned a weight equal to the number of treated cases in the 
subclass that they were assigned to, divided by the number of untreated cases in 
the subclass that they were assigned to. For example, an untreated case that was 
assigned to a subclass with 10 treated cases and 1 untreated case was assigned a 
weight of 10, while an untreated case that was assigned to a subclass with 1 treated 
case and 10 untreated cases was assigned a weight of .10. Applying this weight 
caused the regression model to pay more attention to untreated cases that are 
similar to treated cases, and less attention to untreated cases that are dissimilar to 
treated cases, making the analysis a better comparison between the treated and 
untreated cases than if the data were not weighted. 
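
The weighting rule described above can be written out as a short Python sketch. It implements only the rule as stated in the text (a weight of 1 for treated cases, and the subclass ratio of treated to untreated cases for untreated cases); the function name and the subclass labels are hypothetical, and any additional rescaling that a matching package might apply is not shown.

```python
from collections import Counter


def matching_weights(cases):
    """Compute regression weights from subclass membership.

    cases is a list of (subclass_id, treated) pairs, where treated is
    True for treated cases and False for untreated cases. Treated cases
    receive a weight of 1; untreated cases receive the number of treated
    cases in their subclass divided by the number of untreated cases in
    that subclass.
    """
    n_treated = Counter()
    n_untreated = Counter()
    for subclass, treated in cases:
        if treated:
            n_treated[subclass] += 1
        else:
            n_untreated[subclass] += 1

    weights = []
    for subclass, treated in cases:
        if treated:
            weights.append(1.0)
        else:
            weights.append(n_treated[subclass] / n_untreated[subclass])
    return weights


# The two examples from the text, using hypothetical subclass labels:
# subclass "A" holds 10 treated cases and 1 untreated case,
# subclass "B" holds 1 treated case and 10 untreated cases.
cases = (
    [("A", True)] * 10 + [("A", False)]
    + [("B", True)] + [("B", False)] * 10
)
weights = matching_weights(cases)
weight_untreated_a = weights[10]  # the single untreated case in subclass "A"
weight_untreated_b = weights[-1]  # one of the untreated cases in subclass "B"
print(weight_untreated_a, weight_untreated_b)
```

The two untreated weights correspond to the values of 10 and .10 given in the examples above.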

The results presented in Table A3 illustrate how the matching procedure in- 
creased the similarity, or “balance” (Ho et al. 2007a and 2007b), between subjects 
who did and did not engage in civic talk. The first row in the table shows the over- 
all improvement in similarity between treated and untreated subjects, as measured 
by the subject’s estimated propensity to engage in civic talk (i.e., the propensity 
score created by the matching procedure). Overall, the similarity in the propensity 
to engage in civic talk between subjects who did and did not engage in civic talk 
with their roommates increased by nearly 100 percent as a result of matching. The 
remaining rows of the table show the summary statistics from “QQ plots.” QQ 
plots are two-dimensional graphs which plot the empirical distribution of a vari- 
able among treated subjects on one axis against the empirical distribution of that 
same variable among untreated subjects on the other axis. The closer this plotted 
line is to the 45-degree line on the graph, the closer treated and untreated subjects 
are to being perfectly balanced on that variable. The results in Table A3 show that 
the median, mean and maximum distance of the propensity score QQ plot from 
the 45-degree line were all greatly improved due to the matching procedure. 
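
To make the QQ plot summary statistics concrete, the Python sketch below compares the quantiles of a variable among treated and untreated cases and reports the median, mean and maximum gap, along with the percent improvement between two sets of gaps. It is a simplified illustration rather than the computation MatchIt performs: it ignores the matching weights, the propensity scores are hypothetical, and the percent improvement formula (the reduction in distance as a share of the pre-matching distance) is an assumption about how such a figure is typically defined.

```python
from statistics import mean, median, quantiles


def qq_distances(treated, untreated, n=100):
    """Absolute gaps between corresponding quantiles of the two groups.

    A gap of zero at every quantile means the QQ plot lies exactly on
    the 45-degree line, i.e., the two distributions coincide.
    """
    q_treated = quantiles(treated, n=n, method="inclusive")
    q_untreated = quantiles(untreated, n=n, method="inclusive")
    return [abs(a - b) for a, b in zip(q_treated, q_untreated)]


def qq_summary(treated, untreated):
    """Median, mean and maximum quantile gap."""
    gaps = qq_distances(treated, untreated)
    return {"median": median(gaps), "mean": mean(gaps), "max": max(gaps)}


def percent_improvement(before, after):
    """Reduction in distance as a percentage of the original distance."""
    return 100.0 * (before - after) / before


# Hypothetical propensity scores before and after matching.
treated_scores = [0.5, 0.6, 0.7, 0.8, 0.9]
untreated_before = [0.1, 0.2, 0.3, 0.4, 0.5]
untreated_after = [0.45, 0.55, 0.65, 0.75, 0.85]

before = qq_summary(treated_scores, untreated_before)
after = qq_summary(treated_scores, untreated_after)
improvement = {
    stat: percent_improvement(before[stat], after[stat]) for stat in before
}
print(improvement)
```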